Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 2 of 2 for builtBy (0.03 sec)

  1. README.md

    ## Overview
    
    **Fess Crawler** is a powerful, flexible Java-based web crawling framework designed for enterprise-scale content extraction and processing. Built with a modular architecture, it supports multiple protocols (HTTP/HTTPS, File System, FTP, SMB, Cloud Storage) and provides extensive content extraction capabilities from various document formats.
    
    ### Key Features
    
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  2. fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml

      <mime-type type="application/macwriteii"/>
    
      <mime-type type="application/marc">
        <!-- todo add marc xml <marc:collection> -->
        <glob pattern="*.mrc"/>
        <magic priority="50">
          <!-- built from, e.g. https://www.loc.gov/marc/community/cileader.html -->
          <match value="[0-9]{5,5}" type="regex" offset="0">
            <match value="45" type="string" offset="20">
              <!-- bibliographic -->
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Mar 13 08:18:01 UTC 2025
    - 320.1K bytes
    - Viewed (2)
Back to top