Search Options

Display Count
Sort
Preferred Language
Advanced Search

Results 1 - 6 of 6 for LHA (0.02 seconds)

  1. src/test/java/org/codelibs/fess/crawler/rule/CrawlerRuleMimeTypePatternTest.java

                + "|application/rdf\\+xml" //
                + "|application/pdf" //
                + "|application/x-freemind" //
                + "|application/lha" //
                + "|application/x-lha" //
                + "|application/x-lha-compressed" //
                + "|text/xml" //
                + "|text/xml-external-parsed-entity" //
                + "|text/html)";
    
        // HTML rule pattern from webHtmlRule in rule.xml
    Created: 2026-03-31 13:07
    - Last Modified: 2026-02-04 14:24
    - 8.7K bytes
    - Click Count (0)
  2. src/main/resources/crawler/rule.xml

    			<!-- Supported MIME type -->
    			<arg>
      "(application/xml"
    + "|application/xhtml\+xml"
    + "|application/rdf\+xml"
    + "|application/pdf"
    + "|application/x-freemind"
    + "|application/lha"
    + "|application/x-lha"
    + "|application/x-lha-compressed"
    + "|text/xml"
    + "|text/xml-external-parsed-entity"
    + "|text/html)"
    			</arg>
    		</postConstruct>
    	</component>
    
    
    Created: 2026-03-31 13:07
    - Last Modified: 2026-03-29 08:21
    - 4.6K bytes
    - Click Count (0)
  3. CLAUDE.md

    ### Protocols
    
    HTTP/HTTPS, File, FTP/FTPS, SMB/CIFS (SMB1/SMB2+), Storage (MinIO via `storage://`), S3 (`s3://`), GCS (`gcs://`)
    
    ### Content Formats
    
    Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ, LHA), HTML, XML, JSON, Markdown, Media metadata, Images (EXIF/IPTC/XMP), Email (EML)
    
    ---
    
    ## Architecture
    
    ### Module Structure
    
    ```
    fess-crawler-parent/
    ├── fess-crawler/              # Core framework
    Created: 2026-04-12 03:50
    - Last Modified: 2026-03-12 03:39
    - 8.1K bytes
    - Click Count (0)
  4. README.md

    #### PDFs and Images
    - PDF documents (text and metadata extraction)
    - Images (JPEG, PNG, GIF, TIFF, BMP)
    - Image metadata (EXIF, IPTC, XMP)
    
    #### Archives and Compressed Files
    - ZIP, TAR, GZ archives
    - LHA compression format
    - Nested archive extraction
    
    #### Web and Markup
    - HTML, XHTML with XPath support
    - XML documents
    - JSON and structured data
    
    #### Media Files
    - Audio formats (MP3, WAV, FLAC)
    Created: 2026-04-12 03:50
    - Last Modified: 2025-08-31 05:32
    - 15.3K bytes
    - Click Count (0)
  5. fess-crawler-lasta/src/main/resources/crawler/extractor.xml

    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/pdf"
    				]</arg>
    			<arg>pdfExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/x-lha",
    				"application/x-lharc"
    				]</arg>
    			<arg>lhaExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"message/rfc822"
    				]</arg>
    			<arg>emlExtractor</arg>
    Created: 2026-04-12 03:50
    - Last Modified: 2026-02-11 01:15
    - 50.4K bytes
    - Click Count (0)
  6. okhttp-idna-mapping-table/src/main/resources/okhttp3/internal/idna/IdnaMappingTable.txt

    0513          ; valid                                  # 5.0  CYRILLIC SMALL LETTER EL WITH HOOK
    0514          ; mapped                 ; 0515          # 5.1  CYRILLIC CAPITAL LETTER LHA
    0515          ; valid                                  # 5.1  CYRILLIC SMALL LETTER LHA
    0516          ; mapped                 ; 0517          # 5.1  CYRILLIC CAPITAL LETTER RHA
    0517          ; valid                                  # 5.1  CYRILLIC SMALL LETTER RHA
    Created: 2026-04-03 11:42
    - Last Modified: 2024-02-10 11:25
    - 854.1K bytes
    - Click Count (0)
Back to Top