Search Options

Display Count
Sort
Preferred Language
Advanced Search

Results 1 - 4 of 4 for LHA (0.03 seconds)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/LhaExtractor.java

    import org.codelibs.fess.crawler.util.IgnoreCloseInputStream;
    
    import jp.gr.java_conf.dangan.util.lha.LhaFile;
    import jp.gr.java_conf.dangan.util.lha.LhaHeader;
    
    /**
     * Extractor implementation for LHA (LZH) archive files.
     * This extractor can extract text content from files within LHA archives
     * by using appropriate extractors for each contained file type.
     *
     * @author shinsuke
     */
    Created: Sat Dec 20 11:21:39 GMT 2025
    - Last Modified: Sun Nov 23 12:19:14 GMT 2025
    - 5.9K bytes
    - Click Count (0)
  2. README.md

    #### PDFs and Images
    - PDF documents (text and metadata extraction)
    - Images (JPEG, PNG, GIF, TIFF, BMP)
    - Image metadata (EXIF, IPTC, XMP)
    
    #### Archives and Compressed Files
    - ZIP, TAR, GZ archives
    - LHA compression format
    - Nested archive extraction
    
    #### Web and Markup
    - HTML, XHTML with XPath support
    - XML documents
    - JSON and structured data
    
    #### Media Files
    - Audio formats (MP3, WAV, FLAC)
    Created: Sat Dec 20 11:21:39 GMT 2025
    - Last Modified: Sun Aug 31 05:32:52 GMT 2025
    - 15.3K bytes
    - Click Count (0)
  3. fess-crawler-lasta/src/main/resources/crawler/extractor.xml

    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/pdf"
    				]</arg>
    			<arg>pdfExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/x-lha",
    				"application/x-lharc"
    				]</arg>
    			<arg>lhaExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"message/rfc822"
    				]</arg>
    			<arg>emlExtractor</arg>
    Created: Sat Dec 20 11:21:39 GMT 2025
    - Last Modified: Sun Nov 23 03:46:53 GMT 2025
    - 50.1K bytes
    - Click Count (0)
  4. fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml

          <match value="\377\037" type="string" offset="0"/>
          <match value="0145405" type="host16" offset="0"/>
        </magic>
        <glob pattern="*.bin"/>
        <glob pattern="*.dms"/>
        <glob pattern="*.lha"/>
        <glob pattern="*.lrf"/>
        <glob pattern="*.lzh"/>
        <glob pattern="*.so"/>
        <glob pattern="*.dist"/>
        <glob pattern="*.distz"/>
        <glob pattern="*.pkg"/>
        <glob pattern="*.bpk"/>
    Created: Sat Dec 20 11:21:39 GMT 2025
    - Last Modified: Thu Oct 16 07:46:32 GMT 2025
    - 320.2K bytes
    - Click Count (5)
Back to Top