- Sort Score
- Result 10 results
- Languages All
Results 1 - 5 of 5 for lha (0.01 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/LhaExtractor.java
import org.codelibs.fess.crawler.util.IgnoreCloseInputStream; import jp.gr.java_conf.dangan.util.lha.LhaFile; import jp.gr.java_conf.dangan.util.lha.LhaHeader; /** * Extractor implementation for LHA (LZH) archive files. * This extractor can extract text content from files within LHA archives * by using appropriate extractors for each contained file type. * * @author shinsuke */
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 12:19:14 UTC 2025 - 5.9K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/LhaExtractorTest.java
factory.addExtractor("application/x-lha", lhaExtractor); })// ; lhaExtractor = container.getComponent("lhaExtractor"); } public void test_getText() { final InputStream in = ResourceUtil.getResourceAsStream("extractor/lha/test.lzh"); final String content = lhaExtractor.getText(in, null).getContent();
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 3.4K bytes - Viewed (0) -
README.md
#### PDFs and Images - PDF documents (text and metadata extraction) - Images (JPEG, PNG, GIF, TIFF, BMP) - Image metadata (EXIF, IPTC, XMP) #### Archives and Compressed Files - ZIP, TAR, GZ archives - LHA compression format - Nested archive extraction #### Web and Markup - HTML, XHTML with XPath support - XML documents - JSON and structured data #### Media Files - Audio formats (MP3, WAV, FLAC)
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler-lasta/src/main/resources/crawler/extractor.xml
<postConstruct name="addExtractor"> <arg>[ "application/pdf" ]</arg> <arg>pdfExtractor</arg> </postConstruct> <postConstruct name="addExtractor"> <arg>[ "application/x-lha", "application/x-lharc" ]</arg> <arg>lhaExtractor</arg> </postConstruct> <postConstruct name="addExtractor"> <arg>[ "message/rfc822" ]</arg> <arg>emlExtractor</arg>
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 03:46:53 UTC 2025 - 50.1K bytes - Viewed (0) -
fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml
<match value="\377\037" type="string" offset="0"/> <match value="0145405" type="host16" offset="0"/> </magic> <glob pattern="*.bin"/> <glob pattern="*.dms"/> <glob pattern="*.lha"/> <glob pattern="*.lrf"/> <glob pattern="*.lzh"/> <glob pattern="*.so"/> <glob pattern="*.dist"/> <glob pattern="*.distz"/> <glob pattern="*.pkg"/> <glob pattern="*.bpk"/>
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Oct 16 07:46:32 UTC 2025 - 320.2K bytes - Viewed (5)