- Sort Score
- Num 10 results
- Language All
Results 1 - 6 of 6 for LHA (0.02 seconds)
-
src/test/java/org/codelibs/fess/crawler/rule/CrawlerRuleMimeTypePatternTest.java
+ "|application/rdf\\+xml" // + "|application/pdf" // + "|application/x-freemind" // + "|application/lha" // + "|application/x-lha" // + "|application/x-lha-compressed" // + "|text/xml" // + "|text/xml-external-parsed-entity" // + "|text/html)"; // HTML rule pattern from webHtmlRule in rule.xmlCreated: 2026-03-31 13:07 - Last Modified: 2026-02-04 14:24 - 8.7K bytes - Click Count (0) -
src/main/resources/crawler/rule.xml
<!-- Supported MIME type --> <arg> "(application/xml" + "|application/xhtml\+xml" + "|application/rdf\+xml" + "|application/pdf" + "|application/x-freemind" + "|application/lha" + "|application/x-lha" + "|application/x-lha-compressed" + "|text/xml" + "|text/xml-external-parsed-entity" + "|text/html)" </arg> </postConstruct> </component>
Created: 2026-03-31 13:07 - Last Modified: 2026-03-29 08:21 - 4.6K bytes - Click Count (0) -
CLAUDE.md
### Protocols HTTP/HTTPS, File, FTP/FTPS, SMB/CIFS (SMB1/SMB2+), Storage (MinIO via `storage://`), S3 (`s3://`), GCS (`gcs://`) ### Content Formats Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ, LHA), HTML, XML, JSON, Markdown, Media metadata, Images (EXIF/IPTC/XMP), Email (EML) --- ## Architecture ### Module Structure ``` fess-crawler-parent/ ├── fess-crawler/ # Core framework
Created: 2026-04-12 03:50 - Last Modified: 2026-03-12 03:39 - 8.1K bytes - Click Count (0) -
README.md
#### PDFs and Images - PDF documents (text and metadata extraction) - Images (JPEG, PNG, GIF, TIFF, BMP) - Image metadata (EXIF, IPTC, XMP) #### Archives and Compressed Files - ZIP, TAR, GZ archives - LHA compression format - Nested archive extraction #### Web and Markup - HTML, XHTML with XPath support - XML documents - JSON and structured data #### Media Files - Audio formats (MP3, WAV, FLAC)
Created: 2026-04-12 03:50 - Last Modified: 2025-08-31 05:32 - 15.3K bytes - Click Count (0) -
fess-crawler-lasta/src/main/resources/crawler/extractor.xml
<postConstruct name="addExtractor"> <arg>[ "application/pdf" ]</arg> <arg>pdfExtractor</arg> </postConstruct> <postConstruct name="addExtractor"> <arg>[ "application/x-lha", "application/x-lharc" ]</arg> <arg>lhaExtractor</arg> </postConstruct> <postConstruct name="addExtractor"> <arg>[ "message/rfc822" ]</arg> <arg>emlExtractor</arg>
Created: 2026-04-12 03:50 - Last Modified: 2026-02-11 01:15 - 50.4K bytes - Click Count (0) -
okhttp-idna-mapping-table/src/main/resources/okhttp3/internal/idna/IdnaMappingTable.txt
0513 ; valid # 5.0 CYRILLIC SMALL LETTER EL WITH HOOK 0514 ; mapped ; 0515 # 5.1 CYRILLIC CAPITAL LETTER LHA 0515 ; valid # 5.1 CYRILLIC SMALL LETTER LHA 0516 ; mapped ; 0517 # 5.1 CYRILLIC CAPITAL LETTER RHA 0517 ; valid # 5.1 CYRILLIC SMALL LETTER RHA
Created: 2026-04-03 11:42 - Last Modified: 2024-02-10 11:25 - 854.1K bytes - Click Count (0)