- Sort Score
- Num 10 results
- Language All
Results 1 - 4 of 4 for jlha (0.01 seconds)
-
src/test/java/org/codelibs/fess/crawler/rule/CrawlerRuleMimeTypePatternTest.java
+ "|application/rdf\\+xml" // + "|application/pdf" // + "|application/x-freemind" // + "|application/lha" // + "|application/x-lha" // + "|application/x-lha-compressed" // + "|text/xml" // + "|text/xml-external-parsed-entity" // + "|text/html)"; // HTML rule pattern from webHtmlRule in rule.xmlCreated: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Wed Feb 04 14:24:39 GMT 2026 - 8.7K bytes - Click Count (0) -
src/main/resources/crawler/rule.xml
<!-- Supported MIME type --> <arg> "(application/xml" + "|application/xhtml\+xml" + "|application/rdf\+xml" + "|application/pdf" + "|application/x-freemind" + "|application/lha" + "|application/x-lha" + "|application/x-lha-compressed" + "|text/xml" + "|text/xml-external-parsed-entity" + "|text/html)" </arg> </postConstruct> </component>
Created: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Sun Mar 29 08:21:02 GMT 2026 - 4.6K bytes - Click Count (0) -
docs/pt/docs/tutorial/schema-extra-example.md
Created: Sun Apr 05 07:19:11 GMT 2026 - Last Modified: Thu Mar 19 18:20:43 GMT 2026 - 9.5K bytes - Click Count (0) -
CLAUDE.md
### Protocols HTTP/HTTPS, File, FTP/FTPS, SMB/CIFS (SMB1/SMB2+), Storage (MinIO via `storage://`), S3 (`s3://`), GCS (`gcs://`) ### Content Formats Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ, LHA), HTML, XML, JSON, Markdown, Media metadata, Images (EXIF/IPTC/XMP), Email (EML) --- ## Architecture ### Module Structure ``` fess-crawler-parent/ ├── fess-crawler/ # Core framework
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Thu Mar 12 03:39:20 GMT 2026 - 8.1K bytes - Click Count (0)