Results 1 - 7 of 7 for jlha (0.03 seconds)
fess-crawler/pom.xml
<artifactId>servlet-api</artifactId> </exclusion> </exclusions> </dependency> <dependency> <groupId>jp.gr.java_conf.dangan</groupId> <artifactId>jlha</artifactId> <version>${jlha.version}</version> </dependency> <dependency> <groupId>org.jodconverter</groupId> <artifactId>jodconverter-local</artifactId> <version>${jodconverter.version}</version> </dependency>
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Sun Mar 29 01:35:48 GMT 2026 - 12.5K bytes
src/test/java/org/codelibs/fess/crawler/rule/CrawlerRuleMimeTypePatternTest.java
+ "|application/rdf\\+xml" // + "|application/pdf" // + "|application/x-freemind" // + "|application/lha" // + "|application/x-lha" // + "|application/x-lha-compressed" // + "|text/xml" // + "|text/xml-external-parsed-entity" // + "|text/html)"; // HTML rule pattern from webHtmlRule in rule.xml
Created: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Wed Feb 04 14:24:39 GMT 2026 - 8.7K bytes
src/main/resources/crawler/rule.xml
<!-- Supported MIME type --> <arg> "(application/xml" + "|application/xhtml\+xml" + "|application/rdf\+xml" + "|application/pdf" + "|application/x-freemind" + "|application/lha" + "|application/x-lha" + "|application/x-lha-compressed" + "|text/xml" + "|text/xml-external-parsed-entity" + "|text/html)" </arg> </postConstruct> </component>
Created: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Sun Mar 29 08:21:02 GMT 2026 - 4.6K bytes
docs/pt/docs/tutorial/schema-extra-example.md
Created: Sun Apr 05 07:19:11 GMT 2026 - Last Modified: Thu Mar 19 18:20:43 GMT 2026 - 9.5K bytes
CLAUDE.md
### Protocols HTTP/HTTPS, File, FTP/FTPS, SMB/CIFS (SMB1/SMB2+), Storage (MinIO via `storage://`), S3 (`s3://`), GCS (`gcs://`) ### Content Formats Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ, LHA), HTML, XML, JSON, Markdown, Media metadata, Images (EXIF/IPTC/XMP), Email (EML) --- ## Architecture ### Module Structure ``` fess-crawler-parent/ ├── fess-crawler/ # Core framework
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Thu Mar 12 03:39:20 GMT 2026 - 8.1K bytes
fess-crawler-lasta/src/main/resources/crawler/extractor.xml
<postConstruct name="addExtractor"> <arg>[ "application/pdf" ]</arg> <arg>pdfExtractor</arg> </postConstruct> <postConstruct name="addExtractor"> <arg>[ "application/x-lha", "application/x-lharc" ]</arg> <arg>lhaExtractor</arg> </postConstruct> <postConstruct name="addExtractor"> <arg>[ "message/rfc822" ]</arg> <arg>emlExtractor</arg>
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Wed Feb 11 01:15:55 GMT 2026 - 50.4K bytes
RELEASE.md
## Thanks to our Contributors This release contains contributions from many people at Google, as well as:
Created: Tue Apr 07 12:39:13 GMT 2026 - Last Modified: Mon Mar 30 18:31:38 GMT 2026 - 746.5K bytes