Search Options

Display Count
Sort
Preferred Language
Advanced Search

Results 1 - 4 of 4 for jlha (0.03 seconds)

The search processing time has exceeded the limit. The displayed results may be partial.

  1. fess-crawler/pom.xml

    					<artifactId>servlet-api</artifactId>
    				</exclusion>
    			</exclusions>
    		</dependency>
    		<dependency>
    			<groupId>jp.gr.java_conf.dangan</groupId>
    			<artifactId>jlha</artifactId>
    			<version>${jlha.version}</version>
    		</dependency>
    		<dependency>
    			<groupId>org.jodconverter</groupId>
    			<artifactId>jodconverter-local</artifactId>
    			<version>${jodconverter.version}</version>
    		</dependency>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Sun Mar 29 01:35:48 GMT 2026
    - 12.5K bytes
    - Click Count (0)
  2. src/test/java/org/codelibs/fess/crawler/rule/CrawlerRuleMimeTypePatternTest.java

                + "|application/rdf\\+xml" //
                + "|application/pdf" //
                + "|application/x-freemind" //
                + "|application/lha" //
                + "|application/x-lha" //
                + "|application/x-lha-compressed" //
                + "|text/xml" //
                + "|text/xml-external-parsed-entity" //
                + "|text/html)";
    
        // HTML rule pattern from webHtmlRule in rule.xml
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Wed Feb 04 14:24:39 GMT 2026
    - 8.7K bytes
    - Click Count (0)
  3. CLAUDE.md

    ### Protocols
    
    HTTP/HTTPS, File, FTP/FTPS, SMB/CIFS (SMB1/SMB2+), Storage (MinIO via `storage://`), S3 (`s3://`), GCS (`gcs://`)
    
    ### Content Formats
    
    Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ, LHA), HTML, XML, JSON, Markdown, Media metadata, Images (EXIF/IPTC/XMP), Email (EML)
    
    ---
    
    ## Architecture
    
    ### Module Structure
    
    ```
    fess-crawler-parent/
    ├── fess-crawler/              # Core framework
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Thu Mar 12 03:39:20 GMT 2026
    - 8.1K bytes
    - Click Count (0)
  4. fess-crawler-lasta/src/main/resources/crawler/extractor.xml

    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/pdf"
    				]</arg>
    			<arg>pdfExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/x-lha",
    				"application/x-lharc"
    				]</arg>
    			<arg>lhaExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"message/rfc822"
    				]</arg>
    			<arg>emlExtractor</arg>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Wed Feb 11 01:15:55 GMT 2026
    - 50.4K bytes
    - Click Count (0)
Back to Top