Results 1 - 7 of 7 for jlha (0.03 seconds)

  1. fess-crawler/pom.xml

    					<artifactId>servlet-api</artifactId>
    				</exclusion>
    			</exclusions>
    		</dependency>
    		<dependency>
    			<groupId>jp.gr.java_conf.dangan</groupId>
    			<artifactId>jlha</artifactId>
    			<version>${jlha.version}</version>
    		</dependency>
    		<dependency>
    			<groupId>org.jodconverter</groupId>
    			<artifactId>jodconverter-local</artifactId>
    			<version>${jodconverter.version}</version>
    		</dependency>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Sun Mar 29 01:35:48 GMT 2026
    - 12.5K bytes
    - Click Count (0)
  2. src/test/java/org/codelibs/fess/crawler/rule/CrawlerRuleMimeTypePatternTest.java

                + "|application/rdf\\+xml" //
                + "|application/pdf" //
                + "|application/x-freemind" //
                + "|application/lha" //
                + "|application/x-lha" //
                + "|application/x-lha-compressed" //
                + "|text/xml" //
                + "|text/xml-external-parsed-entity" //
                + "|text/html)";
    
        // HTML rule pattern from webHtmlRule in rule.xml
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Wed Feb 04 14:24:39 GMT 2026
    - 8.7K bytes
    - Click Count (0)
  3. src/main/resources/crawler/rule.xml

    			<!-- Supported MIME type -->
    			<arg>
      "(application/xml"
    + "|application/xhtml\+xml"
    + "|application/rdf\+xml"
    + "|application/pdf"
    + "|application/x-freemind"
    + "|application/lha"
    + "|application/x-lha"
    + "|application/x-lha-compressed"
    + "|text/xml"
    + "|text/xml-external-parsed-entity"
    + "|text/html)"
    			</arg>
    		</postConstruct>
    	</component>
    
    
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Sun Mar 29 08:21:02 GMT 2026
    - 4.6K bytes
    - Click Count (0)
  4. docs/pt/docs/tutorial/schema-extra-example.md

    ### Summary { #summary }
    
    I used to say I didn't care much for history... and look at me now giving "tech history" lessons. 😅
    
    Created: Sun Apr 05 07:19:11 GMT 2026
    - Last Modified: Thu Mar 19 18:20:43 GMT 2026
    - 9.5K bytes
    - Click Count (0)
  5. CLAUDE.md

    ### Protocols
    
    HTTP/HTTPS, File, FTP/FTPS, SMB/CIFS (SMB1/SMB2+), Storage (MinIO via `storage://`), S3 (`s3://`), GCS (`gcs://`)
    
    ### Content Formats
    
    Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ, LHA), HTML, XML, JSON, Markdown, Media metadata, Images (EXIF/IPTC/XMP), Email (EML)
    
    ---
    
    ## Architecture
    
    ### Module Structure
    
    ```
    fess-crawler-parent/
    ├── fess-crawler/              # Core framework
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Thu Mar 12 03:39:20 GMT 2026
    - 8.1K bytes
    - Click Count (0)
  6. fess-crawler-lasta/src/main/resources/crawler/extractor.xml

    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/pdf"
    				]</arg>
    			<arg>pdfExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/x-lha",
    				"application/x-lharc"
    				]</arg>
    			<arg>lhaExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"message/rfc822"
    				]</arg>
    			<arg>emlExtractor</arg>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Wed Feb 11 01:15:55 GMT 2026
    - 50.4K bytes
    - Click Count (0)
  7. RELEASE.md

    ## Thanks to our Contributors
    
    This release contains contributions from many people at Google, as well as:
    
    Created: Tue Apr 07 12:39:13 GMT 2026
    - Last Modified: Mon Mar 30 18:31:38 GMT 2026
    - 746.5K bytes
    - Click Count (3)
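
The MIME type alternation shown in results 2 and 3 can be exercised in isolation. The following is a minimal sketch, not the crawler's actual rule code: the class name `MimeTypePatternSketch` and the helper `isSupported` are hypothetical, and it assumes the pattern is applied as a full match against the MIME type string, which the snippets do not confirm.

```java
import java.util.regex.Pattern;

public class MimeTypePatternSketch {
    // Alternation copied from the rule.xml snippet in result 3.
    // Applying it with matches() (full match) is an assumption of this sketch.
    static final Pattern SUPPORTED = Pattern.compile("(application/xml" //
            + "|application/xhtml\\+xml" //
            + "|application/rdf\\+xml" //
            + "|application/pdf" //
            + "|application/x-freemind" //
            + "|application/lha" //
            + "|application/x-lha" //
            + "|application/x-lha-compressed" //
            + "|text/xml" //
            + "|text/xml-external-parsed-entity" //
            + "|text/html)");

    // Hypothetical helper: true when the whole MIME type equals one alternative.
    static boolean isSupported(String mimeType) {
        return SUPPORTED.matcher(mimeType).matches();
    }

    public static void main(String[] args) {
        String[] samples = { "application/x-lha", "application/x-lharc", "text/plain" };
        for (String mimeType : samples) {
            System.out.println(mimeType + " -> " + isSupported(mimeType));
        }
    }
}
```

Under the full-match assumption, `application/x-lharc` (listed for `lhaExtractor` in result 6) is not accepted by this particular pattern, because none of the alternatives equals it.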