Results 1 - 10 of 10 for jlha (0.04 seconds)

  1. fess-crawler/pom.xml

    					<artifactId>servlet-api</artifactId>
    				</exclusion>
    			</exclusions>
    		</dependency>
    		<dependency>
    			<groupId>jp.gr.java_conf.dangan</groupId>
    			<artifactId>jlha</artifactId>
    			<version>${jlha.version}</version>
    		</dependency>
    		<dependency>
    			<groupId>org.jodconverter</groupId>
    			<artifactId>jodconverter-local</artifactId>
    			<version>${jodconverter.version}</version>
    		</dependency>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Sun Mar 29 01:35:48 GMT 2026
    - 12.5K bytes
  2. src/test/java/org/codelibs/fess/crawler/rule/CrawlerRuleMimeTypePatternTest.java

                + "|application/rdf\\+xml" //
                + "|application/pdf" //
                + "|application/x-freemind" //
                + "|application/lha" //
                + "|application/x-lha" //
                + "|application/x-lha-compressed" //
                + "|text/xml" //
                + "|text/xml-external-parsed-entity" //
                + "|text/html)";
    
        // HTML rule pattern from webHtmlRule in rule.xml
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Wed Feb 04 14:24:39 GMT 2026
    - 8.7K bytes
  3. src/main/resources/crawler/rule.xml

    			<!-- Supported MIME type -->
    			<arg>
      "(application/xml"
    + "|application/xhtml\+xml"
    + "|application/rdf\+xml"
    + "|application/pdf"
    + "|application/x-freemind"
    + "|application/lha"
    + "|application/x-lha"
    + "|application/x-lha-compressed"
    + "|text/xml"
    + "|text/xml-external-parsed-entity"
    + "|text/html)"
    			</arg>
    		</postConstruct>
    	</component>
    
    
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Sun Mar 29 08:21:02 GMT 2026
    - 4.6K bytes
  4. android/guava/src/com/google/common/base/Throwables.java

                    requireNonNull(getStackTraceElementMethod), requireNonNull(jla), t, n);
          }
    
          @Override
          public int size() {
            return (Integer)
                invokeAccessibleNonThrowingMethod(
                    requireNonNull(getStackTraceDepthMethod), requireNonNull(jla), t);
          }
        };
      }
    
      @J2ktIncompatible
      @GwtIncompatible // java.lang.reflect
    Created: Fri Apr 03 12:43:13 GMT 2026
    - Last Modified: Thu Aug 07 16:05:33 GMT 2025
    - 20.6K bytes
  5. guava/src/com/google/common/base/Throwables.java

                    requireNonNull(getStackTraceElementMethod), requireNonNull(jla), t, n);
          }
    
          @Override
          public int size() {
            return (Integer)
                invokeAccessibleNonThrowingMethod(
                    requireNonNull(getStackTraceDepthMethod), requireNonNull(jla), t);
          }
        };
      }
    
      @J2ktIncompatible
      @GwtIncompatible // java.lang.reflect
    Created: Fri Apr 03 12:43:13 GMT 2026
    - Last Modified: Thu Aug 07 16:05:33 GMT 2025
    - 20.6K bytes
  6. docs/pt/docs/tutorial/schema-extra-example.md

    ### Summary { #summary }
    
    I used to say I didn't like history that much... and look at me now giving "tech history" lessons. 😅
    
    Created: Sun Apr 05 07:19:11 GMT 2026
    - Last Modified: Thu Mar 19 18:20:43 GMT 2026
    - 9.5K bytes
  7. CLAUDE.md

    ### Protocols
    
    HTTP/HTTPS, File, FTP/FTPS, SMB/CIFS (SMB1/SMB2+), Storage (MinIO via `storage://`), S3 (`s3://`), GCS (`gcs://`)
    
    ### Content Formats
    
    Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ, LHA), HTML, XML, JSON, Markdown, Media metadata, Images (EXIF/IPTC/XMP), Email (EML)
    
    ---
    
    ## Architecture
    
    ### Module Structure
    
    ```
    fess-crawler-parent/
    ├── fess-crawler/              # Core framework
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Thu Mar 12 03:39:20 GMT 2026
    - 8.1K bytes
  8. README.md

    #### PDFs and Images
    - PDF documents (text and metadata extraction)
    - Images (JPEG, PNG, GIF, TIFF, BMP)
    - Image metadata (EXIF, IPTC, XMP)
    
    #### Archives and Compressed Files
    - ZIP, TAR, GZ archives
    - LHA compression format
    - Nested archive extraction
    
    #### Web and Markup
    - HTML, XHTML with XPath support
    - XML documents
    - JSON and structured data
    
    #### Media Files
    - Audio formats (MP3, WAV, FLAC)
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Sun Aug 31 05:32:52 GMT 2025
    - 15.3K bytes
  9. fess-crawler-lasta/src/main/resources/crawler/extractor.xml

    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/pdf"
    				]</arg>
    			<arg>pdfExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"application/x-lha",
    				"application/x-lharc"
    				]</arg>
    			<arg>lhaExtractor</arg>
    		</postConstruct>
    		<postConstruct name="addExtractor">
    			<arg>[
    				"message/rfc822"
    				]</arg>
    			<arg>emlExtractor</arg>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Wed Feb 11 01:15:55 GMT 2026
    - 50.4K bytes
  10. RELEASE.md

    ## Thanks to our Contributors
    
    This release contains contributions from many people at Google, as well as:
    
    Created: Tue Apr 07 12:39:13 GMT 2026
    - Last Modified: Mon Mar 30 18:31:38 GMT 2026
    - 746.5K bytes