Results 21 - 30 of 96 for PDF$ (0.1 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorBuilder.java

     * </p>
     *
     * <p>
     * Example usage:
     * </p>
     *
     * <pre>
     * {@code
     * try (InputStream in = new FileInputStream("example.pdf")) {
     *     ExtractData extractData = new ExtractorBuilder(crawlerContainer, in, new HashMap<>())
     *         .mimeType("application/pdf")
     *         .filename("example.pdf")
     *         .maxContentLength(1024 * 1024)
     *         .extract();
     *
     *     String content = extractData.getContent();
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.1K bytes
    - Viewed (0)
  2. src/main/java/org/codelibs/fess/helper/FileTypeHelper.java

            }
        }
    
        /**
         * Adds or updates a MIME type to file type mapping.
         *
         * @param mimetype the MIME type to map (e.g., "application/pdf")
         * @param filetype the file type classification (e.g., "pdf")
         */
        public void add(final String mimetype, final String filetype) {
            mimetypeMap.put(mimetype, filetype);
        }
    
        /**
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 4.4K bytes
    - Viewed (0)
  3. src/main/java/jcifs/smb1/util/mime.map

    application/msword             doc              # Microsoft Word
    application/octet-stream       bin exe ani      # Binary File
    application/oda                oda
    application/pagemaker          pm5 pt5 pm       # PageMaker
    application/pdf                pdf              # Adobe Acrobat
    application/postscript         ai eps ps        # Postscript File
    application/rtf                rtf              # Rich Text File
    application/toolbook           tbk              # Toolbook
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Fri Mar 22 20:39:42 UTC 2019
    - 5.9K bytes
    - Viewed (0)
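
The mime.map excerpt above appears to use one entry per line: a MIME type, one or more whitespace-separated extensions, and an optional `#` comment. The following is a minimal sketch of reading that layout into an extension-to-MIME-type lookup. `MimeMapSketch` and `parse` are hypothetical names chosen for illustration and are not taken from jcifs; the line format is assumed from the excerpt only.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not part of jcifs: parses lines shaped like
// "application/pdf   pdf   # Adobe Acrobat" (format assumed from the excerpt).
final class MimeMapSketch {
    private MimeMapSketch() {
    }

    // Returns a map from lower-cased extension to MIME type.
    static Map<String, String> parse(List<String> lines) {
        Map<String, String> byExtension = new LinkedHashMap<>();
        for (String raw : lines) {
            // Drop the trailing comment, if any.
            int hash = raw.indexOf('#');
            String line = (hash >= 0 ? raw.substring(0, hash) : raw).trim();
            if (line.isEmpty()) {
                continue;
            }
            // First token is the MIME type, the rest are extensions.
            String[] tokens = line.split("\\s+");
            for (int i = 1; i < tokens.length; i++) {
                // Assumption: the first mapping seen for an extension wins.
                byExtension.putIfAbsent(tokens[i].toLowerCase(Locale.ROOT), tokens[0]);
            }
        }
        return byExtension;
    }
}
```

With the lines shown in the excerpt, a lookup of `pdf` would yield `application/pdf`, and `ai`, `eps`, and `ps` would all yield `application/postscript`.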
  4. fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots_wildcard.txt

    # Test robots.txt for wildcard (*) and end-of-path ($) support
    # Based on RFC 9309 specification
    
    # Test wildcard patterns
    User-agent: WildcardBot
    Disallow: /*.pdf$
    Disallow: /admin/*.php
    Disallow: /*/private/
    Allow: /public/*.html
    
    # Test end-of-path ($) patterns
    User-agent: EndPathBot
    Disallow: /fish$
    Disallow: /temp$
    Allow: /fishing
    
    # Test complex patterns
    User-agent: ComplexBot
    Disallow: /
    Allow: /$
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 14:03:41 UTC 2025
    - 910 bytes
    - Viewed (0)
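
The rules in the robots.txt excerpt above rely on the two special characters defined in RFC 9309: `*` matches any sequence of characters, and a trailing `$` anchors the pattern to the end of the path. The following is a minimal sketch of that matching, translating a rule into a regular expression. `RobotsPathPattern` is a hypothetical class written for illustration, not the fess-crawler implementation, and it does not cover rule precedence (RFC 9309 selects the most specific, i.e. longest, matching rule) or percent-encoding normalization.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of fess-crawler: matches a URL path against
// a single robots.txt path pattern using the RFC 9309 special characters.
final class RobotsPathPattern {
    private RobotsPathPattern() {
    }

    static Pattern compile(String rule) {
        // A trailing '$' means the pattern must match up to the end of the path.
        boolean anchored = rule.endsWith("$");
        String body = anchored ? rule.substring(0, rule.length() - 1) : rule;

        StringBuilder regex = new StringBuilder();
        int start = 0;
        for (int i = 0; i < body.length(); i++) {
            if (body.charAt(i) == '*') {
                // Quote the literal run before the wildcard, then allow anything.
                if (i > start) {
                    regex.append(Pattern.quote(body.substring(start, i)));
                }
                regex.append(".*");
                start = i + 1;
            }
        }
        if (start < body.length()) {
            regex.append(Pattern.quote(body.substring(start)));
        }
        // Without '$', a rule is a prefix match, so any suffix is accepted.
        if (!anchored) {
            regex.append(".*");
        }
        return Pattern.compile(regex.toString(), Pattern.DOTALL);
    }

    static boolean matches(String rule, String path) {
        return compile(rule).matcher(path).matches();
    }
}
```

Under this sketch, `/*.pdf$` matches `/docs/report.pdf` but not `/docs/report.pdf?page=2`, `/fish$` matches `/fish` but not `/fishing`, and `/$` matches only the root path `/`.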
  5. src/test/java/jcifs/smb/FileEntryTest.java

        void fakeImplementation_happyPath() {
            // Arrange
            FileEntry e = new TestFileEntry("doc.pdf", 2, 0x10, 10L, 20L, 30L, 4096L, 3);
    
            // Act & Assert: getters return exactly what was provided
            assertAll(() -> assertEquals("doc.pdf", e.getName()), () -> assertEquals(2, e.getType()),
                    () -> assertEquals(0x10, e.getAttributes()), () -> assertEquals(10L, e.createTime()),
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Thu Aug 14 05:31:44 UTC 2025
    - 10K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractor.java

     *   <li>Maximum term sizes for alphanumeric and symbolic terms</li>
     *   <li>Custom Tika configuration</li>
     *   <li>Tesseract OCR configuration for image-based documents</li>
     *   <li>PDF Parser configuration for PDF documents</li>
     * </ul>
     *
     * <p>
     * The {@link TikaDetectParser} inner class extends {@link CompositeParser} to provide auto-detection of the MIME type
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 12:19:14 UTC 2025
    - 30.8K bytes
    - Viewed (0)
  7. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/ExtractorFactoryTest.java

            extractorFactory.addExtractor("application/vnd.ms-powerpoint", tikaExtractor);
            extractorFactory.addExtractor("application/vnd.visio", tikaExtractor);
            extractorFactory.addExtractor("application/pdf", pdfExtractor);
            extractorFactory.addExtractor("application/x-lha", lhaExtractor);
            extractorFactory.addExtractor("application/x-lharc", lhaExtractor);
    
        }
    
        public void test_addExtractor() {
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Mar 15 06:52:00 UTC 2025
    - 6.9K bytes
    - Viewed (0)
  8. src/test/resources/jcifs/smb1/util/mime.map

    application/msword             doc              # Microsoft Word
    application/octet-stream       bin exe ani      # Binary File
    application/oda                oda
    application/pagemaker          pm5 pt5 pm       # PageMaker
    application/pdf                pdf              # Adobe Acrobat
    application/postscript         ai eps ps        # Postscript File
    application/rtf                rtf              # Rich Text File
    application/toolbook           tbk              # Toolbook
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Thu Aug 14 05:31:44 UTC 2025
    - 5.9K bytes
    - Viewed (0)
  9. docs/compression/README.md

    ```
    
    Default config includes most common highly compressible content extensions and mime-types.
    
    ```bash
    ~ mc admin config set myminio compression extensions=".pdf" mime_types="application/pdf"
    ```
    
    To show help on setting compression config values.
    
    ```bash
    ~ mc admin config set myminio compression
    ```
    
    To enable compression for all content, no matter the extension and content type
    Registered: Sun Dec 28 19:28:13 UTC 2025
    - Last Modified: Tue Aug 12 18:20:36 UTC 2025
    - 5.2K bytes
    - Viewed (0)
  10. fess-crawler-opensearch/src/test/java/org/codelibs/fess/crawler/service/impl/OpenSearchDataServiceTest.java

            pdfResult.setMethod("GET");
            pdfResult.setMimeType("application/pdf");
            pdfResult.setParentUrl("http://www.parent.com/");
            pdfResult.setRuleId("pdfRule");
            pdfResult.setSessionId(sessionId);
            pdfResult.setStatus(200);
            pdfResult.setUrl("http://www.example.com/document.pdf");
            dataService.store(pdfResult);
    
            // Filter by MIME type
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 20 08:40:57 UTC 2025
    - 15.5K bytes
    - Viewed (0)