Search Options

Results per page
Sort
Preferred Languages
Advance

Results 81 - 90 of 116 for texts (0.01 sec)

  1. LICENSE

          (d) If the Work includes a "NOTICE" text file as part of its
              distribution, then any Derivative Works that You distribute must
              include a readable copy of the attribution notices contained
              within such NOTICE file, excluding those notices that do not
              pertain to any part of the Derivative Works, in at least one
              of the following places: within a NOTICE text file distributed
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Mon Jan 11 04:26:17 UTC 2021
    - 11.1K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/client/storage/StorageClientTest.java

            try (final ResponseData responseData = storageClient.doGet("storage://fess/file1.txt")) {
                assertEquals("storage://fess/file1.txt", responseData.getUrl());
                assertEquals("text/plain", responseData.getMimeType());
                assertEquals("file1", new String(InputStreamUtil.getBytes(responseData.getResponseBody())));
                assertEquals(5, responseData.getContentLength());
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 13.8K bytes
    - Viewed (0)
  3. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/PdfExtractorTest.java

                        PdfExtractor pdfExtractor = container.getComponent("pdfExtractor");
                        factory.addExtractor("text/plain", tikaExtractor);
                        factory.addExtractor("text/html", tikaExtractor);
                        factory.addExtractor("application/pdf", pdfExtractor);
                    });
            pdfExtractor = container.getComponent("pdfExtractor");
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Mar 15 06:52:00 UTC 2025
    - 7.6K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/rule/impl/RegexRule.java

     * RegexRule rule = new RegexRule();
     * rule.addRule("url", "https://example.com/.*");
     * rule.addRule("contentType", "text/html");
     * rule.setAllRequired(true); // Both URL and content type must match
     *
     * ResponseData responseData = new ResponseData();
     * responseData.setUrl("https://example.com/page1");
     * responseData.setContentType("text/html");
     *
     * boolean matches = rule.match(responseData); // Returns true
     * }
     * </pre>
     *
     */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 6.2K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlXpathExtractor.java

     * It uses XPath expressions to extract text content from HTML documents.
     * <p>
     * This class provides methods to configure the XPath expressions, parser features, and properties.
     * It also includes caching mechanism for XPathAPI instances to improve performance.
     * </p>
     * <p>
     * The extracted text is obtained from the nodes selected by the {@code targetNodePath} XPath expression.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.3K bytes
    - Viewed (0)
  6. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/ContentLengthHelperTest.java

            contentLengthHelper = container.getComponent("contentLengthHelper");
        }
    
        public void test_getMaxLength() {
            final String mimeType = "text/plain";
    
            assertEquals(DEFAULT_MAX_LENGTH, contentLengthHelper.getMaxLength(mimeType));
            contentLengthHelper.addMaxLength(mimeType, 1000L);
            assertEquals(1000L, contentLengthHelper.getMaxLength(mimeType));
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Mar 15 06:52:00 UTC 2025
    - 2K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/XmlExtractor.java

     * governing permissions and limitations under the License.
     */
    package org.codelibs.fess.crawler.extractor.impl;
    
    import java.util.regex.Pattern;
    
    /**
     * Extracts text content from XML documents.
     */
    public class XmlExtractor extends AbstractXmlExtractor {
    
        /**
         * Creates a new XmlExtractor instance.
         */
        public XmlExtractor() {
            super();
        }
    
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 2.6K bytes
    - Viewed (0)
  8. fess-crawler/pom.xml

    			<artifactId>commons-lang3</artifactId>
    			<version>${commons.lang3.version}</version>
    		</dependency>
    		<dependency>
    			<groupId>org.apache.commons</groupId>
    			<artifactId>commons-text</artifactId>
    			<version>${commons.text.version}</version>
    		</dependency>
    		<dependency>
    			<groupId>commons-net</groupId>
    			<artifactId>commons-net</artifactId>
    			<version>${commons.net.version}</version>
    		</dependency>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 11.3K bytes
    - Viewed (0)
  9. fess-crawler/src/test/java/org/codelibs/fess/crawler/client/smb/SmbClientTest.java

                assertEquals(200, responseData.getHttpStatusCode());
                assertEquals(5, responseData.getContentLength());
                assertEquals(baseUrl + "file1.txt", responseData.getUrl());
                assertEquals("text/plain", responseData.getMimeType());
                assertNull(responseData.getResponseBody());
            }
            try (final ResponseData responseData = smbClient.doHead(baseUrl + "dir1/file2.txt")) {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 13.7K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/util/TemporaryFileInputStream.java

         */
        @Override
        public synchronized void mark(final int readlimit) {
            fileInputStream.mark(readlimit);
        }
    
        /**
         * Tests if this input stream supports the mark and reset methods.
         * This method delegates to {@link FileInputStream#markSupported()}.
         *
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 4.3K bytes
    - Viewed (0)
Back to top