Search Options

Results per page
Sort
Preferred Languages
Advance

Results 21 - 30 of 31 for remain (0.03 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/EmlExtractor.java

            }
        }
    
        /**
         * Gets the mail properties used for email processing.
         *
         * @return the mail properties
         */
        public Properties getMailProperties() {
            return mailProperties;
        }
    
        /**
         * Sets the mail properties used for email processing.
         *
         * @param mailProperties the mail properties to set
         */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 12.6K bytes
    - Viewed (0)
  2. README.md

    import org.codelibs.fess.crawler.container.StandardCrawlerContainer;
    import org.codelibs.fess.crawler.transformer.impl.FileTransformer;
    
    public class BasicCrawlerExample {
        public static void main(String[] args) throws Exception {
            // Create crawler container
            StandardCrawlerContainer container = new StandardCrawlerContainer();
            
            // Configure basic components
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  3. fess-crawler/src/test/java/org/codelibs/fess/crawler/exception/CrawlerSystemExceptionTest.java

         */
        public void test_stackTraceWithCause() {
            Exception cause = new IllegalArgumentException("Cause exception");
            CrawlerSystemException exception = new CrawlerSystemException("Main exception", cause);
    
            StackTraceElement[] mainStackTrace = exception.getStackTrace();
            StackTraceElement[] causeStackTrace = cause.getStackTrace();
    
            assertNotNull(mainStackTrace);
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Wed Sep 03 14:42:53 UTC 2025
    - 20K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorBuilder.java

     * The builder allows setting parameters such as MIME type, filename, extractor name, maximum content length,
     * and cache file size to optimize the extraction process.
     *
     * <p>
     * The main purpose of this class is to simplify the extraction process by providing a fluent interface
     * for configuring the extraction parameters and handling the underlying complexities of content processing,
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.1K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractor.java

     *   <li>Handling resource names and content types</li>
     *   <li>Retrying extraction without resource name or content type if the initial attempt fails</li>
     *   <li>Extracting text from metadata if the main content extraction fails</li>
     *   <li>Reading content as plain text if all other methods fail</li>
     *   <li>Applying post-extraction filters</li>
     *   <li>Handling Tika exceptions, including zip bomb exceptions</li>
     * </ul>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 30.7K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/util/TextUtil.java

    /**
     * Utility class for text normalization and processing.
     *
     * This class provides methods to normalize text by reading characters from a provided Reader
     * and processing them according to specific rules. The main functionality is encapsulated
     * within the nested {@link TextNormalizeContext} class.
     *
     * <p>The text normalization process includes:
     * <ul>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 12K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/FileTransformer.java

     * {@link org.codelibs.fess.crawler.exception.CrawlerSystemException} in case of errors.
     * </p>
     *
     * <p>
     * The {@link #storeData(ResponseData, ResultData)} method is the main entry point for storing
     * the content of a crawled resource. The {@link #getData(AccessResultData)} method retrieves
     * the stored file path as a File object.
     * </p>
     */
    public class FileTransformer extends HtmlTransformer {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 11.7K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java

    import org.codelibs.fess.crawler.service.DataService;
    import org.codelibs.fess.crawler.service.UrlQueueService;
    
    import jakarta.annotation.Resource;
    
    /**
     * The Crawler class is the main class for web crawling. It manages the crawling process,
     * including adding URLs to the queue, filtering URLs, managing crawler threads,
     * and handling the overall crawling lifecycle.
     *
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14K bytes
    - Viewed (0)
  9. fess-crawler-lasta/src/main/resources/crawler/extractor.xml

    				"application/vnd.oma.poc.invocation-descriptor+xml",
    				"application/vnd.oma.poc.optimized-progress-report+xml",
    				"application/vnd.oma.xcap-directory+xml",
    				"application/vnd.omads-email+xml",
    				"application/vnd.omads-file+xml",
    				"application/vnd.omads-folder+xml",
    				"application/vnd.omaloc-supl-init",
    				"application/vnd.openofficeorg.extension",
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Aug 01 21:40:30 UTC 2020
    - 49K bytes
    - Viewed (0)
  10. fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml

      <mime-type type="application/vnd.oma.poc.optimized-progress-report+xml"/>
      <mime-type type="application/vnd.oma.xcap-directory+xml"/>
      <mime-type type="application/vnd.omads-email+xml"/>
      <mime-type type="application/vnd.omads-file+xml"/>
      <mime-type type="application/vnd.omads-folder+xml"/>
      <mime-type type="application/vnd.omaloc-supl-init"/>
    
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Mar 13 08:18:01 UTC 2025
    - 320.1K bytes
    - Viewed (2)
Back to top