Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 7 of 7 for Wain (0.01 sec)

  1. fess-crawler/src/test/java/org/codelibs/fess/crawler/exception/CrawlerSystemExceptionTest.java

         */
        public void test_stackTraceWithCause() {
            Exception cause = new IllegalArgumentException("Cause exception");
            CrawlerSystemException exception = new CrawlerSystemException("Main exception", cause);
    
            StackTraceElement[] mainStackTrace = exception.getStackTrace();
            StackTraceElement[] causeStackTrace = cause.getStackTrace();
    
            assertNotNull(mainStackTrace);
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Wed Sep 03 14:42:53 UTC 2025
    - 20K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/util/TextUtil.java

    /**
     * Utility class for text normalization and processing.
     *
     * This class provides methods to normalize text by reading characters from a provided Reader
     * and processing them according to specific rules. The main functionality is encapsulated
     * within the nested {@link TextNormalizeContext} class.
     *
     * <p>The text normalization process includes:
     * <ul>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 12K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/FileTransformer.java

     * {@link org.codelibs.fess.crawler.exception.CrawlerSystemException} in case of errors.
     * </p>
     *
     * <p>
     * The {@link #storeData(ResponseData, ResultData)} method is the main entry point for storing
     * the content of a crawled resource. The {@link #getData(AccessResultData)} method retrieves
     * the stored file path as a File object.
     * </p>
     */
    public class FileTransformer extends HtmlTransformer {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 11.7K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java

     *   <li><b>invalidUrlPattern:</b> A regular expression pattern used to identify
     *       invalid URLs.</li>
     * </ul>
     *
     * <p>
     * <b>Usage:</b>
     * </p>
     * <p>
     * The {@code transform} method is the main entry point for transforming an HTML
     * response. It takes a {@link ResponseData} object as input and returns a
     * {@link ResultData} object containing the extracted data and child URLs.
     * </p>
     */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 28.5K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractor.java

     *   <li>Handling resource names and content types</li>
     *   <li>Retrying extraction without resource name or content type if the initial attempt fails</li>
     *   <li>Extracting text from metadata if the main content extraction fails</li>
     *   <li>Reading content as plain text if all other methods fail</li>
     *   <li>Applying post-extraction filters</li>
     *   <li>Handling Tika exceptions, including zip bomb exceptions</li>
     * </ul>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 30.7K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorBuilder.java

     * The builder allows setting parameters such as MIME type, filename, extractor name, maximum content length,
     * and cache file size to optimize the extraction process.
     *
     * <p>
     * The main purpose of this class is to simplify the extraction process by providing a fluent interface
     * for configuring the extraction parameters and handling the underlying complexities of content processing,
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.1K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java

    import org.codelibs.fess.crawler.service.DataService;
    import org.codelibs.fess.crawler.service.UrlQueueService;
    
    import jakarta.annotation.Resource;
    
    /**
     * The Crawler class is the main class for web crawling. It manages the crawling process,
     * including adding URLs to the queue, filtering URLs, managing crawler threads,
     * and handling the overall crawling lifecycle.
     *
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14K bytes
    - Viewed (0)
Back to top