- Sort Score
- Result 10 results
- Languages All
Results 21 - 30 of 31 for remain (0.03 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/EmlExtractor.java
} } /** * Gets the mail properties used for email processing. * * @return the mail properties */ public Properties getMailProperties() { return mailProperties; } /** * Sets the mail properties used for email processing. * * @param mailProperties the mail properties to set */Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 12.6K bytes - Viewed (0) -
README.md
import org.codelibs.fess.crawler.container.StandardCrawlerContainer; import org.codelibs.fess.crawler.transformer.impl.FileTransformer; public class BasicCrawlerExample { public static void main(String[] args) throws Exception { // Create crawler container StandardCrawlerContainer container = new StandardCrawlerContainer(); // Configure basic componentsRegistered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/exception/CrawlerSystemExceptionTest.java
*/ public void test_stackTraceWithCause() { Exception cause = new IllegalArgumentException("Cause exception"); CrawlerSystemException exception = new CrawlerSystemException("Main exception", cause); StackTraceElement[] mainStackTrace = exception.getStackTrace(); StackTraceElement[] causeStackTrace = cause.getStackTrace(); assertNotNull(mainStackTrace);Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Wed Sep 03 14:42:53 UTC 2025 - 20K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorBuilder.java
* The builder allows setting parameters such as MIME type, filename, extractor name, maximum content length, * and cache file size to optimize the extraction process. * * <p> * The main purpose of this class is to simplify the extraction process by providing a fluent interface * for configuring the extraction parameters and handling the underlying complexities of content processing,
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractor.java
* <li>Handling resource names and content types</li> * <li>Retrying extraction without resource name or content type if the initial attempt fails</li> * <li>Extracting text from metadata if the main content extraction fails</li> * <li>Reading content as plain text if all other methods fail</li> * <li>Applying post-extraction filters</li> * <li>Handling Tika exceptions, including zip bomb exceptions</li> * </ul>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 30.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/util/TextUtil.java
/** * Utility class for text normalization and processing. * * This class provides methods to normalize text by reading characters from a provided Reader * and processing them according to specific rules. The main functionality is encapsulated * within the nested {@link TextNormalizeContext} class. * * <p>The text normalization process includes: * <ul>Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 12K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/FileTransformer.java
* {@link org.codelibs.fess.crawler.exception.CrawlerSystemException} in case of errors. * </p> * * <p> * The {@link #storeData(ResponseData, ResultData)} method is the main entry point for storing * the content of a crawled resource. The {@link #getData(AccessResultData)} method retrieves * the stored file path as a File object. * </p> */ public class FileTransformer extends HtmlTransformer {Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 11.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java
import org.codelibs.fess.crawler.service.DataService; import org.codelibs.fess.crawler.service.UrlQueueService; import jakarta.annotation.Resource; /** * The Crawler class is the main class for web crawling. It manages the crawling process, * including adding URLs to the queue, filtering URLs, managing crawler threads, * and handling the overall crawling lifecycle. *
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14K bytes - Viewed (0) -
fess-crawler-lasta/src/main/resources/crawler/extractor.xml
"application/vnd.oma.poc.invocation-descriptor+xml", "application/vnd.oma.poc.optimized-progress-report+xml", "application/vnd.oma.xcap-directory+xml", "application/vnd.omads-email+xml", "application/vnd.omads-file+xml", "application/vnd.omads-folder+xml", "application/vnd.omaloc-supl-init", "application/vnd.openofficeorg.extension",
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Aug 01 21:40:30 UTC 2020 - 49K bytes - Viewed (0) -
fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml
<mime-type type="application/vnd.oma.poc.optimized-progress-report+xml"/> <mime-type type="application/vnd.oma.xcap-directory+xml"/> <mime-type type="application/vnd.omads-email+xml"/> <mime-type type="application/vnd.omads-file+xml"/> <mime-type type="application/vnd.omads-folder+xml"/> <mime-type type="application/vnd.omaloc-supl-init"/>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Mar 13 08:18:01 UTC 2025 - 320.1K bytes - Viewed (2)