- Sort Score
- Result 10 results
- Languages All
Results 1 - 10 of 631 for crawled (0.12 sec)
-
src/main/java/org/codelibs/fess/job/CrawlJob.java
* This job launches a separate crawler process that can crawl web sites, file systems, * and data sources based on the configured crawling settings. * * <p>The job supports selective crawling by specifying configuration IDs for different * types of crawlers (web, file, data). It manages the crawler process lifecycle, * handles timeout scenarios, and ensures proper cleanup of resources.</p> *
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Jul 17 08:28:31 UTC 2025 - 19.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java
* * <p>Example usage: * <pre> * Crawler crawler = new Crawler(); * crawler.addUrl("http://example.com/"); * crawler.execute(); * crawler.close(); * </pre> */ public class Crawler implements Runnable, AutoCloseable { private static final Logger logger = LogManager.getLogger(Crawler.class); /** * Service for managing URL queues during crawling. */
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/crawler/processor/FessResponseProcessor.java
*/ package org.codelibs.fess.crawler.processor; import org.apache.logging.log4j.LogManager; import org.apache.logging.log4j.Logger; import org.codelibs.fess.crawler.entity.AccessResult; import org.codelibs.fess.crawler.entity.ResponseData; import org.codelibs.fess.crawler.entity.ResultData; import org.codelibs.fess.crawler.processor.impl.DefaultResponseProcessor;
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Jul 17 08:28:31 UTC 2025 - 3.7K bytes - Viewed (0) -
README.md
```java // Create multiple crawler instances Crawler crawler1 = container.getComponent("crawler"); crawler1.setSessionId("session1"); crawler1.addUrl("https://site1.com"); Crawler crawler2 = container.getComponent("crawler"); crawler2.setSessionId("session2"); crawler2.addUrl("https://site2.com"); // Execute concurrently crawler1.setBackground(true); crawler2.setBackground(true);
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/ResponseData.java
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 11.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/processor/impl/SitemapsResponseProcessor.java
import org.codelibs.fess.crawler.entity.Sitemap; import org.codelibs.fess.crawler.entity.SitemapSet; import org.codelibs.fess.crawler.exception.ChildUrlsException; import org.codelibs.fess.crawler.helper.SitemapsHelper; import org.codelibs.fess.crawler.processor.ResponseProcessor; import jakarta.annotation.Resource; /** * A response processor implementation that handles sitemaps.
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 3.4K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/indexer/IndexUpdater.java
import org.codelibs.fess.Constants; import org.codelibs.fess.crawler.Crawler; import org.codelibs.fess.crawler.entity.AccessResult; import org.codelibs.fess.crawler.entity.AccessResultData; import org.codelibs.fess.crawler.entity.OpenSearchAccessResult; import org.codelibs.fess.crawler.entity.OpenSearchUrlQueue; import org.codelibs.fess.crawler.service.DataService; import org.codelibs.fess.crawler.service.UrlFilterService;
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Tue Aug 19 14:09:36 UTC 2025 - 32.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/FileTransformer.java
import org.codelibs.core.io.CopyUtil; import org.codelibs.fess.crawler.Constants; import org.codelibs.fess.crawler.entity.AccessResultData; import org.codelibs.fess.crawler.entity.ResponseData; import org.codelibs.fess.crawler.entity.ResultData; import org.codelibs.fess.crawler.exception.CrawlerSystemException; /** * <p> * FileTransformer stores the content of a crawled resource as a file on the file system.
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 11.7K bytes - Viewed (0) -
fess-crawler-opensearch/src/main/java/org/codelibs/fess/crawler/service/impl/OpenSearchUrlQueueService.java
import org.codelibs.core.lang.StringUtil; import org.codelibs.fess.crawler.Constants; import org.codelibs.fess.crawler.entity.AccessResult; import org.codelibs.fess.crawler.entity.OpenSearchUrlQueue; import org.codelibs.fess.crawler.entity.UrlQueue; import org.codelibs.fess.crawler.exception.OpenSearchAccessException; import org.codelibs.fess.crawler.service.UrlQueueService; import org.codelibs.fess.crawler.util.OpenSearchCrawlerConfig;
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 17K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/app/web/admin/general/EditForm.java
// Default constructor } /** * Enable or disable incremental crawling. * When enabled, only new or modified documents are crawled. */ @Size(max = 10) public String incrementalCrawling; /** * Number of days to keep crawled documents before cleanup. * Set to -1 to disable automatic cleanup. */ @Required @Min(-1) @Max(1000) @ValidateTypeFailure
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Jul 17 08:28:31 UTC 2025 - 9.9K bytes - Viewed (0)