- Sort Score
- Result 10 results
- Languages All
Results 1 - 10 of 243 for crawler (0.04 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java
* * <p>Example usage: * <pre> * Crawler crawler = new Crawler(); * crawler.addUrl("http://example.com/"); * crawler.execute(); * crawler.close(); * </pre> */ public class Crawler implements Runnable, AutoCloseable { private static final Logger logger = LogManager.getLogger(Crawler.class); /** * Service for managing URL queues during crawling. */
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 17K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/exec/Crawler.java
* </pre> */ public class Crawler { /** * Creates a new instance of Crawler. */ public Crawler() { // Default constructor } /** Logger instance for this class. */ private static final Logger logger = LogManager.getLogger(Crawler.class); /** Thread name for web and file system crawling process. */Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 31.4K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
crawler1.addUrl(url1); crawler1.getCrawlerContext().setMaxAccessCount(maxCount); crawler1.getCrawlerContext().setNumOfThread(numOfThread); final Crawler crawler2 = container.getComponent("crawler"); crawler2.setSessionId(crawler2.getSessionId() + "2"); crawler2.setBackground(true);
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Tue Nov 11 13:40:14 UTC 2025 - 25.8K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/indexer/IndexUpdater.java
import org.codelibs.fess.Constants; import org.codelibs.fess.crawler.Crawler; import org.codelibs.fess.crawler.entity.AccessResult; import org.codelibs.fess.crawler.entity.AccessResultData; import org.codelibs.fess.crawler.entity.OpenSearchAccessResult; import org.codelibs.fess.crawler.entity.OpenSearchUrlQueue; import org.codelibs.fess.crawler.service.DataService; import org.codelibs.fess.crawler.service.UrlFilterService;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 32.9K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/job/CrawlJob.java
* This job launches a separate crawler process that can crawl web sites, file systems, * and data sources based on the configured crawling settings. * * <p>The job supports selective crawling by specifying configuration IDs for different * types of crawlers (web, file, data). It manages the crawler process lifecycle, * handles timeout scenarios, and ensures proper cleanup of resources.</p> *
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 19.6K bytes - Viewed (0) -
CLAUDE.md
### Key File Locations **Core**: `fess-crawler/src/main/java/org/codelibs/fess/crawler/` - `Crawler.java`, `CrawlerContext.java`, `CrawlerThread.java` **Clients**: `fess-crawler/src/main/java/org/codelibs/fess/crawler/client/` - `http/HcHttpClient.java`, `fs/FileSystemClient.java`, `storage/StorageClient.java` **DI Config**: `fess-crawler-lasta/src/main/resources/crawler.xml` ### Exception Hierarchy ```
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/helper/WebFsIndexHelper.java
import org.codelibs.fess.Constants; import org.codelibs.fess.crawler.Crawler; import org.codelibs.fess.crawler.CrawlerContext; import org.codelibs.fess.crawler.CrawlerStatus; import org.codelibs.fess.crawler.interval.FessIntervalController; import org.codelibs.fess.crawler.service.impl.OpenSearchDataService; import org.codelibs.fess.crawler.service.impl.OpenSearchUrlFilterService; import org.codelibs.fess.crawler.service.impl.OpenSearchUrlQueueService;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 25K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/fs/FileSystemClient.java
import org.codelibs.fess.crawler.Constants; import org.codelibs.fess.crawler.builder.RequestDataBuilder; import org.codelibs.fess.crawler.client.AbstractCrawlerClient; import org.codelibs.fess.crawler.client.AccessTimeoutTarget; import org.codelibs.fess.crawler.entity.RequestData; import org.codelibs.fess.crawler.entity.ResponseData; import org.codelibs.fess.crawler.exception.ChildUrlsException;
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 12:19:14 UTC 2025 - 15.1K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/crawler/FessCrawlerThread.java
import org.codelibs.fess.app.service.FailureUrlService; import org.codelibs.fess.crawler.builder.RequestDataBuilder; import org.codelibs.fess.crawler.client.CrawlerClient; import org.codelibs.fess.crawler.entity.RequestData; import org.codelibs.fess.crawler.entity.ResponseData; import org.codelibs.fess.crawler.entity.UrlQueue; import org.codelibs.fess.crawler.log.LogType; import org.codelibs.fess.exception.ContainerNotAvailableException;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Thu Dec 11 09:47:03 UTC 2025 - 19.5K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/ds/callback/FileListIndexUpdateCallbackImpl.java
import org.codelibs.fess.Constants; import org.codelibs.fess.crawler.builder.RequestDataBuilder; import org.codelibs.fess.crawler.client.CrawlerClient; import org.codelibs.fess.crawler.client.CrawlerClientFactory; import org.codelibs.fess.crawler.entity.ResponseData; import org.codelibs.fess.crawler.entity.ResultData; import org.codelibs.fess.crawler.exception.ChildUrlsException; import org.codelibs.fess.crawler.exception.CrawlerSystemException;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 29.7K bytes - Viewed (3)