- Sort Score
- Result 10 results
- Languages All
Results 11 - 20 of 529 for crawlen (0.8 sec)
-
src/main/java/org/codelibs/fess/job/CrawlJob.java
/** * CrawlJob is responsible for executing the crawling process in Fess. * This job launches a separate crawler process that can crawl web sites, file systems, * and data sources based on the configured crawling settings. * * <p>The job supports selective crawling by specifying configuration IDs for different * types of crawlers (web, file, data). It manages the crawler process lifecycle,
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 19.6K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/indexer/IndexUpdater.java
import org.codelibs.fess.Constants; import org.codelibs.fess.crawler.Crawler; import org.codelibs.fess.crawler.entity.AccessResult; import org.codelibs.fess.crawler.entity.AccessResultData; import org.codelibs.fess.crawler.entity.OpenSearchAccessResult; import org.codelibs.fess.crawler.entity.OpenSearchUrlQueue; import org.codelibs.fess.crawler.service.DataService; import org.codelibs.fess.crawler.service.UrlFilterService;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 32.9K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/crawler/FessCrawlerThread.java
import org.codelibs.fess.app.service.FailureUrlService; import org.codelibs.fess.crawler.builder.RequestDataBuilder; import org.codelibs.fess.crawler.client.CrawlerClient; import org.codelibs.fess.crawler.entity.RequestData; import org.codelibs.fess.crawler.entity.ResponseData; import org.codelibs.fess.crawler.entity.UrlQueue; import org.codelibs.fess.crawler.log.LogType; import org.codelibs.fess.exception.ContainerNotAvailableException;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Thu Dec 11 09:47:03 UTC 2025 - 19.5K bytes - Viewed (0) -
fess-crawler-opensearch/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
crawler1.getCrawlerContext().setMaxAccessCount(maxCount); crawler1.getCrawlerContext().setNumOfThread(numOfThread); final Crawler crawler2 = getComponent(Crawler.class); crawler2.setBackground(true); ((UrlFilterImpl) crawler2.urlFilter).setIncludeFilteringPattern("$1$2$3.*"); crawler2.addUrl(url2); crawler2.getCrawlerContext().setMaxAccessCount(maxCount);
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 7.7K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
crawler1.addUrl(url1); crawler1.getCrawlerContext().setMaxAccessCount(maxCount); crawler1.getCrawlerContext().setNumOfThread(numOfThread); final Crawler crawler2 = container.getComponent("crawler"); crawler2.setSessionId(crawler2.getSessionId() + "2"); crawler2.setBackground(true);
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Tue Nov 11 13:40:14 UTC 2025 - 25.8K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/helper/WebFsIndexHelper.java
import org.codelibs.fess.Constants; import org.codelibs.fess.crawler.Crawler; import org.codelibs.fess.crawler.CrawlerContext; import org.codelibs.fess.crawler.CrawlerStatus; import org.codelibs.fess.crawler.interval.FessIntervalController; import org.codelibs.fess.crawler.service.impl.OpenSearchDataService; import org.codelibs.fess.crawler.service.impl.OpenSearchUrlFilterService; import org.codelibs.fess.crawler.service.impl.OpenSearchUrlQueueService;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 25K bytes - Viewed (0) -
MIGRATION.md
-F "file=@documents.bulk" ``` ### 4. Direct Crawler Configuration Instead of importing documents, configure Fess to crawl your data sources: **Web Crawling**: - **Admin Path**: Crawler > Web - **Supports**: HTTP/HTTPS websites - **Features**: JavaScript rendering, authentication, custom headers **File Crawling**: - **Admin Path**: Crawler > File - **Supports**: SMB, FTP, local file systems
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Thu Nov 06 12:40:11 UTC 2025 - 23.2K bytes - Viewed (0) -
CLAUDE.md
│ ├── config/ # Config index (crawl configs, schedules) │ ├── log/ # Log index │ ├── user/ # User index │ ├── exbhv/ # Extended behaviors (repositories) │ └── exentity/ # Extended entities (domain models) ├── helper/ # Cross-cutting utilities ├── crawler/ # Crawling engine ├── sso/ # SSO implementations
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 4.8K bytes - Viewed (0) -
CLAUDE.md
### Key File Locations **Core**: `fess-crawler/src/main/java/org/codelibs/fess/crawler/` - `Crawler.java`, `CrawlerContext.java`, `CrawlerThread.java` **Clients**: `fess-crawler/src/main/java/org/codelibs/fess/crawler/client/` - `http/HcHttpClient.java`, `fs/FileSystemClient.java`, `storage/StorageClient.java` **DI Config**: `fess-crawler-lasta/src/main/resources/crawler.xml` ### Exception Hierarchy ```
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/crawler/processor/FessResponseProcessor.java
*/ package org.codelibs.fess.crawler.processor; import org.apache.logging.log4j.LogManager; import org.apache.logging.log4j.Logger; import org.codelibs.fess.crawler.entity.AccessResult; import org.codelibs.fess.crawler.entity.ResponseData; import org.codelibs.fess.crawler.entity.ResultData; import org.codelibs.fess.crawler.processor.impl.DefaultResponseProcessor;
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Thu Jul 17 08:28:31 UTC 2025 - 3.7K bytes - Viewed (0)