- Sort Score
- Result 10 results
- Languages All
Results 71 - 80 of 82 for crawl (0.07 sec)
-
src/main/java/org/codelibs/fess/job/CrawlJob.java
import org.codelibs.fess.util.SystemUtil; import jakarta.servlet.ServletContext; /** * CrawlJob is responsible for executing the crawling process in Fess. * This job launches a separate crawler process that can crawl web sites, file systems, * and data sources based on the configured crawling settings. * * <p>The job supports selective crawling by specifying configuration IDs for different
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Jul 17 08:28:31 UTC 2025 - 19.6K bytes - Viewed (0) -
README.md
Fess also contains a Crawler, which can crawl documents on a [web server](https://fess.codelibs.org/15.0/admin/webconfig-guide.html), [file system](https://fess.codelibs.org/15.0/admin/fileconfig-guide.html), or [Data Store](https://fess.codelibs.org/15.0/admin/dataconfig-guide.html) (such as...
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Jul 10 06:34:32 UTC 2025 - 7.2K bytes - Viewed (2) -
src/main/java/org/codelibs/fess/crawler/FessCrawlerThread.java
*/ protected ConcurrentHashMap<String, Pair<String, Pattern>> clientRuleCache = new ConcurrentHashMap<>(); /** * Determines whether the content at the given URL has been updated since the last crawl. * This method implements incremental crawling by comparing timestamps and checking document * expiration. It also handles special cases for different URL schemes (SMB, file, FTP). *
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Aug 07 03:06:29 UTC 2025 - 19.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/fs/FileSystemClient.java
import org.codelibs.fess.crawler.helper.ContentLengthHelper; import org.codelibs.fess.crawler.helper.MimeTypeHelper; import jakarta.annotation.Resource; /** * FileSystemClient is CrawlerClient implementation to crawl files on a file * system. * * @author shinsuke * */ public class FileSystemClient extends AbstractCrawlerClient { /** Logger instance for this class */
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 13.8K bytes - Viewed (0) -
src/main/webapp/WEB-INF/view/admin/fileconfig/admin_fileconfig_edit.jsp
<div class="wrapper"> <jsp:include page="/WEB-INF/view/common/admin/header.jsp"></jsp:include> <jsp:include page="/WEB-INF/view/common/admin/sidebar.jsp"> <jsp:param name="menuCategoryType" value="crawl"/> <jsp:param name="menuType" value="fileConfig"/> </jsp:include> <div class="content-wrapper"> <div class="content-header"> <div class="container-fluid">
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Feb 13 07:47:04 UTC 2020 - 15.3K bytes - Viewed (0) -
src/main/webapp/WEB-INF/view/admin/webconfig/admin_webconfig_edit.jsp
<div class="wrapper"> <jsp:include page="/WEB-INF/view/common/admin/header.jsp"></jsp:include> <jsp:include page="/WEB-INF/view/common/admin/sidebar.jsp"> <jsp:param name="menuCategoryType" value="crawl"/> <jsp:param name="menuType" value="webConfig"/> </jsp:include> <div class="content-wrapper"> <div class="content-header"> <div class="container-fluid">
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Feb 13 07:47:04 UTC 2020 - 15.9K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/app/web/admin/reqheader/AdminReqheaderAction.java
@Resource private RequestHeaderService requestHeaderService; @Resource private ReqHeaderPager reqHeaderPager; /** Web configuration service for managing web crawl configurations. */ @Resource protected WebConfigService webConfigService; // ===================================================================================
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Thu Jul 17 08:28:31 UTC 2025 - 18.3K bytes - Viewed (0) -
src/test/java/org/codelibs/fess/job/CrawlJobTest.java
try { crawlJob.execute(); fail("Should throw JobProcessingException"); } catch (JobProcessingException e) { assertTrue(e.getMessage().contains("Failed to execute a crawl job")); assertNotNull(e.getCause()); assertEquals("Test exception", e.getCause().getMessage()); } } // Test execute with JobProcessingException in executeCrawler
Registered: Thu Sep 04 12:52:25 UTC 2025 - Last Modified: Tue Aug 19 14:09:36 UTC 2025 - 25K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/CrawlerThread.java
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 20.4K bytes - Viewed (0) -
guava/src/com/google/common/util/concurrent/AbstractScheduledService.java
* toCrawl = readStartingUris(); * } * * protected void runOneIteration() throws Exception { * Uri uri = toCrawl.remove(); * Collection<Uri> newUris = crawl(uri); * visited.add(uri); * for (Uri newUri : newUris) { * if (!visited.contains(newUri)) { toCrawl.add(newUri); } * } * } * * protected void shutDown() throws Exception {
Registered: Fri Sep 05 12:43:10 UTC 2025 - Last Modified: Sat Aug 09 01:14:59 UTC 2025 - 27.8K bytes - Viewed (0)