Results 11 - 14 of 14 for addInclude (0.05 sec)
fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java
 * @param regexp The regular expression for the include filter.
 */
public void addIncludeFilter(final String regexp) {
    if (StringUtil.isNotBlank(regexp)) {
        urlFilter.addInclude(regexp);
    }
}

/**
 * Adds an exclude filter for URLs.
 * URLs matching this regular expression will not be crawled.
 * @param regexp The regular expression for the exclude filter.
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 17K bytes - Viewed (0) -
CLAUDE.md
3. **Add test with sample file** in `src/test/resources/`

### Configuring URL Filtering

```java
// Include patterns (must match)
crawler.urlFilter.addInclude("https://example.com/.*");
// Exclude patterns (must not match)
crawler.urlFilter.addExclude(".*\\.(css|js|png|jpg)$");
```

### Setting Crawl Limits

```java
context.setMaxAccessCount(1000); // Max URLs (0 = unlimited)
```
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0) -
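The "must match" / "must not match" semantics described in the CLAUDE.md excerpt can be illustrated with a small stand-alone sketch. `SimpleUrlFilter` and its `match` method are hypothetical names chosen for this illustration; this is not the fess-crawler `UrlFilter` implementation. The sketch also assumes that an empty include list accepts every URL and that patterns must match the whole URL, neither of which the excerpts above confirm.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical stand-in for illustration only; NOT the fess-crawler UrlFilter.
class SimpleUrlFilter {
    private final List<Pattern> includes = new ArrayList<>();
    private final List<Pattern> excludes = new ArrayList<>();

    // Register a regular expression that a URL must match to be accepted.
    void addInclude(final String regexp) {
        includes.add(Pattern.compile(regexp));
    }

    // Register a regular expression that a URL must not match.
    void addExclude(final String regexp) {
        excludes.add(Pattern.compile(regexp));
    }

    // Accept a URL when it matches at least one include pattern
    // (assumption: an empty include list accepts everything)
    // and matches no exclude pattern. Full-string matching is assumed.
    boolean match(final String url) {
        final boolean included = includes.isEmpty()
                || includes.stream().anyMatch(p -> p.matcher(url).matches());
        final boolean excluded = excludes.stream()
                .anyMatch(p -> p.matcher(url).matches());
        return included && !excluded;
    }
}
```

With the two patterns from the excerpt registered, this sketch would accept `https://example.com/index.html`, reject `https://example.com/style.css` because of the exclude pattern, and reject URLs on other hosts because no include pattern matches them.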
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 25.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/HcHttpClient.java
final String urlValue = hostUrl + urlPattern;
crawlerContext.getUrlFilter().addInclude(urlValue);
if (logger.isInfoEnabled()) {
    logger.info("Included URL: {}", urlValue);
}
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 12:19:14 UTC 2025 - 53.7K bytes - Viewed (0)