Results 1 - 8 of 8 for addExclude (0.04 sec)
fess-crawler/src/test/java/org/codelibs/fess/crawler/filter/UrlFilterTest.java
// Test empty pattern
urlFilter.addInclude("");
urlFilter.addExclude("");

// Test single character pattern
urlFilter.addInclude(".");
urlFilter.addExclude("*");

// Test patterns with only special characters
urlFilter.addInclude("^$");
urlFilter.addExclude(".*");

// Should handle boundary conditions gracefully
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Wed Sep 03 14:42:53 UTC 2025 - 19K bytes - Viewed (0) -
README.md
### URL Filtering
```java
// Include patterns
crawler.urlFilter.addInclude("https://example.com/.*");
crawler.urlFilter.addInclude(".*\\.pdf$");

// Exclude patterns
crawler.urlFilter.addExclude(".*\\.js$");
crawler.urlFilter.addExclude(".*login.*");
```
## Supported Protocols and Formats
### Protocols
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler-lasta/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
crawler.crawlerContext.setMaxAccessCount(maxCount);
crawler.crawlerContext.setNumOfThread(numOfThread);
crawler.urlFilter.addInclude(url + ".*");
crawler.urlFilter.addExclude(url + "/dir1/.*");
final String sessionId = crawler.execute();
assertEquals(maxCount, dataService.getCount(sessionId));
dataService.delete(sessionId);
}
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 12.8K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java
*/
public void addExcludeFilter(final String regexp) {
    if (StringUtil.isNotBlank(regexp)) {
        urlFilter.addExclude(regexp);
    }
}

/**
 * Stops the crawling process.
 * Sets the crawler status to DONE and interrupts all crawler threads.
 */
public void stop() {
    if (logger.isInfoEnabled()) {
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 17K bytes - Viewed (0) -
CLAUDE.md
3. **Add test with sample file** in `src/test/resources/`

### Configuring URL Filtering
```java
// Include patterns (must match)
crawler.urlFilter.addInclude("https://example.com/.*");

// Exclude patterns (must not match)
crawler.urlFilter.addExclude(".*\\.(css|js|png|jpg)$");
```

### Setting Crawl Limits
```java
context.setMaxAccessCount(1000); // Max URLs (0 = unlimited)
```
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0) -
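The "must match" / "must not match" comments in the CLAUDE.md excerpt can be illustrated with a minimal, self-contained sketch. This is not the fess-crawler `UrlFilter` implementation: the class `SimpleUrlFilter`, its `match` method, the use of full-string matching via `Matcher.matches()`, and the rule that an empty include list accepts everything are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical stand-in for a URL filter; NOT the fess-crawler UrlFilter.
class SimpleUrlFilter {
    private final List<Pattern> includes = new ArrayList<>();
    private final List<Pattern> excludes = new ArrayList<>();

    void addInclude(String regex) {
        includes.add(Pattern.compile(regex));
    }

    void addExclude(String regex) {
        excludes.add(Pattern.compile(regex));
    }

    // Assumed semantics: a URL passes if it fully matches at least one
    // include pattern (or no include patterns are registered) and fully
    // matches none of the exclude patterns.
    boolean match(String url) {
        boolean included = includes.isEmpty();
        for (Pattern p : includes) {
            if (p.matcher(url).matches()) {
                included = true;
                break;
            }
        }
        if (!included) {
            return false;
        }
        for (Pattern p : excludes) {
            if (p.matcher(url).matches()) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        SimpleUrlFilter filter = new SimpleUrlFilter();
        filter.addInclude("https://example.com/.*");
        filter.addExclude(".*\\.(css|js|png|jpg)$");

        System.out.println(filter.match("https://example.com/index.html")); // true
        System.out.println(filter.match("https://example.com/app.js"));     // false
        System.out.println(filter.match("https://other.org/index.html"));   // false
    }
}
```

Exclusion is checked after inclusion here, so a URL that matches both an include and an exclude pattern is rejected. Whether the real library orders or combines the checks this way would need to be confirmed against its source.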
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
crawler.crawlerContext.setMaxAccessCount(maxCount);
crawler.crawlerContext.setNumOfThread(numOfThread);
crawler.urlFilter.addInclude(url + ".*");
crawler.urlFilter.addExclude(url + "/dir1/.*");
final String sessionId = crawler.execute();
assertEquals(maxCount, dataService.getCount(sessionId));
dataService.delete(sessionId);
}
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Tue Nov 11 13:40:14 UTC 2025 - 25.8K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 25.6K bytes - Viewed (0) -
impl/maven-core/src/test/java/org/apache/maven/project/ResourceIncludeTest.java
}

@Test
void testAddMultipleIncludes() {
    Resource resource = project.getResources().get(0);

    // Add multiple includes
    resource.addInclude("*.xml");
    resource.addInclude("*.properties");

    // Verify both includes are present
    assertEquals(2, resource.getIncludes().size(), "Should have two includes");
Registered: Sun Dec 28 03:35:09 UTC 2025 - Last Modified: Fri Nov 07 13:11:07 UTC 2025 - 12.6K bytes - Viewed (0)