Results 1 - 8 of 8 for addInclude (0.04 sec)

  1. fess-crawler/src/test/java/org/codelibs/fess/crawler/filter/UrlFilterTest.java

            // Test empty pattern
            urlFilter.addInclude("");
            urlFilter.addExclude("");
    
            // Test single character pattern
            urlFilter.addInclude(".");
            urlFilter.addExclude("*");
    
            // Test patterns with only special characters
            urlFilter.addInclude("^$");
            urlFilter.addExclude(".*");
    
            // Should handle boundary conditions gracefully
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Wed Sep 03 14:42:53 UTC 2025
    - 19K bytes
  2. fess-crawler-lasta/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java

            crawler.crawlerContext.setMaxAccessCount(maxCount);
            crawler.crawlerContext.setNumOfThread(numOfThread);
            crawler.urlFilter.addInclude(url + ".*");
            crawler.urlFilter.addExclude(url + "/dir1/.*");
            final String sessionId = crawler.execute();
            assertEquals(maxCount, dataService.getCount(sessionId));
            dataService.delete(sessionId);
        }
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 12.8K bytes
  3. impl/maven-core/src/test/java/org/apache/maven/project/ResourceIncludeTest.java

        }
    
        @Test
        void testAddMultipleIncludes() {
            Resource resource = project.getResources().get(0);
    
            // Add multiple includes
            resource.addInclude("*.xml");
            resource.addInclude("*.properties");
    
            // Verify both includes are present
            assertEquals(2, resource.getIncludes().size(), "Should have two includes");
    Registered: Sun Dec 28 03:35:09 UTC 2025
    - Last Modified: Fri Nov 07 13:11:07 UTC 2025
    - 12.6K bytes
  4. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java

            crawler.crawlerContext.setMaxAccessCount(maxCount);
            crawler.crawlerContext.setNumOfThread(numOfThread);
            crawler.urlFilter.addInclude(url + ".*");
            crawler.urlFilter.addExclude(url + "/dir1/.*");
            final String sessionId = crawler.execute();
            assertEquals(maxCount, dataService.getCount(sessionId));
            dataService.delete(sessionId);
        }
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Tue Nov 11 13:40:14 UTC 2025
    - 25.8K bytes
  5. README.md

    ```
    
    ### URL Filtering
    
    ```java
    // Include patterns
    crawler.urlFilter.addInclude("https://example.com/.*");
    crawler.urlFilter.addInclude(".*\\.pdf$");
    
    // Exclude patterns  
    crawler.urlFilter.addExclude(".*\\.js$");
    crawler.urlFilter.addExclude(".*login.*");
    ```
    
    ## Supported Protocols and Formats
    
    ### Protocols
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java

         * @param regexp The regular expression for the include filter.
         */
        public void addIncludeFilter(final String regexp) {
            if (StringUtil.isNotBlank(regexp)) {
                urlFilter.addInclude(regexp);
            }
        }
    
        /**
         * Adds an exclude filter for URLs.
         * URLs matching this regular expression will not be crawled.
         * @param regexp The regular expression for the exclude filter.
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 17K bytes
  7. CLAUDE.md

    3. **Add test with sample file** in `src/test/resources/`
    
    ### Configuring URL Filtering
    
    ```java
    // Include patterns (must match)
    crawler.urlFilter.addInclude("https://example.com/.*");
    
    // Exclude patterns (must not match)
    crawler.urlFilter.addExclude(".*\\.(css|js|png|jpg)$");
    ```
    
    ### Setting Crawl Limits
    
    ```java
    context.setMaxAccessCount(1000);  // Max URLs (0 = unlimited)
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
  8. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java

            @Override
            public void init(String sessionId) {
            }
    
            @Override
            public void addInclude(String urlPattern) {
            }
    
            @Override
            public void addExclude(String urlPattern) {
            }
    
            @Override
            public boolean match(String url) {
                return true;
            }
    
            @Override
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 25.6K bytes
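
The README.md and CLAUDE.md excerpts above describe include patterns as "must match" and exclude patterns as "must not match". A minimal sketch of that rule in plain Java follows. It assumes full-string regex matching and that an empty include list admits every URL; `SimpleUrlFilter` is a hypothetical class written for illustration, not the fess-crawler `UrlFilter` implementation, whose actual behavior may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration only; NOT the fess-crawler UrlFilter implementation.
class SimpleUrlFilter {
    private final List<Pattern> includes = new ArrayList<>();
    private final List<Pattern> excludes = new ArrayList<>();

    // Pattern.compile throws PatternSyntaxException for invalid regexes such as "*".
    void addInclude(String regex) {
        includes.add(Pattern.compile(regex));
    }

    void addExclude(String regex) {
        excludes.add(Pattern.compile(regex));
    }

    // Assumed semantics: when includes exist, the URL must fully match at least
    // one of them, and it must not fully match any exclude.
    boolean match(String url) {
        boolean included = includes.isEmpty()
                || includes.stream().anyMatch(p -> p.matcher(url).matches());
        boolean excluded = excludes.stream().anyMatch(p -> p.matcher(url).matches());
        return included && !excluded;
    }

    public static void main(String[] args) {
        SimpleUrlFilter filter = new SimpleUrlFilter();
        filter.addInclude("https://example.com/.*");
        filter.addExclude(".*\\.(css|js|png|jpg)$");

        System.out.println(filter.match("https://example.com/docs/index.html")); // true
        System.out.println(filter.match("https://example.com/static/app.js"));   // false
        System.out.println(filter.match("https://other.org/index.html"));        // false
    }
}
```

Checking excludes after includes means an exclude always wins when a URL matches both lists, which is the reading suggested by the `url + ".*"` include paired with the `url + "/dir1/.*"` exclude in the CrawlerTest excerpts.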