Results 11 - 14 of 14 for addInclude (0.05 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java

         * @param regexp The regular expression for the include filter.
         */
        public void addIncludeFilter(final String regexp) {
            if (StringUtil.isNotBlank(regexp)) {
                urlFilter.addInclude(regexp);
            }
        }
    
        /**
         * Adds an exclude filter for URLs.
         * URLs matching this regular expression will not be crawled.
         * @param regexp The regular expression for the exclude filter.
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 17K bytes
    - Viewed (0)
  2. CLAUDE.md

    3. **Add test with sample file** in `src/test/resources/`
    
    ### Configuring URL Filtering
    
    ```java
    // Include patterns (must match)
    crawler.urlFilter.addInclude("https://example.com/.*");
    
    // Exclude patterns (must not match)
    crawler.urlFilter.addExclude(".*\\.(css|js|png|jpg)$");
    ```
    
    ### Setting Crawl Limits
    
    ```java
    context.setMaxAccessCount(1000);  // Max URLs (0 = unlimited)
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)
  3. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java

            @Override
            public void init(String sessionId) {
            }
    
            @Override
            public void addInclude(String urlPattern) {
            }
    
            @Override
            public void addExclude(String urlPattern) {
            }
    
            @Override
            public boolean match(String url) {
                return true;
            }
    
            @Override
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 25.6K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/HcHttpClient.java

                                            final String urlValue = hostUrl + urlPattern;
                                            crawlerContext.getUrlFilter().addInclude(urlValue);
                                            if (logger.isInfoEnabled()) {
                                                logger.info("Included URL: {}", urlValue);
                                            }
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 12:19:14 UTC 2025
    - 53.7K bytes
    - Viewed (0)
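
The CLAUDE.md excerpt in result 2 describes include patterns as "must match" and exclude patterns as "must not match". The sketch below illustrates one way those semantics could be modeled with `java.util.regex`. `SimpleUrlFilter` is a hypothetical stand-in written for this example, not the fess-crawler `UrlFilter` implementation. Two behaviors are assumptions that the excerpts do not confirm: patterns are tested against the whole URL, and an empty include list accepts every URL.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration only; the real fess-crawler filter may behave differently.
final class SimpleUrlFilter {
    private final List<Pattern> includes = new ArrayList<>();
    private final List<Pattern> excludes = new ArrayList<>();

    void addInclude(String urlPattern) {
        includes.add(Pattern.compile(urlPattern));
    }

    void addExclude(String urlPattern) {
        excludes.add(Pattern.compile(urlPattern));
    }

    boolean match(String url) {
        // Assumption: with no include patterns registered, every URL passes this check.
        boolean included = includes.isEmpty();
        for (Pattern p : includes) {
            // Assumption: full-string match, not a substring search.
            if (p.matcher(url).matches()) {
                included = true;
                break;
            }
        }
        if (!included) {
            return false;
        }
        // A URL that matches any exclude pattern is rejected.
        for (Pattern p : excludes) {
            if (p.matcher(url).matches()) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        SimpleUrlFilter filter = new SimpleUrlFilter();
        filter.addInclude("https://example.com/.*");
        filter.addExclude(".*\\.(css|js|png|jpg)$");

        System.out.println(filter.match("https://example.com/index.html")); // true
        System.out.println(filter.match("https://example.com/style.css"));  // false
        System.out.println(filter.match("https://other.org/index.html"));   // false
    }
}
```

The exclude check runs after the include check, so a URL under `https://example.com/` is still rejected when it ends in one of the excluded extensions.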