Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 12 for delas (0.18 sec)

  1. src/main/java/org/codelibs/fess/helper/IntervalControlHelper.java

            }
        }
    
        /**
         * Applies delay based on the configured interval rules.
         * This method calculates the appropriate delay for the current time
         * and applies it by sleeping the current thread.
         */
        public void delayByRules() {
            final long delay = getDelay();
            if (delay > 0) {
                ThreadUtil.sleep(delay);
            }
        }
    
        /**
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sun Nov 23 12:34:02 UTC 2025
    - 10K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/interval/impl/HostIntervalController.java

     * The delayBeforeProcessing method is overridden to introduce a delay before processing a URL,
     * ensuring that requests to the same host are not made too frequently.
     * The delay is calculated based on the configured delayMillisBeforeProcessing parameter.
     * If the time since the last request to the host is less than the configured delay,
     * the thread waits until the delay has elapsed.
     * This class is thread-safe.
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 5.2K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java

        }
    
        /**
         * Gets the crawl delay value for the specified user agent from robots.txt.
         * The crawl delay specifies the time (in seconds) to wait between successive requests.
         *
         * @param userAgent The user agent string to match against robots.txt directives
         * @return The crawl delay value in seconds. Returns 0 if no matching directive is found
         *         or no crawl delay is specified for the matching directive.
         */
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 18.5K bytes
    - Viewed (0)
  4. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            } finally {
                CloseableUtil.closeQuietly(in);
            }
    
            // Should not throw exception for invalid crawl-delay
            assertNotNull(robotsTxt);
            // Invalid crawl-delay should be ignored (default 0)
            assertEquals(0, robotsTxt.getCrawlDelay("TestBot"));
            // Other directives should still work
            assertFalse(robotsTxt.allows("/test/", "TestBot"));
        }
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
  5. fess-crawler/src/test/java/org/codelibs/fess/crawler/interval/impl/HostIntervalControllerTest.java

            // First access - should not delay
            final long start1 = System.nanoTime();
            controller.delayBeforeProcessing();
            final long elapsed1 = (System.nanoTime() - start1) / 1000000;
            assertTrue("First access should not delay", elapsed1 < 50);
    
            // Second access to same host - should delay
            q.setUrl("http://example.com/page2");
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 11.4K bytes
    - Viewed (0)
  6. CLAUDE.md

    ```
    
    ### CrawlerThread (`CrawlerThread.java`)
    
    Worker thread for crawling.
    
    **Flow**: Poll URL → Validate → Get client → Delay → Check last-modified → Execute → Process → Extract children → Queue children → Delay
    
    ### CrawlerClientFactory
    
    Pattern-based client selection using `LinkedHashMap<Pattern, CrawlerClient>`.
    
    **Standard Patterns**:
    ```java
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)
  7. CLAUDE.md

        runner.close();
        runner.clean();
    }
    ```
    
    ### Testing Best Practices
    
    - Minimize `Thread.sleep()` - use only when absolutely necessary
    - For timestamp tests: use minimal delays (50-100ms)
    - Clean only test-specific indices (not `_all` unless needed)
    - Use `runner.refresh()` after index operations
    - Test with realistic multilingual content
    
    ### Key Test Classes
    
    Registered: Sat Dec 20 13:04:59 UTC 2025
    - Last Modified: Mon Nov 24 03:40:05 UTC 2025
    - 8.9K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java

        /**
         * Container for managing crawler components.
         */
        @Resource
        protected CrawlerContainer crawlerContainer;
    
        /**
         * Controller for managing crawling intervals and delays.
         */
        @Resource
        protected IntervalController intervalController;
    
        /**
         * Factory for creating crawler clients.
         */
        @Resource
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 17K bytes
    - Viewed (0)
  9. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerThreadTest.java

            crawlerContext.setStatus(CrawlerStatus.RUNNING);
            crawlerContext.maxThreadCheckCount = 1; // Will exit after 1 check
    
            crawlerThread.run();
    
            verify(intervalController, times(1)).delay(IntervalController.NO_URL_IN_QUEUE);
        }
    
        /**
         * Test isContinue with active threads still running.
         */
        public void test_isContinue_withActiveThreads() throws Exception {
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 18.3K bytes
    - Viewed (0)
  10. src/main/java/org/codelibs/fess/helper/DataIndexHelper.java

                        logger.debug("Interrupted.", e);
                    }
                }
            }
        }
    
        /**
         * Sets the crawling execution interval.
         * This interval controls the delay between starting new crawler threads
         * and the frequency of status checks during crawling operations.
         *
         * @param crawlingExecutionInterval interval in milliseconds
         */
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 19K bytes
    - Viewed (0)
Back to top