Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 2 of 2 for NumericBot (3.77 sec)

  1. fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots_malformed.txt

    Sitemap: not-a-valid-url
    
    # Case 12: Malformed lines that should be completely ignored
    This line is completely invalid
    :NoKey
    NoValue:
    :::
       :
    
    # Case 13: Numeric crawl-delay edge cases
    User-agent: NumericBot
    Crawl-delay: 0
    Crawl-delay: 999999999
    Crawl-delay: 1.23e10
    
    # Case 14: Tab characters instead of spaces
    User-agent:	TabBot
    Disallow:	/tab1/
    Allow:	/tab2/
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 14 12:52:01 UTC 2025
    - 2.6K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            String[] sitemaps = robotsTxt.getSitemaps();
            assertTrue(sitemaps.length >= 3); // At least the valid ones should be parsed
    
            // Test NumericBot - various crawl-delay formats
            // Should handle edge cases gracefully
            assertTrue(robotsTxt.getCrawlDelay("NumericBot") >= 0);
    
            // Test TabBot - tab characters should be treated as whitespace
            assertFalse(robotsTxt.allows("/tab1/", "TabBot"));
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
Back to top