Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 4 of 4 for finishing (0.07 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/impl/LogHelperImpl.java

     * It uses Log4j2 for logging.
     *
     * <p>
     * The class contains methods for logging different types of events, such as:
     * </p>
     * <ul>
     *   <li>Starting and finishing threads</li>
     *   <li>Starting and cleaning up crawling</li>
     *   <li>Handling unsupported URLs</li>
     *   <li>Checking last modified dates</li>
     *   <li>Getting content</li>
     *   <li>Handling redirects</li>
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 02:01:26 UTC 2025
    - 14K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            // Disallow: /fish$ - should block exactly /fish but not /fishing
            assertFalse(robotsTxt.allows("/fish", "EndPathBot"));
            assertTrue(robotsTxt.allows("/fishing", "EndPathBot"));
            assertTrue(robotsTxt.allows("/fish/", "EndPathBot"));
    
            // Disallow: /temp$ but Allow: /fishing
            assertFalse(robotsTxt.allows("/temp", "EndPathBot"));
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
  3. fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots_wildcard.txt

    User-agent: WildcardBot
    Disallow: /*.pdf$
    Disallow: /admin/*.php
    Disallow: /*/private/
    Allow: /public/*.html
    
    # Test end-of-path ($) patterns
    User-agent: EndPathBot
    Disallow: /fish$
    Disallow: /temp$
    Allow: /fishing
    
    # Test complex patterns
    User-agent: ComplexBot
    Disallow: /
    Allow: /$
    Allow: /index.html$
    Allow: /public/
    
    # Test priority rules (longer match wins)
    User-agent: PriorityBot
    Disallow: /store
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 14:03:41 UTC 2025
    - 910 bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java

                // If pattern doesn't end with $, it implicitly matches anything after it
                if (pattern.isEmpty() || pattern.charAt(pattern.length() - 1) != '$') {
                    // This makes "/fish" match "/fish", "/fishing", "/fish/", etc.
                    // No need to add anything - the pattern naturally continues
                }
    
                return Pattern.compile(regex.toString());
            }
    
            /**
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 18.5K bytes
    - Viewed (0)
Back to top