- Sort Score
- Result 10 results
- Languages All
Results 1 - 10 of 13 for Directives (1.13 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java
*/ public int getCrawlDelay(final String userAgent) { final Directive directive = getMatchedDirective(userAgent); if (directive == null) { return 0; } return directive.getCrawlDelay(); } /** * Returns the most specific directive matching the given user agent. * The method finds the longest matching user agent pattern in the directives,Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 18.5K bytes - Viewed (0) -
fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots_malformed.txt
# Test file for malformed robots.txt parsing # This file contains various malformed directives that should be handled gracefully # Case 1: Directives before any User-agent (should be ignored) Disallow: /orphaned1/ Allow: /orphaned2/ # Case 2: Valid user-agent with various malformed directives User-agent: GoodBot Disallow: /admin/ InvalidDirective: some-value unknown-field: test Disallow /missing-colon
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 14 12:52:01 UTC 2025 - 2.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/RobotsTxtHelper.java
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 14 12:52:01 UTC 2025 - 11.4K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java
assertNotNull(robotsTxt); // Test that orphaned directives (before any User-agent) are ignored // These should not affect any bot assertTrue(robotsTxt.allows("/orphaned1/", "AnyBot")); assertTrue(robotsTxt.allows("/orphaned2/", "AnyBot")); // Test GoodBot - should parse valid directives and ignore invalid ones assertNotNull(robotsTxt.getDirective("goodbot"));
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 20.6K bytes - Viewed (0) -
doc/asm.html
own source code, making it easier to move the code from one location to another. </p> <h3 id="directives">Directives</h3> <p> The assembler uses various directives to bind text and data to symbol names. For example, here is a simple complete function definition. The <code>TEXT</code> directive declares the symbol <code>runtime·profileloop</code> and the instructions that follow form the body of the function.
Registered: Tue Dec 30 11:13:12 UTC 2025 - Last Modified: Fri Nov 14 19:09:46 UTC 2025 - 36.5K bytes - Viewed (0) -
doc/godebug.md
Only the work module's `go.mod` is consulted for `godebug` directives. Any directives in required dependency modules are ignored. It is an error to list a `godebug` with an unrecognized setting. (Toolchains older than Go 1.23 reject all `godebug` lines, since they do not understand `godebug` at all.) When a workspace is in use, `godebug` directives in `go.mod` files are ignored, and `go.work` will be consulted
Registered: Tue Dec 30 11:13:12 UTC 2025 - Last Modified: Wed Dec 03 00:18:09 UTC 2025 - 24.7K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/entity/RobotsTxtTest.java
Directive directive = new Directive("MyBot"); assertNotNull(directive); assertEquals("MyBot", directive.getUserAgent()); assertEquals(0, directive.getCrawlDelay()); } public void test_directiveCrawlDelay() { // Test Directive crawl delay Directive directive = new Directive("MyBot"); directive.setCrawlDelay(10);Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Nov 13 13:29:22 UTC 2025 - 14.4K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/crawler/transformer/FessXpathTransformer.java
} } /** * Processes robots meta tags in the HTML document. * Handles noindex, nofollow, and none directives. * * @param responseData the response data from crawling * @param resultData the result data to store processed information * @param document the parsed HTML document */Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Dec 12 13:58:40 UTC 2025 - 54.6K bytes - Viewed (0) -
CHANGELOG/CHANGELOG-1.35.md
Registered: Fri Dec 26 09:05:12 UTC 2025 - Last Modified: Wed Dec 17 13:01:55 UTC 2025 - 228.5K bytes - Viewed (0) -
api/go1.26.txt
pkg go/ast, func ParseDirective(token.Pos, string) (Directive, bool) #68021 pkg go/ast, method (*Directive) End() token.Pos #68021 pkg go/ast, method (*Directive) ParseArgs() ([]DirectiveArg, error) #68021 pkg go/ast, method (*Directive) Pos() token.Pos #68021 pkg go/ast, type BasicLit struct, ValueEnd token.Pos #76031 pkg go/ast, type Directive struct #68021 pkg go/ast, type Directive struct, Args string #68021
Registered: Tue Dec 30 11:13:12 UTC 2025 - Last Modified: Thu Dec 11 19:57:52 UTC 2025 - 11.5K bytes - Viewed (0)