Results 1 - 10 of 49 for aient (0.68 sec)
src/main/resources/fess_indices/fess/fr/stopwords.txt
aurons aurez auront aurais aurait aurions auriez auraient avais avait avions aviez avaient eut eûmes eûtes eurent aie aies ait ayons ayez aient eusse eusses eût eussions eussiez eussent ceci cela celà cet cette ici ils les leurs quel quels quelle quelles sans
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Mon Nov 27 12:59:36 UTC 2023 - 977 bytes - Viewed (0) -
src/main/resources/fess_indices/_aws/fess.json
}, "french_keywords": {
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Sat Jun 14 00:36:40 UTC 2025 - 117.3K bytes - Viewed (0) -
src/main/resources/fess_indices/_cloud/fess.json
}, "french_keywords": {
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Sat Feb 27 09:26:16 UTC 2021 - 117.3K bytes - Viewed (0) -
fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots.txt
User-agent: FessCrawler Disallow: # allows all User-agent: BruteBot Disallow: / Allow: /foo/bar/ Crawl-delay: 1314000 # welcome! User-agent: Googlebot Crawl-delay: 1 User-agent: * Disallow: /private/ Disallow: /help # disallows /help.html, /help/index.html, etc. Allow: /help/faq.html Crawl-delay: 3 User-agent: Crawler Disallow: /aaa User-agent: Crawler/1.0 Disallow: /bbb
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Oct 11 02:16:55 UTC 2015 - 566 bytes - Viewed (0) -
fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots_wildcard.txt
User-agent: PriorityBot Disallow: /store Allow: /store/public Disallow: /store/public/sale # Test Allow vs Disallow with same length (Allow wins) User-agent: SameLengthBot Disallow: /page Allow: /page # Test multiple wildcards User-agent: MultiWildcardBot Disallow: /*.cgi* Disallow: /*?*id=* # Test literal $ in middle of pattern User-agent: DollarBot Disallow: /price$info
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Nov 13 14:03:41 UTC 2025 - 910 bytes - Viewed (0) -
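The fixture above notes that when an Allow and a Disallow rule of the same length both match, Allow wins. The following is a minimal sketch of that precedence rule, assuming plain prefix matching only (no `*` or `$` handling). The class `RulePrecedenceSketch`, its nested `Rule` type, and `isAllowed` are hypothetical names for illustration and are not taken from the Fess crawler code.

```java
import java.util.List;

class RulePrecedenceSketch {

    /** One Allow or Disallow line; the path is treated as a plain prefix (assumption). */
    static final class Rule {
        final boolean allow;
        final String path;

        Rule(boolean allow, String path) {
            this.allow = allow;
            this.path = path;
        }
    }

    /** Longest matching prefix decides; on equal length an Allow rule wins. */
    static boolean isAllowed(List<Rule> rules, String urlPath) {
        Rule best = null;
        for (Rule rule : rules) {
            if (!urlPath.startsWith(rule.path)) {
                continue; // rule does not apply to this path
            }
            boolean longer = best == null || rule.path.length() > best.path.length();
            boolean tieAllow = best != null && rule.path.length() == best.path.length() && rule.allow;
            if (longer || tieAllow) {
                best = rule;
            }
        }
        // No matching rule means the path is not restricted.
        return best == null || best.allow;
    }

    public static void main(String[] args) {
        // Rules copied from the PriorityBot group in the fixture.
        List<Rule> priorityBot = List.of(
                new Rule(false, "/store"),
                new Rule(true, "/store/public"),
                new Rule(false, "/store/public/sale"));
        System.out.println(isAllowed(priorityBot, "/store/public/item"));
        System.out.println(isAllowed(priorityBot, "/store/public/sale/today"));
    }
}
```

With the SameLengthBot rules from the fixture (`Disallow: /page` and `Allow: /page`), this sketch treats `/page` as allowed, which matches the comment in the test file.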
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java
* <li>Manages crawl delay settings per user agent</li> * <li>Stores sitemap URLs listed in robots.txt</li> * </ul> * * <p>The class uses case-insensitive pattern matching for user agents and supports * wildcard characters (*) in user agent strings. When multiple directives match a user agent, * the most specific (longest) match is used.</p> * */ public class RobotsTxt {
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 18.5K bytes - Viewed (0) -
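The Javadoc excerpt above describes case-insensitive user-agent matching with `*` wildcards, where the longest matching entry is used. A minimal sketch of that selection rule follows. It assumes the pattern must match the whole agent name and that "most specific" means the longest pattern string. `UserAgentMatchSketch`, `toPattern`, and `selectPattern` are hypothetical names and do not reflect the actual `RobotsTxt` implementation.

```java
import java.util.List;
import java.util.regex.Pattern;

class UserAgentMatchSketch {

    /** Turns a user-agent entry with '*' wildcards into a case-insensitive regex. */
    static Pattern toPattern(String entry) {
        String[] parts = entry.split("\\*", -1);
        StringBuilder regex = new StringBuilder();
        for (int i = 0; i < parts.length; i++) {
            if (i > 0) {
                regex.append(".*"); // each '*' matches any run of characters
            }
            regex.append(Pattern.quote(parts[i])); // literal text is escaped
        }
        return Pattern.compile(regex.toString(), Pattern.CASE_INSENSITIVE);
    }

    /** Returns the longest entry that matches the agent name, or null if none match. */
    static String selectPattern(List<String> entries, String agentName) {
        String best = null;
        for (String entry : entries) {
            if (!toPattern(entry).matcher(agentName).matches()) {
                continue;
            }
            if (best == null || entry.length() > best.length()) {
                best = entry;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        List<String> entries = List.of("*", "Crawler", "Crawler/1.0");
        System.out.println(selectPattern(entries, "crawler/1.0"));
        System.out.println(selectPattern(entries, "SomeOtherBot"));
    }
}
```

Under these assumptions the catch-all `*` entry only applies when no longer entry matches, which is the behavior the Javadoc describes.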
fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots_malformed.txt
NoValue: ::: : # Case 13: Numeric crawl-delay edge cases User-agent: NumericBot Crawl-delay: 0 Crawl-delay: 999999999 Crawl-delay: 1.23e10 # Case 14: Tab characters instead of spaces User-agent: TabBot Disallow: /tab1/ Allow: /tab2/ # Case 15: Unicode and special characters in user-agent User-agent: Bot™ Disallow: /trademark/ User-agent: Bot® Disallow: /registered/
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 14 12:52:01 UTC 2025 - 2.6K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/helper/UserAgentHelper.java
} /** The HTTP header name for User-Agent */ protected static final String USER_AGENT = "user-agent"; /** The request attribute key for storing cached user agent type */ protected static final String USER_AGENT_TYPE = "ViewHelper.UserAgent"; /** * Determines the user agent type from the current HTTP request. * The method analyzes the User-Agent header to categorize the browser type
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Thu Jul 17 08:28:31 UTC 2025 - 3.5K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java
RobotsTxt robotsTxt; final InputStream in = new java.io.ByteArrayInputStream(robotsTxtContent.getBytes()); try { robotsTxt = robotsTxtHelper.parse(in); } finally { CloseableUtil.closeQuietly(in); } // Directives before User-agent should be ignored
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 20.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/RobotsTxtHelper.java
* </ul> * * @author bowez * @author shinsuke * */ public class RobotsTxtHelper { /** Pattern for parsing user-agent records. */ protected static final Pattern USER_AGENT_RECORD = Pattern.compile("^user-agent:\\s*([^\\t\\n\\x0B\\f\\r]+)\\s*$", Pattern.CASE_INSENSITIVE); /** Pattern for parsing disallow records. */
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 14 12:52:01 UTC 2025 - 11.4K bytes - Viewed (0)
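To illustrate how the `USER_AGENT_RECORD` expression in the snippet above behaves, here is a small standalone sketch. The regular expression is copied from the snippet. The wrapper class `UserAgentRecordSketch`, the method `parseUserAgent`, and the `trim()` call are assumptions added for illustration and may differ from what `RobotsTxtHelper` actually does with the captured value.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class UserAgentRecordSketch {

    // Same expression as the USER_AGENT_RECORD constant shown in the snippet.
    static final Pattern USER_AGENT_RECORD =
            Pattern.compile("^user-agent:\\s*([^\\t\\n\\x0B\\f\\r]+)\\s*$", Pattern.CASE_INSENSITIVE);

    /** Returns the user-agent value of a record line, or null if the line is not such a record. */
    static String parseUserAgent(String line) {
        Matcher matcher = USER_AGENT_RECORD.matcher(line);
        if (!matcher.find()) {
            return null;
        }
        // The capture group excludes tabs and line breaks but not spaces,
        // so trailing spaces can end up in the group; trim() is this sketch's choice.
        return matcher.group(1).trim();
    }

    public static void main(String[] args) {
        System.out.println(parseUserAgent("User-agent: FessCrawler"));
        System.out.println(parseUserAgent("USER-AGENT:   Googlebot  "));
        System.out.println(parseUserAgent("Disallow: /private/"));
    }
}
```

Because of `Pattern.CASE_INSENSITIVE`, the field name matches regardless of capitalization, while lines for other directives do not match at all.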