- Sort Score
- Result 10 results
- Languages All
Results 21 - 30 of 31 for FessCrawler (0.08 sec)
-
src/main/resources/crawler/contentlength.xml
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <include path="crawler/container.xml" /> <component name="contentLengthHelper" class="org.codelibs.fess.crawler.helper.ContentLengthHelper" instance="singleton"> <property name="defaultMaxLength">10485760</property><!-- 10M --> <postConstruct name="addMaxLength">
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Sun Oct 11 06:51:14 UTC 2015 - 561 bytes - Viewed (0) -
fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots.txt
User-agent: FessCrawler Disallow: # allows all User-agent: BruteBot Disallow: / Allow: /foo/bar/ Crawl-delay: 1314000 # welcome! User-agent: Googlebot Crawl-delay: 1 User-agent: * Disallow: /private/ Disallow: /help # disallows /help.html, /help/index.html, etc. Allow: /help/faq.html Crawl-delay: 3 User-agent: Crawler Disallow: /aaa User-agent: Crawler/1.0 Disallow: /bbb
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Oct 11 02:16:55 UTC 2015 - 566 bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java
try { robotsTxt = robotsTxtHelper.parse(in); } finally { CloseableUtil.closeQuietly(in); } for (String userAgent : new String[] { "FessCrawler", "FessCrawler/1.0", "Mozilla FessCrawler" }) { assertTrue(robotsTxt.allows("/aaa", userAgent)); assertTrue(robotsTxt.allows("/private/", userAgent)); assertTrue(robotsTxt.allows("/private/index.html", userAgent));Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 20.6K bytes - Viewed (0) -
src/main/resources/crawler/transformer.xml
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <include path="crawler/transformer_basic.xml"/> <component name="fessXpathTransformer" class="org.codelibs.fess.crawler.transformer.FessXpathTransformer" instance="singleton"> <property name="name">"fessXpathTransformer"</property>
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Dec 20 13:14:54 UTC 2024 - 1.7K bytes - Viewed (0) -
src/main/resources/crawler/rule.xml
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <include path="crawler/container.xml" /> <include path="crawler/transformer.xml" /> <component name="ruleManager" class="org.codelibs.fess.crawler.rule.impl.RuleManagerImpl" instance="prototype"> <postConstruct name="addRule"> <arg>sitemapsRule</arg> </postConstruct>
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Thu Jun 04 08:42:49 UTC 2020 - 4.6K bytes - Viewed (0) -
fess-crawler-lasta/src/main/resources/crawler/client.xml
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <include path="crawler/container.xml" /> <include path="crawler/robotstxt.xml" /> <include path="crawler/contentlength.xml" /> <include path="crawler/mimetype.xml" /> <component name="internalHttpClient" class="org.codelibs.fess.crawler.client.http.HcHttpClient"
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Dec 11 08:38:29 UTC 2025 - 3.3K bytes - Viewed (0) -
fess-crawler-lasta/src/main/resources/crawler.xml
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <include path="crawler/container.xml"/> <include path="crawler/client.xml"/> <include path="crawler/rule.xml"/> <include path="crawler/filter.xml"/> <include path="crawler/interval.xml"/> <include path="crawler/extractor.xml"/> <include path="crawler/mimetype.xml"/>
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Tue Nov 28 13:40:25 UTC 2017 - 1.7K bytes - Viewed (0) -
fess-crawler-opensearch/src/main/resources/crawler_opensearch.xml
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <include path="crawler/container.xml"/> <include path="crawler/client.xml"/> <include path="crawler/rule.xml"/> <include path="crawler/filter.xml"/> <include path="crawler/interval.xml"/> <include path="crawler/extractor.xml"/>
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Nov 07 04:44:10 UTC 2024 - 2.2K bytes - Viewed (0) -
fess-crawler-lasta/src/main/resources/crawler/transformer_basic.xml
<?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <include path="crawler/container.xml" /> <component name="binaryTransformer" class="org.codelibs.fess.crawler.transformer.impl.BinaryTransformer" instance="singleton"> <property name="name">"binaryTransformer"</property> </component>
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Sep 30 21:21:24 UTC 2018 - 3.3K bytes - Viewed (0) -
README.md
```xml <!-- crawler.xml --> <?xml version="1.0" encoding="UTF-8"?> <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" "http://dbflute.org/meta/lastadi10.dtd"> <components namespace="fessCrawler"> <component name="crawler" class="org.codelibs.fess.crawler.Crawler" instance="prototype"/> <component name="httpClient" class="org.codelibs.fess.crawler.client.http.HcHttpClient" instance="singleton"/>
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0)