Results 1 - 4 of 4 for sitemapsRule (0.06 seconds)
src/main/resources/crawler/rule.xml
<arg>fsFileRule</arg>
</postConstruct>
<postConstruct name="addRule">
  <arg>defaultRule</arg>
</postConstruct>
</component>
<component name="sitemapsRule" class="org.codelibs.fess.crawler.rule.impl.SitemapsRule" >
  <property name="ruleId">"sitemapsRule"</property>
  <property name="responseProcessor">
    <component class="org.codelibs.fess.crawler.processor.impl.SitemapsResponseProcessor">
    </component>
  </property>
Created: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Sun Mar 29 08:21:02 GMT 2026 - 4.6K bytes
CLAUDE.md
[Diagram fragment: UrlQueueService, DataService]
- **Rule**: Pattern-based response routing (`RegexRule`, `SitemapsRule`)
- **ResponseProcessor**: `DefaultResponseProcessor`, `SitemapsResponseProcessor`, `NullResponseProcessor`
- **Transformer**: `HtmlTransformer`, `XmlTransformer`, `FileTransformer`, etc.
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Thu Mar 12 03:39:20 GMT 2026 - 8.1K bytes
README.md
controller.setDelayMillisForWaitingNewUrl(5000); controller.setDefaultIntervalTime(1000); }); ```

### Sitemap Support

```java
// Enable sitemap processing
container.singleton("sitemapsRule", SitemapsRule.class, rule -> {
    rule.addRule("url", ".*sitemap.*");
});

// Add sitemap URL
crawler.addUrl("https://example.com/sitemap.xml");
```

## Data Access and Storage
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Sun Aug 31 05:32:52 GMT 2025 - 15.3K bytes
fess-crawler-lasta/src/test/java/org/codelibs/fess/crawler/util/CrawlerWebServer.java
buf.append("</url>").append('\n');
buf.append("</urlset>").append('\n');
File sitemapsFile = new File(tempDir, "sitemaps.xml");
FileUtil.writeBytes(sitemapsFile.getAbsolutePath(), buf.toString().getBytes("UTF-8"));
robotTxtFile.deleteOnExit();
// sitemaps.txt
buf = new StringBuilder();
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Thu Jan 15 01:11:43 GMT 2026 - 8.1K bytes