Results 1 - 2 of 2 for XmlExtractor (0.25 seconds)
fess-crawler-lasta/src/main/resources/crawler/extractor.xml
</component>
</property>
<postConstruct name="addMetadata">
  <arg>"title"</arg>
  <arg>"//TITLE"</arg>
</postConstruct>
</component>
<component name="xmlExtractor" class="org.codelibs.fess.crawler.extractor.impl.XmlExtractor" />
<component name="htmlXpathExtractor" class="org.codelibs.fess.crawler.extractor.impl.HtmlXpathExtractor">
<postConstruct name="addFeature">
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Wed Feb 11 01:15:55 GMT 2026 - 50.4K bytes
CLAUDE.md
### Key Extractors
`TikaExtractor`, `PdfExtractor`, `MsWordExtractor`, `MsExcelExtractor`, `MsPowerPointExtractor`, `ZipExtractor`, `HtmlExtractor`, `MarkdownExtractor`, `EmlExtractor`

### Helpers
- **RobotsTxtHelper**: RFC 9309 parsing, user-agent matching, crawl-delay, sitemaps
- **SitemapsHelper**: Sitemap XML parsing, index handling
- **MimeTypeHelper**: MIME detection via Tika
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Thu Mar 12 03:39:20 GMT 2026 - 8.1K bytes