Results 1 - 2 of 2 for msWordExtractor (0.32 seconds)
CLAUDE.md
- **Transformer**: `HtmlTransformer`, `XmlTransformer`, `FileTransformer`, etc.
- **Extractor**: Weight-based selection (tries in descending weight order)

### Key Extractors

`TikaExtractor`, `PdfExtractor`, `MsWordExtractor`, `MsExcelExtractor`, `MsPowerPointExtractor`, `ZipExtractor`, `HtmlExtractor`, `MarkdownExtractor`, `EmlExtractor`

### Helpers

- **RobotsTxtHelper**: RFC 9309 parsing, user-agent matching, crawl-delay, sitemaps
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Thu Mar 12 03:39:20 GMT 2026 - 8.1K bytes - Click Count (0)
fess-crawler-lasta/src/main/resources/crawler/extractor.xml
    <property name="maxCompressionRatio">1</property>
    <property name="maxUncompressionSize">10000000</property>
</component>
<component name="msWordExtractor" class="org.codelibs.fess.crawler.extractor.impl.MsWordExtractor" />
<component name="msExcelExtractor" class="org.codelibs.fess.crawler.extractor.impl.MsExcelExtractor" />
<component name="msPowerPointExtractor"
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Wed Feb 11 01:15:55 GMT 2026 - 50.4K bytes - Click Count (0)
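
The CLAUDE.md snippet describes extractor selection as weight-based, trying candidates in descending weight order. The sketch below illustrates that idea only; it is not the fess-crawler API. The class `WeightedExtractorSketch`, the `Candidate` type, and the convention that a candidate returns `null` or throws when it cannot handle the input are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;
import java.util.function.Function;

public class WeightedExtractorSketch {

    /** Hypothetical candidate: a label, a weight, and a function that returns null when it cannot handle the input. */
    static final class Candidate {
        final String name;
        final int weight;
        final Function<String, String> extract;

        Candidate(String name, int weight, Function<String, String> extract) {
            this.name = name;
            this.weight = weight;
            this.extract = extract;
        }
    }

    /** Tries candidates from highest to lowest weight and returns the first non-null result. */
    static Optional<String> extractFirst(List<Candidate> candidates, String input) {
        // Copy first so the caller's list is not reordered.
        List<Candidate> sorted = new ArrayList<>(candidates);
        // Descending weight: the heaviest candidate is tried first.
        sorted.sort((a, b) -> Integer.compare(b.weight, a.weight));
        for (Candidate c : sorted) {
            try {
                String text = c.extract.apply(input);
                if (text != null) {
                    return Optional.of(text);
                }
            } catch (RuntimeException e) {
                // Assumption: a failure means "cannot handle"; fall through to the next candidate.
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        List<Candidate> candidates = List.of(
                new Candidate("fallback", 1, s -> "fallback:" + s),
                new Candidate("picky", 10, s -> s.endsWith(".docx") ? "picky:" + s : null));
        System.out.println(extractFirst(candidates, "report.docx").orElse("none")); // picky:report.docx
        System.out.println(extractFirst(candidates, "notes.txt").orElse("none"));   // fallback:notes.txt
    }
}
```

Sorting a copy on each call keeps the sketch short; a real registry would more likely keep its candidates pre-sorted and re-sort only when one is added.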