- Sort Score
- Result 10 results
- Languages All
Results 1 - 3 of 3 for pdfExtractor (0.04 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/PdfExtractor.java
* <li>Configurable timeout for extraction process</li> * </ul> * * @author shinsuke */ public class PdfExtractor extends PasswordBasedExtractor { /** Logger instance for this class. */ private static final Logger logger = LogManager.getLogger(PdfExtractor.class); /** Timeout for PDF extraction in milliseconds (default: 30 seconds). */ protected long timeout = 30000; // 30sec
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 12:19:14 UTC 2025 - 12.8K bytes - Viewed (0) -
CLAUDE.md
**Transformer**: `HtmlTransformer`, `XmlTransformer`, `FileTransformer`, etc. **Extractor**: Weight-based selection (tries in descending weight order) ### Key Extractors `TikaExtractor` (1000+ formats), `PdfExtractor`, `MsWordExtractor`, `MsExcelExtractor`, `MsPowerPointExtractor`, `ZipExtractor`, `HtmlExtractor`, etc. **Registration**: ```java extractorFactory.addExtractor("text/html", htmlExtractor, 2); // Weight 2
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/JodExtractor.java
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 12:19:14 UTC 2025 - 10.4K bytes - Viewed (0)