- Sort Score
- Result 10 results
- Languages All
Results 1 - 6 of 6 for ZipExtractor (0.07 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/ZipExtractor.java
import org.codelibs.fess.crawler.util.IgnoreCloseInputStream; import jakarta.annotation.Resource; /** * Extracts text content from ZIP archives. */ public class ZipExtractor extends AbstractExtractor { private static final Logger logger = LogManager.getLogger(ZipExtractor.class); /** * The archive stream factory. */ @Resource protected ArchiveStreamFactory archiveStreamFactory; /**Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Dec 11 08:38:29 UTC 2025 - 4.8K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/ZipExtractorTest.java
.singleton("zipExtractor", ZipExtractor.class) .<ExtractorFactory> singleton("extractorFactory", ExtractorFactory.class, factory -> { TikaExtractor tikaExtractor = container.getComponent("tikaExtractor"); ZipExtractor zipExtractor = container.getComponent("zipExtractor"); factory.addExtractor("text/plain", tikaExtractor);
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 3.7K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/ArchiveExtractorErrorHandlingTest.java
final ZipExtractor zipExtractor = container.getComponent("zipExtractor"); final TarExtractor tarExtractor = container.getComponent("tarExtractor"); factory.addExtractor("text/plain", textExtractor); factory.addExtractor("text/html", tikaExtractor); factory.addExtractor("application/zip", zipExtractor);
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 12.6K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/EmlExtractorTest.java
container.singleton("mimeTypeHelper", MimeTypeHelperImpl.class) .singleton("tikaExtractor", TikaExtractor.class) .singleton("zipExtractor", ZipExtractor.class) .<ExtractorFactory> singleton("extractorFactory", ExtractorFactory.class, factory -> { TikaExtractor tikaExtractor = container.getComponent("tikaExtractor");
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 4.6K bytes - Viewed (1) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/EXTRACTOR_TESTS_README.md
**Key Test Areas**: - Enhanced error messages - Partial extraction (continues on failure) - Invalid archive handling - Empty archive handling - Mixed valid/invalid entries **Covered Extractors**: - ZipExtractor - TarExtractor **Test Count**: 12 tests **Key Scenarios**: - ✅ Descriptive error messages for invalid archives - ✅ Continues processing when some entries fail - ✅ Handles empty archives gracefully
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Wed Nov 19 08:55:01 UTC 2025 - 5.7K bytes - Viewed (0) -
CLAUDE.md
**Extractor**: Weight-based selection (tries in descending weight order) ### Key Extractors `TikaExtractor` (1000+ formats), `PdfExtractor`, `MsWordExtractor`, `MsExcelExtractor`, `MsPowerPointExtractor`, `ZipExtractor`, `HtmlExtractor`, etc. **Registration**: ```java extractorFactory.addExtractor("text/html", htmlExtractor, 2); // Weight 2 extractorFactory.addExtractor("text/html", tikaExtractor, 1); // Fallback ```
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0)