- Sort Score
- Result 10 results
- Languages All
Results 11 - 20 of 37 for USING (0.02 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractor.java
import org.xml.sax.SAXException; import jakarta.annotation.PostConstruct; /** * <p> * The {@link TikaExtractor} class is responsible for extracting text content and metadata from various file formats * using the Apache Tika library. It extends {@link PasswordBasedExtractor} to handle password-protected files. * </p> * * <p> * This class provides methods to extract text from an input stream, handling different scenarios such as:Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 30.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/JodExtractor.java
import org.jodconverter.local.LocalConverter; import jakarta.annotation.PostConstruct; import jakarta.annotation.PreDestroy; /** * Extracts text content from various document formats using JODConverter. */ public class JodExtractor extends AbstractExtractor { /** Logger for this class. */ private static final Logger logger = LogManager.getLogger(JodExtractor.class);Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10.3K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/net/protocol/storage/Handler.java
/** * Handler for the "storage" protocol, allowing access to objects stored in a MinIO-compatible storage service. * This handler extends {@link URLStreamHandler} to provide a way to open connections to storage objects * using URLs with the "storage" protocol. * * <p> * The URL format is expected to be: {@code storage://bucketName/objectName}. * The bucket name and object name are extracted from the URL. * </p> * * <p>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10.5K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/PdfExtractor.java
import org.codelibs.fess.crawler.extractor.Extractor; import org.codelibs.fess.crawler.extractor.ExtractorFactory; import org.codelibs.fess.crawler.helper.MimeTypeHelper; /** * PdfExtractor extracts text content from PDF files using Apache PDFBox. * It supports password-protected PDFs and can extract embedded documents and annotations. * * <p>The extractor runs text extraction in a separate thread with a configurable timeout
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 12.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/form/FormScheme.java
import org.codelibs.fess.crawler.Constants; /** * The FormScheme class implements the AuthScheme interface to provide * form-based authentication for HTTP clients. It handles the process of * obtaining a token and logging in using the provided credentials. * * <p>This class supports both GET and POST methods for token and login * requests. It also allows for the replacement of placeholders in URLs and * parameters with actual credentials. *
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14.3K bytes - Viewed (1) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/LhaExtractor.java
import jp.gr.java_conf.dangan.util.lha.LhaHeader; /** * Extractor implementation for LHA (LZH) archive files. * This extractor can extract text content from files within LHA archives * by using appropriate extractors for each contained file type. * * @author shinsuke */ public class LhaExtractor extends AbstractExtractor { /** Logger for this class. */
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 5.8K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/util/XPathAPI.java
/** * Creates a new XPathAPI instance. */ public XPathAPI() { xPath = createXPath(f -> {}); } /** * Creates an instance of {@link XPath} using the provided {@link Consumer} to configure the {@link XPathFactory}. * * @param builder a {@link Consumer} that accepts an {@link XPathFactory} and configures it. * @return a configured instance of {@link XPath}.Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 4.6K bytes - Viewed (0) -
README.md
- **CrawlerContext**: Execution context and configuration - **CrawlerThread**: Individual crawler thread implementation #### Client Architecture - **HcHttpClient**: HTTP/HTTPS client using Apache HttpComponents - **FileSystemClient**: File system access - **FtpClient**: FTP protocol support - **SmbClient**: SMB/CIFS network shares - **StorageClient**: Cloud storage integration
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/UrlConvertHelper.java
* based on a map of target strings and their corresponding replacements. It allows * adding new conversion rules, setting the entire conversion map, and converting * URLs using these rules.</p> * * <p>The conversion is performed by iterating through the conversion map and applying * each replacement rule sequentially. The order of the rules in the map is preserved * during the conversion process.</p>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 3.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 28.5K bytes - Viewed (0)