- Sort Score
- Result 10 results
- Languages All
Results 1 - 10 of 26 for supports (0.02 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/storage/StorageClient.java
* <li>readTimeout - Read timeout in milliseconds (default: 10000)</li> * </ul> * * <p>The client supports URLs in the format: {@code storage://bucket-name/object-path} * * <p>Features: * <ul> * <li>Automatic initialization of MinIO client</li> * <li>Support for HEAD and GET operations</li> * <li>Content length validation</li> * <li>MIME type detection</li>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 17.9K bytes - Viewed (2) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java
* * <p>Key features:</p> * <ul> * <li>Supports multiple user-agent directives with pattern matching</li> * <li>Handles Allow and Disallow rules for path-based access control</li> * <li>Manages crawl delay settings per user agent</li> * <li>Stores sitemap URLs listed in robots.txt</li> * </ul> * * <p>The class uses case-insensitive pattern matching for user agents and supports
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/suggest/index/contents/document/ESSourceReader.java
* It implements the {@link DocumentReader} interface to provide a way to iterate over documents * in a large index without loading all of them into memory at once. * </p> * * <p> * The reader supports limiting the number of documents read based on a percentage of the total documents * or a fixed number. It also allows filtering documents based on their size, using the {@code limitOfDocumentSize} * parameter. * </p> *Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Thu Aug 07 02:41:28 UTC 2025 - 11K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/container/StandardCrawlerContainer.java
import jakarta.annotation.PreDestroy; import jakarta.annotation.Resource; /** * A container implementation that manages the lifecycle and dependency injection of components * in a crawler application. This container supports both singleton and prototype component * instantiation patterns. * * <p>The container provides mechanisms for: * <ul> * <li>Registering and retrieving components by name</li>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14.3K bytes - Viewed (0) -
README.md
### Key Features - **Multi-Protocol Support**: HTTP/HTTPS, File System, FTP, SMB/CIFS, Cloud Storage (MinIO, S3)
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/SitemapsHelper.java
import org.xml.sax.SAXNotRecognizedException; import org.xml.sax.SAXNotSupportedException; import org.xml.sax.helpers.DefaultHandler; /** * Helper class for parsing and validating sitemaps. * It supports XML sitemaps, XML sitemap indexes, and text sitemaps, * and can handle GZIP compressed sitemaps. * The class provides methods to check if an input stream is a valid sitemap,
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/XpathTransformer.java
* @return The result data body. */ protected String getResultDataBody(final String name, final String value) { // TODO: Support other XML footer types // TODO: Support other field types and trimming options return "<field name=\"" + XmlUtil.escapeXml(name) + "\">" + trimSpace(XmlUtil.escapeXml(value != null ? value : "")) + "</field>\n"; } /**Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 13.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/PdfExtractor.java
import org.codelibs.fess.crawler.extractor.ExtractorFactory; import org.codelibs.fess.crawler.helper.MimeTypeHelper; /** * PdfExtractor extracts text content from PDF files using Apache PDFBox. * It supports password-protected PDFs and can extract embedded documents and annotations. * * <p>The extractor runs text extraction in a separate thread with a configurable timeout
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 12.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb1/SmbClient.java
* on SMB (Server Message Block) shares using the SMB1 protocol. It extends {@link AbstractCrawlerClient} and utilizes the JCIFS library * to interact with SMB resources. * * <p> * This client supports authentication, content retrieval, and metadata extraction from SMB files. * It handles file access, directory listing, and access control entries (ACEs) processing. * </p> * * <p> * The class provides methods to:
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Sep 18 09:30:45 UTC 2025 - 23K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/form/FormScheme.java
* form-based authentication for HTTP clients. It handles the process of * obtaining a token and logging in using the provided credentials. * * <p>This class supports both GET and POST methods for token and login * requests. It also allows for the replacement of placeholders in URLs and * parameters with actual credentials. * * <p>Usage example: * <pre> * {@code
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14.3K bytes - Viewed (1)