Search Options

Results per page
Sort
Preferred Languages
Advance

Results 71 - 80 of 87 for provide (0.04 sec)

  1. README.md

    ### Key Features
    
    - **Multi-Protocol Support**: HTTP/HTTPS, File System, FTP, SMB/CIFS, Cloud Storage (MinIO, S3)
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/service/UrlFilterService.java

     */
    package org.codelibs.fess.crawler.service;
    
    import java.util.List;
    import java.util.regex.Pattern;
    
    /**
     * Service interface for managing URL filters.
     * Provides methods to add and remove include/exclude URL filters,
     * as well as retrieve the patterns of these filters.
     */
    public interface UrlFilterService {
    
        /**
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Mar 15 06:52:00 UTC 2025
    - 3.1K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/UrlQueue.java

     * governing permissions and limitations under the License.
     */
    package org.codelibs.fess.crawler.entity;
    
    /**
     * The UrlQueue interface represents a queue of URLs to be processed by a web crawler.
     * It provides methods to get and set various properties of a URL queue entry.
     *
     * @param <IDTYPE> the type of the identifier for the URL queue entry
     */
    public interface UrlQueue<IDTYPE> {
    
        /**
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Mar 15 06:52:00 UTC 2025
    - 4.3K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb/SmbAuthentication.java

    /**
     * Represents SMB authentication information, including server details,
     * credentials, and domain. This class is used to encapsulate the necessary
     * information for authenticating with an SMB server.
     *
     * <p>
     * It provides methods to set and retrieve the server address, port, username,
     * password, and domain. Additionally, it offers a method to construct a path
     * prefix for SMB URLs based on the configured server and port.
     * </p>
     *
     * <p>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 3.9K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/AbstractXmlExtractor.java

    import org.codelibs.fess.crawler.exception.CrawlerSystemException;
    import org.codelibs.fess.crawler.exception.ExtractException;
    
    /**
     * Abstract base class for XML extractors.
     * Provides common functionality for extracting text content from XML-like documents.
     * It handles encoding detection, HTML entity unescaping, and tag-based content extraction.
     *
     */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 8.5K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb1/SmbClient.java

     * This client supports authentication, content retrieval, and metadata extraction from SMB files.
     * It handles file access, directory listing, and access control entries (ACEs) processing.
     * </p>
     *
     * <p>
     * The class provides methods to:
     * </p>
     * <ul>
     *   <li>Initialize the client with SMB authentication details.</li>
     *   <li>Retrieve content and metadata from SMB files.</li>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Sep 18 09:30:45 UTC 2025
    - 23K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/CrawlerThread.java

     *   <li>Handling exceptions that may occur during the crawling process.</li>
     * </ol>
     *
     * <p>
     * The thread also manages the active thread count using {@code crawlerContext.activeThreadCountLock}
     * and provides methods for logging messages using {@link LogHelper}.
     * </p>
     *
     * <p>
     * The crawling process continues until the crawler status is {@link CrawlerStatus#DONE} or the
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 20.4K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/impl/MimeTypeHelperImpl.java

    /**
     * MimeTypeHelperImpl is a helper class that detects the MIME type of a given input stream or filename.
     * It uses the Apache Tika library to detect the MIME type.
     *
     * <p>
     * This class provides methods to:
     * </p>
     * <ul>
     *   <li>Detect the MIME type based on the input stream and filename.</li>
     *   <li>Normalize the filename to handle special characters.</li>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 6.5K bytes
    - Viewed (0)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/XpathTransformer.java

     * to parse the HTML content, evaluate XPath expressions, and generate the XML output.
     * </p>
     * <p>
     * The class supports various XPath result types, including BOOLEAN, NUMBER, STRING, NODESET, and NODE.
     * It also provides options to trim whitespace from extracted values and to specify the character encoding for the output.
     * </p>
     * <p>
     * The {@link #getData(AccessResultData)} method allows retrieving the transformed data as a String (XML content),
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 13.1K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/service/impl/UrlQueueServiceImpl.java

    import org.codelibs.fess.crawler.service.UrlQueueService;
    
    import jakarta.annotation.Resource;
    
    /**
     * Implementation of the {@link UrlQueueService} interface.
     * This class provides methods for managing a queue of URLs to be crawled,
     * including adding, deleting, and retrieving URLs from the queue.
     * It uses a {@link MemoryDataHelper} to store the URL queue data in memory.
     *
     * <p>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 9.3K bytes
    - Viewed (0)
Back to top