Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 26 for supports (0.02 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/storage/StorageClient.java

     *   <li>readTimeout - Read timeout in milliseconds (default: 10000)</li>
     * </ul>
     *
     * <p>The client supports URLs in the format: {@code storage://bucket-name/object-path}
     *
     * <p>Features:
     * <ul>
     *   <li>Automatic initialization of MinIO client</li>
     *   <li>Support for HEAD and GET operations</li>
     *   <li>Content length validation</li>
     *   <li>MIME type detection</li>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 17.9K bytes
    - Viewed (2)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java

     *
     * <p>Key features:</p>
     * <ul>
     *   <li>Supports multiple user-agent directives with pattern matching</li>
     *   <li>Handles Allow and Disallow rules for path-based access control</li>
     *   <li>Manages crawl delay settings per user agent</li>
     *   <li>Stores sitemap URLs listed in robots.txt</li>
     * </ul>
     *
     * <p>The class uses case-insensitive pattern matching for user agents and supports
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10K bytes
    - Viewed (0)
  3. src/main/java/org/codelibs/fess/suggest/index/contents/document/ESSourceReader.java

     * It implements the {@link DocumentReader} interface to provide a way to iterate over documents
     * in a large index without loading all of them into memory at once.
     * </p>
     *
     * <p>
     * The reader supports limiting the number of documents read based on a percentage of the total documents
     * or a fixed number. It also allows filtering documents based on their size, using the {@code limitOfDocumentSize}
     * parameter.
     * </p>
     *
    Registered: Fri Sep 19 09:08:11 UTC 2025
    - Last Modified: Thu Aug 07 02:41:28 UTC 2025
    - 11K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/container/StandardCrawlerContainer.java

    import jakarta.annotation.PreDestroy;
    import jakarta.annotation.Resource;
    
    /**
     * A container implementation that manages the lifecycle and dependency injection of components
     * in a crawler application. This container supports both singleton and prototype component
     * instantiation patterns.
     *
     * <p>The container provides mechanisms for:
     * <ul>
     *   <li>Registering and retrieving components by name</li>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14.3K bytes
    - Viewed (0)
  5. README.md

    ### Key Features
    
    - **Multi-Protocol Support**: HTTP/HTTPS, File System, FTP, SMB/CIFS, Cloud Storage (MinIO, S3)
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/SitemapsHelper.java

    import org.xml.sax.SAXNotRecognizedException;
    import org.xml.sax.SAXNotSupportedException;
    import org.xml.sax.helpers.DefaultHandler;
    
    /**
     * Helper class for parsing and validating sitemaps.
     * It supports XML sitemaps, XML sitemap indexes, and text sitemaps,
     * and can handle GZIP compressed sitemaps.
     * The class provides methods to check if an input stream is a valid sitemap,
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14.7K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/XpathTransformer.java

         * @return The result data body.
         */
        protected String getResultDataBody(final String name, final String value) {
            // TODO: Support other XML footer types
            // TODO: Support other field types and trimming options
            return "<field name=\"" + XmlUtil.escapeXml(name) + "\">" + trimSpace(XmlUtil.escapeXml(value != null ? value : "")) + "</field>\n";
        }
    
        /**
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 13.1K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/PdfExtractor.java

    import org.codelibs.fess.crawler.extractor.ExtractorFactory;
    import org.codelibs.fess.crawler.helper.MimeTypeHelper;
    
    /**
     * PdfExtractor extracts text content from PDF files using Apache PDFBox.
     * It supports password-protected PDFs and can extract embedded documents and annotations.
     *
     * <p>The extractor runs text extraction in a separate thread with a configurable timeout
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 12.7K bytes
    - Viewed (0)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb1/SmbClient.java

     * on SMB (Server Message Block) shares using the SMB1 protocol. It extends {@link AbstractCrawlerClient} and utilizes the JCIFS library
     * to interact with SMB resources.
     *
     * <p>
     * This client supports authentication, content retrieval, and metadata extraction from SMB files.
     * It handles file access, directory listing, and access control entries (ACEs) processing.
     * </p>
     *
     * <p>
     * The class provides methods to:
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Sep 18 09:30:45 UTC 2025
    - 23K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/form/FormScheme.java

     * form-based authentication for HTTP clients. It handles the process of
     * obtaining a token and logging in using the provided credentials.
     *
     * <p>This class supports both GET and POST methods for token and login
     * requests. It also allows for the replacement of placeholders in URLs and
     * parameters with actual credentials.
     *
     * <p>Usage example:
     * <pre>
     * {@code
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14.3K bytes
    - Viewed (1)
Back to top