Search Options

Results per page
Sort
Preferred Languages
Advance

Results 21 - 30 of 58 for are (0.01 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorFactory.java

     * It also includes a builder for creating extractors.
     *
     * <p>
     * The factory maintains a map of keys to an array of {@link Extractor} objects.
     * When multiple extractors are associated with a single key, they are sorted by weight
     * in descending order. The {@link #getExtractor(String)} method returns a composite
     * extractor that iterates through the available extractors until one successfully
     * extracts the data.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 7.3K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapFile.java

            this.lastmod = lastmod;
        }
    
        /**
         * Checks if this SitemapFile is equal to another object.
         * @param obj the object to compare with
         * @return true if the objects are equal, false otherwise
         */
        @Override
        public boolean equals(final Object obj) {
            if (this == obj) {
                return true;
            }
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 4.4K bytes
    - Viewed (1)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/PasswordBasedExtractor.java

     * <ul>
     *   <li>Static passwords configured via {@link #addPassword(String, String)}</li>
     *   <li>Dynamic passwords provided through extraction parameters</li>
     * </ul>
     *
     * <p>Passwords are matched against URLs or resource names using regular expression patterns.
     * The extractor first tries to match against the URL, then falls back to the resource name if available.
     *
     * @author shinsuke
     */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 5.1K bytes
    - Viewed (0)
  4. src/test/java/org/codelibs/opensearch/extension/analysis/NGramSynonymTokenizerFactory.java

            if (synonymLoader.getSynonymMap() == null) {
                if (settings.getAsList("synonyms", null) != null) {
                    logger.warn("synonyms values are empty.");
                } else if (settings.get("synonyms_path") != null) {
                    logger.warn("synonyms_path[{}] is empty.", settings.get("synonyms_path"));
                } else {
    Registered: Fri Sep 19 09:08:11 UTC 2025
    - Last Modified: Sat Mar 15 06:51:20 UTC 2025
    - 2.4K bytes
    - Viewed (0)
  5. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerStatusTest.java

            hashCodes.add(CrawlerStatus.RUNNING.hashCode());
            hashCodes.add(CrawlerStatus.DONE.hashCode());
    
            // While not guaranteed, typically enum hashCodes are different
            assertTrue(hashCodes.size() > 1);
        }
    
        /**
         * Test compareTo method
         */
        public void test_compareTo() {
            // Compare with self
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Wed Sep 03 14:42:53 UTC 2025
    - 15.8K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/AbstractXmlExtractor.java

        public void setPreloadSizeForCharset(final int preloadSizeForCharset) {
            this.preloadSizeForCharset = preloadSizeForCharset;
        }
    
        /**
         * Returns whether comment tags are ignored during extraction.
         * @return true if comment tags are ignored, false otherwise.
         */
        public boolean isIgnoreCommentTag() {
            return ignoreCommentTag;
        }
    
        /**
         * Sets whether to ignore comment tags.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 8.5K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/net/protocol/storage/Handler.java

     * using URLs with the "storage" protocol.
     *
     * <p>
     * The URL format is expected to be: {@code storage://bucketName/objectName}.
     * The bucket name and object name are extracted from the URL.
     * </p>
     *
     * <p>
     * The handler relies on environment variables for configuration:
     * </p>
     * <ul>
     *   <li>{@code STORAGE_ENDPOINT}: The endpoint URL of the MinIO service.</li>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.5K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java

     * and handling the overall crawling lifecycle.
     *
     * <p>It implements the Runnable interface to be executed in a separate thread,
     * and the AutoCloseable interface to ensure resources are properly released after use.
     *
     * <p>The crawler uses several services and components, such as UrlQueueService, DataService,
     * UrlFilter, RuleManager, CrawlerContainer, IntervalController, and CrawlerClientFactory,
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14K bytes
    - Viewed (0)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/exception/ChildUrlsException.java

     */
    package org.codelibs.fess.crawler.exception;
    
    import java.util.Set;
    
    import org.codelibs.fess.crawler.entity.RequestData;
    
    /**
     * {@link ChildUrlsException} is thrown when child URLs are found during crawling.
     * It extends {@link CrawlerSystemException} and holds a set of {@link RequestData}
     * representing the child URLs that caused the exception.
     *
     */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 1.8K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/exception/CrawlingAccessException.java

     * It extends CrawlerSystemException and provides functionality to set and check the log level for the exception.
     *
     * <p>
     * This exception can be thrown when there are problems accessing URLs, files, or any other resources needed for crawling.
     * It includes constructors to handle messages, causes, or both.
     * </p>
     *
     * <p>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 3.8K bytes
    - Viewed (0)
Back to top