Results 31 - 40 of 82 for fset (0.07 sec)

  1. README.md

    </components>
    ```
    
    ### Crawler Context Configuration
    
    ```java
    // Set maximum number of URLs to crawl
    crawler.crawlerContext.setMaxAccessCount(1000);
    
    // Set number of crawler threads
    crawler.crawlerContext.setNumOfThread(10);
    
    // Set maximum crawl depth
    crawler.crawlerContext.setMaxDepth(3);
    
    // Set request interval (politeness)
    crawler.crawlerContext.setDefaultIntervalTime(1000); // 1 second
    ```
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java

            // Set new set
            Set<String> newSet = new HashSet<>();
            newSet.add("http://new.com/robots.txt");
            crawlerContext.setRobotsTxtUrlSet(newSet);
            assertSame(newSet, crawlerContext.getRobotsTxtUrlSet());
            assertEquals(1, crawlerContext.getRobotsTxtUrlSet().size());
    
            // Set null
            crawlerContext.setRobotsTxtUrlSet(null);
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 25.6K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/JodExtractor.java

         *
         * @param officeManager the office manager to set
         */
        public void setOfficeManager(final OfficeManager officeManager) {
            this.officeManager = officeManager;
        }
    
        /**
         * Sets the temporary directory for file operations.
         *
         * @param tempDir the temporary directory to set
         */
        public void setTempDir(final File tempDir) {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.3K bytes
    - Viewed (0)
  4. fess-crawler-opensearch/src/test/resources/lasta_di.properties

    # _/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/
    # Lasta Di properties, you can set container's options
    # _/_/_/_/_/_/_/_/_/_/
    
    # location of smart-deploy mode e.g. maihama_env.properties: lasta_di.smart.deploy.mode
    #smart.deploy.mode.location = maihama_env.properties: lasta_di.smart.deploy.mode
    
    # package for smart deploy target e.g. org.docksidestage.app
    smart.package1 = org.codelibs.fess.crawler
    
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Nov 07 04:44:10 UTC 2024
    - 479 bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/impl/AuthenticationImpl.java

        /**
         * Sets the authentication scope.
         *
         * @param authScope the authentication scope to set
         */
        public void setAuthScope(final AuthScope authScope) {
            this.authScope = authScope;
        }
    
        /**
         * Sets the credentials.
         * @param credentials The credentials to set.
         */
        public void setCredentials(final Credentials credentials) {
            this.credentials = credentials;
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 3.8K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/exception/CrawlingAccessException.java

     * It extends CrawlerSystemException and provides functionality to set and check the log level for the exception.
     *
     * <p>
     * This exception can be thrown when there are problems accessing URLs, files, or any other resources needed for crawling.
     * It includes constructors to handle messages, causes, or both.
     * </p>
     *
     * <p>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 3.8K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/interval/impl/AbstractIntervalController.java

     * If {@link #ignoreException} is set to true, any exceptions thrown during the delay will be caught
     * and ignored. Otherwise, they will be re-thrown as {@link CrawlerSystemException}.
     * </p>
     *
     */
    public abstract class AbstractIntervalController implements IntervalController {
    
        /**
         * Indicates whether exceptions during the delay process should be ignored.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 4.5K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb1/SmbAuthentication.java

     *
     * <p>
     * It provides methods to set and retrieve the server address, port, username,
     * password, and domain. Additionally, it offers a method to construct a path
     * prefix for SMB1 URLs based on the configured server and port.
     * </p>
     *
     * <p>
     * The path prefix is in the format "smb1://server:port/", where the port is
     * included only if it's greater than 0. If the server is not set, the path
     * prefix will be "smb1://".
     * </p>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Sep 18 09:30:45 UTC 2025
    - 3.9K bytes
    - Viewed (0)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/fs/FileSystemClient.java

                return filePath;
            }
            return buf.toString();
        }
    
        /**
         * Gets the character set for the given file.
         *
         * @param file the file to get the character set for
         * @return the character set
         */
        protected String getCharSet(final File file) {
            return charset;
        }
    
        /**
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 13.8K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/processor/impl/DefaultResponseProcessor.java

         *
         * @param crawlerContext the crawler context
         * @param childUrlList the set of child URLs
         * @param url the parent URL
         * @param depth the depth of the child URLs
         * @param encoding the encoding of the child URLs
         */
        protected void storeChildUrls(final CrawlerContext crawlerContext, final Set<RequestData> childUrlList, final String url,
                final int depth, final String encoding) {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 12.5K bytes
    - Viewed (0)
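
The Javadoc excerpt in result 8 (SmbAuthentication.java) describes how the SMB1 path prefix is assembled: "smb1://server:port/", with the port included only when it is greater than 0, and "smb1://" alone when no server is set. The rule can be sketched roughly as below. This is a minimal illustration, not the actual SmbAuthentication code: the class name `SmbPathPrefixSketch` and the method `buildPathPrefix` are hypothetical, and treating a null or empty string as "server not set" is an assumption.

```java
final class SmbPathPrefixSketch {

    private SmbPathPrefixSketch() {
        // utility class, no instances
    }

    /**
     * Builds a path prefix following the rule described in the Javadoc excerpt.
     * Hypothetical helper; "not set" is assumed to mean null or empty.
     */
    static String buildPathPrefix(final String server, final int port) {
        final StringBuilder buf = new StringBuilder("smb1://");
        if (server != null && !server.isEmpty()) {
            buf.append(server);
            // The port is appended only when it is greater than 0.
            if (port > 0) {
                buf.append(':').append(port);
            }
            buf.append('/');
        }
        return buf.toString();
    }

    public static void main(final String[] args) {
        System.out.println(buildPathPrefix("fileserver", 445)); // smb1://fileserver:445/
        System.out.println(buildPathPrefix("fileserver", 0));   // smb1://fileserver/
        System.out.println(buildPathPrefix(null, 445));         // smb1://
    }
}
```

The host name `fileserver` and port 445 are placeholder values chosen for the example.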
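
The Javadoc excerpt in result 7 (AbstractIntervalController.java) says that when `ignoreException` is true, exceptions thrown during the delay are caught and ignored, and otherwise they are re-thrown as `CrawlerSystemException`. The pattern might look like the following sketch. Everything here is a stand-in written for illustration: `IntervalDelaySketch`, `DelayAction`, and `SketchSystemException` are hypothetical names and are not part of fess-crawler, and the boolean return value is an addition that makes the outcome easy to observe.

```java
final class IntervalDelaySketch {

    /** Stand-in for CrawlerSystemException so the sketch compiles on its own. */
    static final class SketchSystemException extends RuntimeException {
        private static final long serialVersionUID = 1L;

        SketchSystemException(final String message, final Throwable cause) {
            super(message, cause);
        }
    }

    /** Hypothetical callback that represents the delay logic. */
    interface DelayAction {
        void run() throws Exception;
    }

    private IntervalDelaySketch() {
        // utility class, no instances
    }

    /**
     * Runs the delay. Returns true if it completed, false if an exception
     * was swallowed because ignoreException was true.
     */
    static boolean delay(final DelayAction action, final boolean ignoreException) {
        try {
            action.run();
            return true;
        } catch (final Exception e) {
            if (!ignoreException) {
                // Wrap and re-throw so callers see a single exception type.
                throw new SketchSystemException("Delay failed.", e);
            }
            return false;
        }
    }

    public static void main(final String[] args) {
        final DelayAction failing = () -> {
            throw new IllegalStateException("interrupted");
        };
        // With ignoreException = true the failure is swallowed.
        System.out.println(delay(failing, true));
        // With ignoreException = false the failure surfaces, wrapped.
        try {
            delay(failing, false);
        } catch (final SketchSystemException e) {
            System.out.println(e.getCause().getMessage());
        }
    }
}
```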