- Sort Score
- Result 10 results
- Languages All
Results 31 - 40 of 82 for fset (0.02 sec)
-
README.md
</components> ``` ### Crawler Context Configuration ```java // Set maximum number of URLs to crawl crawler.crawlerContext.setMaxAccessCount(1000); // Set number of crawler threads crawler.crawlerContext.setNumOfThread(10); // Set maximum crawl depth crawler.crawlerContext.setMaxDepth(3); // Set request interval (politeness) crawler.crawlerContext.setDefaultIntervalTime(1000); // 1 second ```
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java
// Set new set Set<String> newSet = new HashSet<>(); newSet.add("http://new.com/robots.txt"); crawlerContext.setRobotsTxtUrlSet(newSet); assertSame(newSet, crawlerContext.getRobotsTxtUrlSet()); assertEquals(1, crawlerContext.getRobotsTxtUrlSet().size()); // Set null crawlerContext.setRobotsTxtUrlSet(null);Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 25.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/JodExtractor.java
* * @param officeManager the office manager to set */ public void setOfficeManager(final OfficeManager officeManager) { this.officeManager = officeManager; } /** * Sets the temporary directory for file operations. * * @param tempDir the temporary directory to set */ public void setTempDir(final File tempDir) {Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10.3K bytes - Viewed (0) -
fess-crawler-opensearch/src/test/resources/lasta_di.properties
# _/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/ # Lasta Di properties, you can set container's options # _/_/_/_/_/_/_/_/_/_/ # location of smart-deploy mode e.g. maihama_env.properties: lasta_di.smart.deploy.mode #smart.deploy.mode.location = maihama_env.properties: lasta_di.smart.deploy.mode # package for smart deploy target e.g. org.docksidestage.app smart.package1 = org.codelibs.fess.crawler
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Nov 07 04:44:10 UTC 2024 - 479 bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/impl/AuthenticationImpl.java
/** * Sets the authentication scope. * * @param authScope the authentication scope to set */ public void setAuthScope(final AuthScope authScope) { this.authScope = authScope; } /** * Sets the credentials. * @param credentials The credentials to set. */ public void setCredentials(final Credentials credentials) { this.credentials = credentials;Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 3.8K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/exception/CrawlingAccessException.java
* It extends CrawlerSystemException and provides functionality to set and check the log level for the exception. * * <p> * This exception can be thrown when there are problems accessing URLs, files, or any other resources needed for crawling. * It includes constructors to handle messages, causes, or both. * </p> * * <p>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 3.8K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/interval/impl/AbstractIntervalController.java
* If {@link #ignoreException} is set to true, any exceptions thrown during the delay will be caught * and ignored. Otherwise, they will be re-thrown as {@link CrawlerSystemException}. * </p> * */ public abstract class AbstractIntervalController implements IntervalController { /** * Indicates whether exceptions during the delay process should be ignored.Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 4.5K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb1/SmbAuthentication.java
* * <p> * It provides methods to set and retrieve the server address, port, username, * password, and domain. Additionally, it offers a method to construct a path * prefix for SMB1 URLs based on the configured server and port. * </p> * * <p> * The path prefix is in the format "smb1://server:port/", where the port is * included only if it's greater than 0. If the server is not set, the path * prefix will be "smb1://". * </p>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Sep 18 09:30:45 UTC 2025 - 3.9K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/fs/FileSystemClient.java
return filePath; } return buf.toString(); } /** * Gets the character set for the given file. * * @param file the file to get the character set for * @return the character set */ protected String getCharSet(final File file) { return charset; } /**Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 13.8K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/processor/impl/DefaultResponseProcessor.java
* * @param crawlerContext the crawler context * @param childUrlList the set of child URLs * @param url the parent URL * @param depth the depth of the child URLs * @param encoding the encoding of the child URLs */ protected void storeChildUrls(final CrawlerContext crawlerContext, final Set<RequestData> childUrlList, final String url, final int depth, final String encoding) {Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 12.5K bytes - Viewed (0)