Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 21 for crawl (0.01 sec)

  1. src/main/java/org/codelibs/fess/exec/Crawler.java

     * monitoring, and cleanup operations.
     *
     * <p>The crawler can operate in different modes based on command-line options:
     * <ul>
     * <li>Web crawling - crawls web sites and web content</li>
     * <li>File system crawling - crawls file systems and documents</li>
     * <li>Data store crawling - crawls databases and other data sources</li>
     * <li>Combined crawling - runs multiple crawling types simultaneously</li>
     * </ul>
     *
     * <p>Command line usage:
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Aug 07 03:06:29 UTC 2025
    - 31K bytes
    - Viewed (0)
  2. src/main/java/org/codelibs/fess/helper/WebFsIndexHelper.java

         *
         * @param sessionId The session ID for this crawling operation
         * @param webConfigIdList List of web configuration IDs to crawl, null for all
         * @param fileConfigIdList List of file configuration IDs to crawl, null for all
         */
        public void crawl(final String sessionId, final List<String> webConfigIdList, final List<String> fileConfigIdList) {
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 24.9K bytes
    - Viewed (0)
  3. src/test/java/org/codelibs/fess/exec/CrawlerTest.java

                @Override
                public void crawl(String sessionId, List<String> webConfigIdList, List<String> fileConfigIdList) {
                    // Mock crawl implementation
                }
            };
            crawler.webFsIndexHelper = webFsIndexHelper;
    
            dataIndexHelper = new DataIndexHelper() {
                @Override
                public void crawl(String sessionId, List<String> dataConfigIdList) {
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Tue Aug 19 14:09:36 UTC 2025
    - 30.5K bytes
    - Viewed (0)
  4. src/main/java/org/codelibs/fess/helper/DataIndexHelper.java

         * specified in the configIdList parameter.
         *
         * @param sessionId unique identifier for this crawling session
         * @param configIdList list of data configuration IDs to crawl
         */
        public void crawl(final String sessionId, final List<String> configIdList) {
            final List<DataConfig> configList = ComponentUtil.getCrawlingConfigHelper().getDataConfigListByIds(configIdList);
    
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Aug 07 03:06:29 UTC 2025
    - 18.9K bytes
    - Viewed (0)
  5. README.md

    </components>
    ```
    
    ### Crawler Context Configuration
    
    ```java
    // Set maximum number of URLs to crawl
    crawler.crawlerContext.setMaxAccessCount(1000);
    
    // Set number of crawler threads
    crawler.crawlerContext.setNumOfThread(10);
    
    // Set maximum crawl depth
    crawler.crawlerContext.setMaxDepth(3);
    
    // Set request interval (politeness)
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  6. src/main/java/org/codelibs/fess/app/web/admin/fileconfig/CreateForm.java

        @Size(max = 200)
        public String name;
    
        /** The description of the file configuration (maximum 1000 characters). */
        @Size(max = 1000)
        public String description;
    
        /** The file paths to crawl (required, must be valid file URIs). */
        @Required
        @UriType(protocolType = ProtocolType.FILE)
        @CustomSize(maxKey = "form.admin.max.input.size")
        public String paths;
    
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 5.6K bytes
    - Viewed (0)
  7. src/main/java/org/codelibs/fess/app/web/admin/dataconfig/EditForm.java

     * This form extends CreateForm to include fields necessary for updating existing data config entries,
     * including tracking information for optimistic locking and audit trails.
     * Data configs define how to crawl and extract data from databases, CSV files, and other data sources.
     *
     */
    public class EditForm extends CreateForm {
    
        /**
         * Creates a new EditForm instance.
         */
        public EditForm() {
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 2.3K bytes
    - Viewed (0)
  8. src/main/java/org/codelibs/fess/ds/callback/FileListIndexUpdateCallbackImpl.java

            /**
             * Constructs a new crawl request.
             *
             * @param url the URL to crawl
             * @param depth the depth of this URL in the crawling hierarchy
             */
            CrawlRequest(final String url, final int depth) {
                this.url = url;
                this.depth = depth;
            }
    
            /**
             * Gets the URL of this crawl request.
             *
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Aug 07 03:06:29 UTC 2025
    - 28.9K bytes
    - Viewed (0)
  9. src/main/java/org/codelibs/fess/app/web/api/admin/webconfig/SearchBody.java

        /**
         * Default constructor.
         */
        public SearchBody() {
            super();
        }
    
        /** Name of the web crawling configuration */
        public String name;
    
        /** URLs to crawl */
        public String urls;
    
        /** Description of the web crawling configuration */
        public String description;
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 1.2K bytes
    - Viewed (0)
  10. src/main/java/org/codelibs/fess/app/web/api/admin/fileconfig/SearchBody.java

        /**
         * Default constructor.
         */
        public SearchBody() {
            super();
        }
    
        /** Name of the file crawling configuration */
        public String name;
    
        /** File paths to crawl */
        public String paths;
    
        /** Description of the file crawling configuration */
        public String description;
    
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 1.2K bytes
    - Viewed (0)
Back to top