Search Options

Results per page
Sort
Preferred Languages
Advance

Results 31 - 40 of 664 for crawled (0.57 sec)

  1. fess-crawler-lasta/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java

                crawler1.addUrl(url1);
                crawler1.getCrawlerContext().setMaxAccessCount(maxCount);
                crawler1.getCrawlerContext().setNumOfThread(numOfThread);
    
                final Crawler crawler2 = crawlerContainer.getComponent("crawler");
                crawler2.setSessionId(crawler2.getSessionId() + "2");
                crawler2.setBackground(true);
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 12.8K bytes
    - Viewed (0)
  2. src/main/resources/fess_label_nl.properties

    labels.wizard_crawling_config_path=Crawlpad
    labels.wizard_button_register_again=Continu aanmaken
    labels.wizard_button_register_next=Maken
    labels.wizard_start_crawling_title=Start crawlen
    labels.wizard_start_crawler_title=Crawler
    labels.wizard_start_crawling_desc=U kunt nu beginnen met crawlen door op de knop "Start crawlen" te klikken.
    labels.wizard_button_start_crawling=Start crawlen
    labels.wizard_button_finish=Overslaan
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sat Dec 13 02:21:17 UTC 2025
    - 46.1K bytes
    - Viewed (1)
  3. fess-crawler-opensearch/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java

                crawler1.getCrawlerContext().setMaxAccessCount(maxCount);
                crawler1.getCrawlerContext().setNumOfThread(numOfThread);
    
                final Crawler crawler2 = getComponent(Crawler.class);
                crawler2.setBackground(true);
                ((UrlFilterImpl) crawler2.urlFilter).setIncludeFilteringPattern("$1$2$3.*");
                crawler2.addUrl(url2);
                crawler2.getCrawlerContext().setMaxAccessCount(maxCount);
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 7.7K bytes
    - Viewed (0)
  4. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java

                crawler1.addUrl(url1);
                crawler1.getCrawlerContext().setMaxAccessCount(maxCount);
                crawler1.getCrawlerContext().setNumOfThread(numOfThread);
    
                final Crawler crawler2 = container.getComponent("crawler");
                crawler2.setSessionId(crawler2.getSessionId() + "2");
                crawler2.setBackground(true);
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Tue Nov 11 13:40:14 UTC 2025
    - 25.8K bytes
    - Viewed (0)
  5. CLAUDE.md

    - **Web Framework**: LastaFlute (MVC framework)
    - **DI Container**: Lasta Di
    - **Data Access**: DBFlute (type-safe ORM for OpenSearch)
    - **Search Engine**: OpenSearch
    - **App Server**: Embedded Tomcat
    - **Crawler**: fess-crawler library
    - **Scheduler**: Lasta Job
    
    ## Key Directories
    
    ```
    src/main/java/org/codelibs/fess/
    ├── FessBoot.java              # Application entry point
    ├── app/
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 4.8K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapUrl.java

         * command. Even though search engine crawlers may consider this information
         * when making decisions, they may crawl pages marked "hourly" less
         * frequently than that, and they may crawl pages marked "yearly" more
         * frequently than that. Crawlers may periodically crawl pages marked
         * "never" so that they can handle unexpected changes to those pages.
         */
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 13:34:36 UTC 2025
    - 9.1K bytes
    - Viewed (0)
  7. src/main/java/org/codelibs/fess/helper/WebFsIndexHelper.java

    import org.codelibs.fess.Constants;
    import org.codelibs.fess.crawler.Crawler;
    import org.codelibs.fess.crawler.CrawlerContext;
    import org.codelibs.fess.crawler.CrawlerStatus;
    import org.codelibs.fess.crawler.interval.FessIntervalController;
    import org.codelibs.fess.crawler.service.impl.OpenSearchDataService;
    import org.codelibs.fess.crawler.service.impl.OpenSearchUrlFilterService;
    import org.codelibs.fess.crawler.service.impl.OpenSearchUrlQueueService;
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 25K bytes
    - Viewed (0)
  8. fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots.txt

    User-agent: BruteBot
    Disallow: /
    Allow: /foo/bar/
    Crawl-delay: 1314000
    
    # welcome!
    User-agent: Googlebot
    Crawl-delay: 1
    
    User-agent: *
    Disallow: /private/
    Disallow: /help        # disallows /help.html, /help/index.html, etc.
    Allow: /help/faq.html
    Crawl-delay: 3
    
    User-agent: Crawler
    Disallow: /aaa
    
    User-agent: Crawler/1.0
    Disallow: /bbb
    
    User-agent: Crawler/2.0
    Disallow: /ccc
    
    User-agent: Hoge Crawler
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Oct 11 02:16:55 UTC 2015
    - 566 bytes
    - Viewed (0)
  9. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            assertFalse(robotsTxt.allows("/aaa", "Crawler"));
            assertTrue(robotsTxt.allows("/bbb", "Crawler"));
            assertTrue(robotsTxt.allows("/ccc", "Crawler"));
            assertTrue(robotsTxt.allows("/ddd", "Crawler"));
            assertTrue(robotsTxt.allows("/aaa", "Crawler/1.0"));
            assertFalse(robotsTxt.allows("/bbb", "Crawler/1.0"));
            assertTrue(robotsTxt.allows("/ccc", "Crawler/1.0"));
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
  10. MIGRATION.md

      -F "file=@documents.bulk"
    ```
    
    ### 4. Direct Crawler Configuration
    
    Instead of importing documents, configure Fess to crawl your data sources:
    
    **Web Crawling**:
    - **Admin Path**: Crawler > Web
    - **Supports**: HTTP/HTTPS websites
    - **Features**: JavaScript rendering, authentication, custom headers
    
    **File Crawling**:
    - **Admin Path**: Crawler > File
    - **Supports**: SMB, FTP, local file systems
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Nov 06 12:40:11 UTC 2025
    - 23.2K bytes
    - Viewed (0)
Back to top