Results 1 - 10 of 286 for CRAWLER (0.04 seconds)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/Crawler.java

     *
     * <p>Example usage:
     * <pre>
     *   Crawler crawler = new Crawler();
     *   crawler.addUrl("http://example.com/");
     *   crawler.execute();
     *   crawler.close();
     * </pre>
     */
    public class Crawler implements Runnable, AutoCloseable {
    
        private static final Logger logger = LogManager.getLogger(Crawler.class);
    
        /**
         * Service for managing URL queues during crawling.
         */
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Mon Nov 24 03:59:47 GMT 2025
    - 17K bytes
  2. src/main/java/org/codelibs/fess/exec/Crawler.java

     * </pre>
     */
    public class Crawler {
    
        /**
         * Creates a new instance of Crawler.
         */
        public Crawler() {
            // Default constructor
        }
    
        /** Logger instance for this class. */
        private static final Logger logger = LogManager.getLogger(Crawler.class);
    
        /** Thread name for web and file system crawling process. */
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Thu Mar 26 02:24:08 GMT 2026
    - 32.4K bytes
  3. samples/crawler/src/main/java/okhttp3/sample/Crawler.java

        OkHttpClient client = new OkHttpClient.Builder()
            .cache(cache)
            .callTimeout(5, TimeUnit.SECONDS)
            .build();
    
        Crawler crawler = new Crawler(client, queueLimit, hostLimit);
        crawler.queue.add(HttpUrl.get(args[1]));
        crawler.parallelDrainQueue(threadCount);
      }
    Created: Fri Apr 03 11:42:14 GMT 2026
    - Last Modified: Wed Jul 23 00:58:06 GMT 2025
    - 5K bytes
  4. fess-crawler-lasta/src/main/resources/crawler.xml

    <components namespace="fessCrawler">
    	<include path="crawler/container.xml"/>
    	<include path="crawler/client.xml"/>
    	<include path="crawler/rule.xml"/>
    	<include path="crawler/filter.xml"/>
    	<include path="crawler/interval.xml"/>
    	<include path="crawler/extractor.xml"/>
    	<include path="crawler/mimetype.xml"/>
    	<include path="crawler/encoding.xml"/>
    	<include path="crawler/urlconverter.xml"/>
    	<include path="crawler/log.xml"/>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Tue Nov 28 13:40:25 GMT 2017
    - 1.7K bytes
  5. src/main/resources/mail/crawler.dfmail

    /*
     [Crawler Notification]
     Crawler notification mail.
    */
    subject: [FESS] Crawler completed: /*pmb.hostname*/
    >>>
    --- Server Info ---
    Host Name: /*pmb.hostname:orElse('Unknown')*/
    Job Name: /*pmb.jobname:orElse('Unknown')*/
    
    --- Web/FileSystem Crawler ---
    Start Time: /*pmb.webFsCrawlStartTime:orElse('-')*/
    End Time:   /*pmb.webFsCrawlEndTime:orElse('-')*/
    Exec Time:  /*pmb.webFsCrawlExecTime:orElse('-')*/ ms
    
    --- Web/FileSystem Indexer ---
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Wed Jan 15 22:05:20 GMT 2020
    - 1K bytes
  6. src/main/resources/fess_indices/fess_config.scheduled_job/scheduled_job.bulk

    {"index":{"_index":"fess_config.scheduled_job","_id":"default_crawler"}}
    {"name":"Default Crawler","target":"all","cronExpression":"0 0 * * *","scriptType":"groovy","scriptData":"return container.getComponent(\"crawlJob\").logLevel(\"info\").gcLogging().execute(executor);","jobLogging":true,"crawler":true,"available":true,"sortOrder":1,"createdBy":"system","createdTime":0,"updatedBy":"system","updatedTime":0}
    {"index":{"_index":"fess_config.scheduled_job","_id":"suggest_indexer"}}
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Thu Mar 26 02:24:08 GMT 2026
    - 5K bytes
  7. CLAUDE.md

    **DI Config**: `fess-crawler-lasta/src/main/resources/`
    - `crawler.xml` (root), `crawler/client.xml`, `crawler/extractor.xml`, `crawler/rule.xml`, `crawler/transformer.xml`, `crawler/transformer_basic.xml`
    - `crawler/mimetype.xml`, `crawler/encoding.xml`, `crawler/robotstxt.xml`, `crawler/sitemaps.xml`
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Thu Mar 12 03:39:20 GMT 2026
    - 8.1K bytes
  8. README.md

    });
    
    Crawler crawler = container.getComponent("crawler");
    crawler.addUrl("https://example.com");
    crawler.crawlerContext.setMaxAccessCount(500);
    String sessionId = crawler.execute();
    ```
    
    ### Background Crawling
    
    ```java
    // Configure for background execution
    crawler.setBackground(true);
    String sessionId = crawler.execute();
    
    // Check crawling status
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Sun Aug 31 05:32:52 GMT 2025
    - 15.3K bytes
  9. src/test/resources/plugin/repo1/index.html

    <a href="fess-crawler-es/" title="fess-crawler-es/">fess-crawler-es/</a>                                                 -         -      
    <a href="fess-crawler-lasta/" title="fess-crawler-lasta/">fess-crawler-lasta/</a>                                              -         -      
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Tue Aug 13 07:34:14 GMT 2019
    - 3.3K bytes
  10. src/test/resources/plugin/repo2/index.html

                <td>Fri Jan 13 13:53:34 UTC 2017</td>
                <td align="right">
                                  &nbsp;
                              </td>
                <td></td>
              </tr>
                      <tr>
                <td><a href="https://oss.sonatype.org/content/repositories/snapshots/org/codelibs/fess/fess-crawler-db-mysql/">fess-crawler-db-mysql/</a></td>
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Tue Aug 13 07:34:14 GMT 2019
    - 10.5K bytes