Search Options

Display Count
Sort
Preferred Language
Advanced Search

Results 1 - 10 of 77 for Crawler (0.07 seconds)

  1. src/main/java/org/codelibs/fess/exec/Crawler.java

     * </pre>
     */
    public class Crawler {
    
        /**
         * Creates a new instance of Crawler.
         */
        public Crawler() {
            // Default constructor
        }
    
        /** Logger instance for this class. */
        private static final Logger logger = LogManager.getLogger(Crawler.class);
    
        /** Thread name for web and file system crawling process. */
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Thu Mar 26 02:24:08 GMT 2026
    - 32.4K bytes
    - Click Count (0)
  2. src/main/resources/fess_indices/fess_config.scheduled_job/scheduled_job.bulk

    {"index":{"_index":"fess_config.scheduled_job","_id":"default_crawler"}}
    {"name":"Default Crawler","target":"all","cronExpression":"0 0 * * *","scriptType":"groovy","scriptData":"return container.getComponent(\"crawlJob\").logLevel(\"info\").gcLogging().execute(executor);","jobLogging":true,"crawler":true,"available":true,"sortOrder":1,"createdBy":"system","createdTime":0,"updatedBy":"system","updatedTime":0}
    {"index":{"_index":"fess_config.scheduled_job","_id":"suggest_indexer"}}
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Thu Mar 26 02:24:08 GMT 2026
    - 5K bytes
    - Click Count (0)
  3. CLAUDE.md

    **DI Config**: `fess-crawler-lasta/src/main/resources/`
    - `crawler.xml` (root), `crawler/client.xml`, `crawler/extractor.xml`, `crawler/rule.xml`, `crawler/transformer.xml`, `crawler/transformer_basic.xml`
    - `crawler/mimetype.xml`, `crawler/encoding.xml`, `crawler/robotstxt.xml`, `crawler/sitemaps.xml`
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Thu Mar 12 03:39:20 GMT 2026
    - 8.1K bytes
    - Click Count (0)
  4. src/test/java/org/codelibs/fess/exec/CrawlerTest.java

            Crawler.Options options = new Crawler.Options();
            options.sessionId = "test-session";
            options.fileConfigIds = "file1";
    
            int result = crawler.doCrawl(options);
            assertEquals(Constants.EXIT_OK, result);
        }
    
        @Test
        public void test_doCrawl_withDataConfigIds() {
            Crawler.Options options = new Crawler.Options();
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Fri Mar 13 23:01:26 GMT 2026
    - 30.8K bytes
    - Click Count (0)
  5. src/test/java/org/codelibs/fess/helper/PluginHelperTest.java

            assertTrue(pluginHelper.isExcludedName(ArtifactType.CRAWLER, "fess-crawler-lasta"));
            assertTrue(pluginHelper.isExcludedName(ArtifactType.CRAWLER, "fess-crawler-parent"));
            assertTrue(pluginHelper.isExcludedName(ArtifactType.CRAWLER, "fess-crawler-playwright"));
            assertTrue(pluginHelper.isExcludedName(ArtifactType.CRAWLER, "fess-crawler-webdriver"));
    
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Fri Mar 13 23:01:26 GMT 2026
    - 22.6K bytes
    - Click Count (0)
  6. README.md

    * [LastaFlute](https://github.com/lastaflute/lastaflute "LastaFlute"): Web Framework
    * [Lasta Job](https://github.com/lastaflute/lasta-job "Lasta Job"): Job Scheduler
    * [Fess Crawler](https://github.com/codelibs/fess-crawler "Fess Crawler"): Web Crawler
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Sat Feb 14 03:19:23 GMT 2026
    - 7.8K bytes
    - Click Count (2)
  7. fess-crawler-opensearch/pom.xml

    	<modelVersion>4.0.0</modelVersion>
    	<artifactId>fess-crawler-opensearch</artifactId>
    	<packaging>jar</packaging>
    	<name>Fess Crawler OpenSearch</name>
    	<parent>
    		<groupId>org.codelibs.fess</groupId>
    		<artifactId>fess-crawler-parent</artifactId>
    		<version>15.6.0-SNAPSHOT</version>
    		<relativePath>../pom.xml</relativePath>
    	</parent>
    	<build>
    		<plugins>
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Thu Mar 05 23:30:42 GMT 2026
    - 3.9K bytes
    - Click Count (0)
  8. src/main/java/org/codelibs/fess/Constants.java

        public static final String CRAWLER_START_TIME = "CrawlerStartTime";
    
        /** Info map key for crawler end time. */
        public static final String CRAWLER_END_TIME = "CrawlerEndTime";
    
        /** Info map key for crawler execution time. */
        public static final String CRAWLER_EXEC_TIME = "CrawlerExecTime";
    
        /** Info map key for web/file system crawler start time. */
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Sat Mar 28 11:55:54 GMT 2026
    - 35.8K bytes
    - Click Count (0)
  9. src/main/resources/crawler/rule.xml

    <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN"
    	"http://dbflute.org/meta/lastadi10.dtd">
    <components namespace="fessCrawler">
    	<include path="crawler/container.xml" />
    	<include path="crawler/transformer.xml" />
    
    	<component name="ruleManager" class="org.codelibs.fess.crawler.rule.impl.RuleManagerImpl" instance="prototype">
    		<postConstruct name="addRule">
    			<arg>sitemapsRule</arg>
    		</postConstruct>
    		<postConstruct name="addRule">
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Sun Mar 29 08:21:02 GMT 2026
    - 4.6K bytes
    - Click Count (0)
  10. CLAUDE.md

    |-----------|------------|
    | Web Framework | LastaFlute (MVC framework) |
    | DI Container | Lasta Di |
    | Data Access | DBFlute (type-safe ORM for OpenSearch) |
    | Search Engine | OpenSearch |
    | App Server | Embedded Tomcat |
    | Crawler | fess-crawler library |
    | Scheduler | Lasta Job |
    | Logging | Log4j2 |
    | Testing | JUnit 4/5, UTFlute, REST Assured |
    
    ## Development Commands
    
    ### Setup
    ```bash
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Thu Mar 19 09:48:10 GMT 2026
    - 7.8K bytes
    - Click Count (0)
Back to Top