Search Options

Results per page
Sort
Preferred Languages
Advance

Results 21 - 30 of 49 for aient (0.48 sec)

  1. fess-crawler-lasta/src/test/java/org/codelibs/fess/crawler/util/CrawlerWebServer.java

                tempDir.delete();
                tempDir.mkdirs();
    
                // robots.txt
                StringBuilder buf = new StringBuilder();
                buf.append("User-agent: *").append('\n');
                buf.append("Disallow: /admin/").append('\n');
                buf.append("Disallow: /websvn/").append('\n');
                final File robotTxtFile = new File(tempDir, "robots.txt");
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Mar 15 06:52:00 UTC 2025
    - 6.3K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/util/CrawlerWebServer.java

                tempDir.delete();
                tempDir.mkdirs();
    
                // robots.txt
                StringBuilder buf = new StringBuilder();
                buf.append("User-agent: *").append('\n');
                buf.append("Disallow: /admin/").append('\n');
                buf.append("Disallow: /websvn/").append('\n');
                final File robotTxtFile = new File(tempDir, "robots.txt");
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Mar 15 06:52:00 UTC 2025
    - 6.3K bytes
    - Viewed (0)
  3. README.md

    ```bash
    # Format code (Eclipse formatter)
    mvn formatter:format
    
    # Check/apply license headers
    mvn license:check
    mvn license:format
    
    # Generate test coverage report
    mvn jacoco:prepare-agent test jacoco:report
    
    # Generate API documentation
    mvn javadoc:javadoc
    ```
    
    ### Testing
    
    The project uses JUnit 4 with embedded OpenSearch for integration testing:
    
    ```bash
    # Run all tests
    Registered: Sat Dec 20 13:04:59 UTC 2025
    - Last Modified: Sun Aug 31 03:31:14 UTC 2025
    - 12.1K bytes
    - Viewed (1)
  4. pom.xml

    			</plugin>
    			<plugin>
    				<groupId>org.jacoco</groupId>
    				<artifactId>jacoco-maven-plugin</artifactId>
    				<version>0.8.13</version>
    				<executions>
    					<execution>
    						<goals>
    							<goal>prepare-agent</goal>
    						</goals>
    					</execution>
    					<execution>
    						<id>report</id>
    						<phase>prepare-package</phase>
    						<goals>
    							<goal>report</goal>
    						</goals>
    					</execution>
    Registered: Sat Dec 20 09:13:53 UTC 2025
    - Last Modified: Sat Dec 20 04:17:15 UTC 2025
    - 4.7K bytes
    - Viewed (0)
  5. pom.xml

    			</plugin>
    			<plugin>
    				<groupId>org.jacoco</groupId>
    				<artifactId>jacoco-maven-plugin</artifactId>
    				<version>0.8.13</version>
    				<executions>
    					<execution>
    						<goals>
    							<goal>prepare-agent</goal>
    						</goals>
    					</execution>
    					<execution>
    						<id>report</id>
    						<phase>prepare-package</phase>
    						<goals>
    							<goal>report</goal>
    						</goals>
    					</execution>
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Mon Aug 25 14:34:10 UTC 2025
    - 12.1K bytes
    - Viewed (0)
  6. CLAUDE.md

    mvn test -Dtest=SuggesterTest           # Run specific test
    mvn package                              # Package JAR
    mvn formatter:format license:format     # Format code and apply licenses
    mvn clean jacoco:prepare-agent test jacoco:report  # Generate coverage report
    ```
    
    ### Adding New Features
    
    1. Read related source files and tests
    2. Write implementation following existing patterns
    3. Add comprehensive tests
    Registered: Sat Dec 20 13:04:59 UTC 2025
    - Last Modified: Mon Nov 24 03:40:05 UTC 2025
    - 8.9K bytes
    - Viewed (0)
  7. CLAUDE.md

    extractorFactory.addExtractor("text/html", htmlExtractor, 2);  // Weight 2
    extractorFactory.addExtractor("text/html", tikaExtractor, 1);  // Fallback
    ```
    
    ### Helpers
    
    **RobotsTxtHelper**: RFC 9309 parsing, user-agent matching, crawl-delay, sitemaps
    **SitemapsHelper**: Sitemap XML parsing, index handling
    **MimeTypeHelper**: MIME detection via Tika
    **EncodingHelper**: Charset detection with BOM
    **UrlConvertHelper**: URL normalization
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)
  8. src/main/java/org/codelibs/fess/util/ComponentUtil.java

         */
        public static WebApiManagerFactory getWebApiManagerFactory() {
            return getComponent(WEB_API_MANAGER_FACTORY);
        }
    
        /**
         * Gets the user agent helper component.
         * @return The user agent helper.
         */
        public static UserAgentHelper getUserAgentHelper() {
            return getComponent(USER_AGENT_HELPER);
        }
    
        /**
         * Gets the data store factory component.
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 28.9K bytes
    - Viewed (0)
  9. src/main/resources/fess_label_en.properties

    labels.regularName=Regular Name
    labels.replacement=Replacement
    labels.sessionId=Session ID
    labels.sortOrder=Sort Order
    labels.updatedBy=Updated By
    labels.updatedTime=Updated Time
    labels.urls=URLs
    labels.userAgent=User Agent
    labels.username=Username
    labels.value=Value
    labels.versionNo=Version No
    labels.cronExpression=Schedule
    labels.dayForCleanup=Delete documents older than specified days
    labels.crawlingThreadCount=Concurrent Crawling Count
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sat Dec 13 02:21:17 UTC 2025
    - 44K bytes
    - Viewed (0)
  10. src/main/java/org/codelibs/fess/util/GsaConfigParser.java

     *
     * <p>The parser handles the following GSA configuration elements:
     * <ul>
     * <li>Collections with good/bad URL patterns</li>
     * <li>Global parameters including start URLs and filtering rules</li>
     * <li>User agent settings</li>
     * <li>URL pattern matching with regular expressions and contains filters</li>
     * </ul>
     *
     */
    public class GsaConfigParser extends DefaultHandler {
    
        /** Logger instance for this class. */
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 21.6K bytes
    - Viewed (0)
Back to top