Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 114 for Patterns (0.79 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java

            private final int priorityLength;
    
            /**
             * Constructs a new PathPattern from the given robots.txt path pattern.
             * @param pattern the path pattern string from robots.txt (may contain * and $)
             */
            public PathPattern(final String pattern) {
                this.pattern = pattern;
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 18.5K bytes
    - Viewed (0)
  2. fess-crawler-opensearch/src/test/java/org/codelibs/fess/crawler/service/impl/OpenSearchUrlFilterServiceTest.java

                    .getTotalHits()
                    .value() > 0);
    
            // Verify pattern can be retrieved
            final List<Pattern> patterns = urlFilterService.getIncludeUrlPatternList(sessionId);
            assertEquals(1, patterns.size());
            assertTrue(patterns.get(0).matcher("http://example.com/page1").matches());
            assertFalse(patterns.get(0).matcher("http://other.com/page1").matches());
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 11.4K bytes
    - Viewed (0)
  3. src/main/java/org/codelibs/fess/util/GsaConfigParser.java

        }
    
        /**
         * Converts a GSA URL pattern into a regular expression pattern suitable for Fess.
         * Handles various GSA pattern formats including regexp, contains, and URL-based patterns.
         *
         * @param s the input GSA pattern string
         * @return a regular expression pattern string, or empty string for comments/invalid patterns
         */
        protected String getFilterPath(final String s) {
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 21.6K bytes
    - Viewed (0)
  4. fess-crawler/src/test/java/org/codelibs/fess/crawler/filter/UrlFilterTest.java

            String sessionId = "test-session-024";
            urlFilter.init(sessionId);
    
            // Test empty pattern
            urlFilter.addInclude("");
            urlFilter.addExclude("");
    
            // Test single character pattern
            urlFilter.addInclude(".");
            urlFilter.addExclude("*");
    
            // Test patterns with only special characters
            urlFilter.addInclude("^$");
            urlFilter.addExclude(".*");
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Wed Sep 03 14:42:53 UTC 2025
    - 19K bytes
    - Viewed (0)
  5. MIGRATION.md

       - Convert URL patterns to Fess regex patterns
       - Set up LabelType (access control labels) if defined
    
    **Step 3: Verify Imported Configurations**
    
    After import, verify:
    - **Crawler > Web**: Check web crawling configurations
    - **Crawler > File**: Check file crawling configurations (SMB, FTP)
    - **System > General**: Verify URL patterns and filters
    
    **GSA Configuration Mapping**
    
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Nov 06 12:40:11 UTC 2025
    - 23.2K bytes
    - Viewed (0)
  6. CLAUDE.md

    **Flow**: Poll URL → Validate → Get client → Delay → Check last-modified → Execute → Process → Extract children → Queue children → Delay
    
    ### CrawlerClientFactory
    
    Pattern-based client selection using `LinkedHashMap<Pattern, CrawlerClient>`.
    
    **Standard Patterns**:
    ```java
    "^https?://.*"     → httpClient
    "^file:.*"         → fileSystemClient
    "^ftp://.*"        → ftpClient
    "^smb://.*"        → smbClient
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)
  7. pom.xml

    							<licenseFamilyName>GNU Lesser General Public License</licenseFamilyName>
    							<notes />
    							<patterns>
    								<pattern>This library is free software; you can redistribute it</pattern>
    								<pattern>GNU Lesser General Public License</pattern>
    							</patterns>
    						</license>
    					</licenses>
    					<licenseFamilies>
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Mon Aug 25 14:34:10 UTC 2025
    - 12.1K bytes
    - Viewed (0)
  8. src/test/java/jcifs/internal/smb2/info/Smb2QueryDirectoryRequestTest.java

        }
    
        @Test
        @DisplayName("Test with wildcard patterns")
        void testWildcardPatterns() {
            request = new Smb2QueryDirectoryRequest(mockConfig);
    
            // Test various wildcard patterns
            String[] patterns = { "*", "*.txt", "test*.*", "?test?.doc" };
    
            for (String pattern : patterns) {
                request.setFileName(pattern);
    
                byte[] buffer = new byte[1024];
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Thu Aug 14 05:31:44 UTC 2025
    - 13.2K bytes
    - Viewed (0)
  9. README.md

    ### Key Architectural Patterns
    - **Factory Pattern** - `BeanDescFactory` for creating bean descriptors, `ParameterizedClassDescFactory` for generic type handling
    - **Builder Pattern** - `CopyOptions` for configuring bean copying operations with fluent API
    - **Adapter Pattern** - Logging adapters (`JclLoggerAdapter`, `JulLoggerAdapter`) for different logging frameworks
    Registered: Sat Dec 20 08:55:33 UTC 2025
    - Last Modified: Sun Aug 31 02:56:02 UTC 2025
    - 12.7K bytes
    - Viewed (0)
  10. src/main/java/org/codelibs/fess/helper/LabelTypeHelper.java

            final Set<String> valueSet = new HashSet<>();
            for (final LabelTypePattern pattern : labelTypePatternList) {
                if (pattern.match(path)) {
                    valueSet.add(pattern.getValue());
                }
            }
            return valueSet;
        }
    
        /**
         * Builds a list of label type patterns.
         *
         * @param labelTypeList The list of label types.
         */
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 14.8K bytes
    - Viewed (0)
Back to top