Results 1 - 10 of 56 for Patterns (0.72 sec)
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/CrawlerClientFactory.java
/** * A factory class for managing and creating crawler clients based on URL patterns. * This class implements AutoCloseable to properly handle resource cleanup. * * <p>The factory maintains a map of regular expression patterns to crawler clients, * allowing for URL-based client selection. Clients can be added with specific patterns * and optionally at specific positions in the processing order.</p> *
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 7.3K bytes - Viewed (0) -
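The snippet above describes an ordered map from regular-expression patterns to crawler clients. A minimal sketch of that selection idea follows. It is not the actual `CrawlerClientFactory` code: the class name `PatternClientSelector` and its methods are hypothetical, and clients are stood in for by plain strings.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical illustration of pattern-based client selection;
// not the real CrawlerClientFactory.
final class PatternClientSelector {
    // LinkedHashMap preserves insertion order, so patterns added first are tried first.
    private final Map<Pattern, String> clients = new LinkedHashMap<>();

    // Registers a client name (a stand-in for a real client object) under a regex.
    void addClient(final String regex, final String clientName) {
        clients.put(Pattern.compile(regex), clientName);
    }

    // Returns the first client whose pattern matches the entire URL, or null if none does.
    String getClient(final String url) {
        for (final Map.Entry<Pattern, String> entry : clients.entrySet()) {
            if (entry.getKey().matcher(url).matches()) {
                return entry.getValue();
            }
        }
        return null;
    }

    public static void main(final String[] args) {
        final PatternClientSelector selector = new PatternClientSelector();
        selector.addClient("^https?://.*", "httpClient");
        selector.addClient("^file:.*", "fileSystemClient");
        System.out.println(selector.getClient("https://example.com/page1"));
        System.out.println(selector.getClient("file:/tmp/a.txt"));
    }
}
```

Because lookup walks the map in insertion order, a more specific pattern would need to be registered before a broader one that also matches the same URLs.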
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java
private final int priorityLength; /** * Constructs a new PathPattern from the given robots.txt path pattern. * @param pattern the path pattern string from robots.txt (may contain * and $) */ public PathPattern(final String pattern) { this.pattern = pattern;
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 18.5K bytes - Viewed (0) -
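The `PathPattern` constructor above accepts robots.txt path patterns that may contain `*` and `$`. As a rough sketch of how such a pattern can be translated into a `java.util.regex.Pattern`, assuming the common robots.txt convention that `*` matches any character sequence, a trailing `$` anchors the end of the path, and everything else is a prefix match. The class `RobotsPathPatternSketch` is hypothetical and does not reproduce the Fess implementation (it ignores `priorityLength`, for instance).

```java
import java.util.regex.Pattern;

// Hypothetical sketch; not the RobotsTxt.PathPattern implementation.
final class RobotsPathPatternSketch {
    // Converts a robots.txt path pattern into a regex intended for Matcher.matches().
    static Pattern toRegex(final String pathPattern) {
        final boolean anchored = pathPattern.endsWith("$");
        final String body = anchored ? pathPattern.substring(0, pathPattern.length() - 1) : pathPattern;
        final StringBuilder regex = new StringBuilder();
        // Split on '*' (keeping trailing empty parts) and quote each literal piece.
        final String[] parts = body.split("\\*", -1);
        for (int i = 0; i < parts.length; i++) {
            if (i > 0) {
                regex.append(".*"); // each '*' matches any sequence of characters
            }
            regex.append(Pattern.quote(parts[i]));
        }
        if (!anchored) {
            regex.append(".*"); // without '$' the pattern is a prefix match
        }
        return Pattern.compile(regex.toString());
    }

    public static void main(final String[] args) {
        System.out.println(toRegex("/private/*.pdf$").matcher("/private/a/b.pdf").matches());
        System.out.println(toRegex("/tmp").matcher("/tmp/file").matches());
    }
}
```

Quoting the literal pieces with `Pattern.quote` keeps characters such as `.` and `?` in the path from being interpreted as regex metacharacters.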
fess-crawler-opensearch/src/test/java/org/codelibs/fess/crawler/service/impl/OpenSearchUrlFilterServiceTest.java
.getTotalHits() .value() > 0); // Verify pattern can be retrieved final List<Pattern> patterns = urlFilterService.getIncludeUrlPatternList(sessionId); assertEquals(1, patterns.size()); assertTrue(patterns.get(0).matcher("http://example.com/page1").matches()); assertFalse(patterns.get(0).matcher("http://other.com/page1").matches());
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 11.4K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/util/GsaConfigParser.java
} /** * Converts a GSA URL pattern into a regular expression pattern suitable for Fess. * Handles various GSA pattern formats including regexp, contains, and URL-based patterns. * * @param s the input GSA pattern string * @return a regular expression pattern string, or empty string for comments/invalid patterns */ protected String getFilterPath(final String s) {
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 21.6K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/mylasta/direction/FessProp.java
Pattern[] patterns = (Pattern[]) propMap.get(CRAWLER_METADATA_CONTENT_EXCLUDES); if (patterns == null) { patterns = split(getCrawlerMetadataContentExcludes(), ",") .get(stream -> stream.filter(StringUtil::isNotBlank).map(Pattern::compile).toArray(n -> new Pattern[n])); propMap.put(CRAWLER_METADATA_CONTENT_EXCLUDES, patterns); }
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Sat Dec 13 02:21:17 UTC 2025 - 88.2K bytes - Viewed (0) -
src/main/java/org/codelibs/core/io/SerializeUtil.java
* <p> * Patterns can be exact class names or use wildcards with '*' at the end. * For example: "com.example.*" allows all classes in the com.example package. * </p> * * @param allowedPatterns the patterns of classes to allow * @return an ObjectInputFilter configured with the specified patterns */
Registered: Sat Dec 20 08:55:33 UTC 2025 - Last Modified: Sat Nov 22 11:21:59 UTC 2025 - 9K bytes - Viewed (0) -
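The Javadoc above describes building an `ObjectInputFilter` from a list of allowed class patterns. One way this could look with the JDK's own filter syntax (Java 9+) is sketched below. It assumes the allowed patterns are passed through unchanged to `ObjectInputFilter.Config.createFilter` and that everything else is rejected with a trailing `!*`; the class `AllowListFilterSketch` and its helpers are hypothetical and are not the `SerializeUtil` code.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputFilter;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;

// Hypothetical sketch of an allow-list deserialization filter; not SerializeUtil itself.
final class AllowListFilterSketch {
    // Joins the allowed patterns with ';' and appends "!*" so unmatched classes are rejected.
    // Assumes at least one pattern is supplied.
    static ObjectInputFilter createFilter(final String... allowedPatterns) {
        return ObjectInputFilter.Config.createFilter(String.join(";", allowedPatterns) + ";!*");
    }

    // Serializes the object, then reads it back under the filter; true if the read succeeds.
    static boolean isAccepted(final Serializable value, final ObjectInputFilter filter) {
        final ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(value);
        } catch (final IOException e) {
            throw new UncheckedIOException(e);
        }
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
            in.setObjectInputFilter(filter);
            in.readObject();
            return true;
        } catch (final IOException | ClassNotFoundException e) {
            // A rejected class surfaces as java.io.InvalidClassException (an IOException).
            return false;
        }
    }

    public static void main(final String[] args) {
        final ObjectInputFilter filter = createFilter("java.lang.*");
        System.out.println(isAccepted(Integer.valueOf(42), filter));
        System.out.println(isAccepted(new java.util.Date(), filter));
    }
}
```

In the JDK syntax, a pattern ending in `.*` covers classes directly in that package, while `.**` also covers subpackages; whether the code in the snippet distinguishes the two is not visible from the excerpt.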
CLAUDE.md
**Flow**: Poll URL → Validate → Get client → Delay → Check last-modified → Execute → Process → Extract children → Queue children → Delay ### CrawlerClientFactory Pattern-based client selection using `LinkedHashMap<Pattern, CrawlerClient>`. **Standard Patterns**: ```java "^https?://.*" → httpClient "^file:.*" → fileSystemClient "^ftp://.*" → ftpClient "^smb://.*" → smbClient
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/helper/RelatedContentHelper.java
* - Second: List of regex pattern matches (Pattern -> content template) */ protected Map<String, Pair<Map<String, String>, List<Pair<Pattern, String>>>> relatedContentMap = Collections.emptyMap(); /** * Prefix used to identify regex patterns in related content terms. * When a term starts with this prefix, it is treated as a regular expression * pattern rather than an exact match term. */
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Fri Nov 28 16:29:12 UTC 2025 - 8.2K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/storage/StorageClientFactory.java
} final String lowerEndpoint = endpoint.toLowerCase(Locale.ROOT); // GCS patterns if (lowerEndpoint.contains("storage.googleapis.com") || lowerEndpoint.contains(".storage.cloud.google.com")) { return StorageType.GCS; } // S3 patterns if (lowerEndpoint.contains(".amazonaws.com") || lowerEndpoint.contains("s3.") || lowerEndpoint.contains("s3-")) {
Registered: Sat Dec 20 09:19:18 UTC 2025 - Last Modified: Sat Dec 20 05:56:45 UTC 2025 - 4.2K bytes - Viewed (0) -
CLAUDE.md
├── normalizer/ # Text normalizers ├── converter/ # Reading converters (katakana, romaji) ├── concurrent/ # Async patterns (Deferred/Promise) └── util/ # Utilities ``` ### Key Design Patterns - **Builder**: SuggesterBuilder, SuggestRequestBuilder - **Facade**: Suggester (main entry point) - **Composite**: NormalizerChain, ReadingConverterChain
Registered: Sat Dec 20 13:04:59 UTC 2025 - Last Modified: Mon Nov 24 03:40:05 UTC 2025 - 8.9K bytes - Viewed (0)