- Sort Score
- Num 10 results
- Language All
Results 1 - 10 of 12 for Extraction (0.14 seconds)
-
CLAUDE.md
**Fess Crawler** is a Java-based web crawling framework for enterprise content extraction. ### Essential Info - **Language**: Java 21+ - **Build**: Maven 3.x - **License**: Apache 2.0 - **DI**: LastaFlute DI - **Repo**: https://github.com/codelibs/fess-crawler ### Tech Stack - **HTTP**: Apache HttpComponents 4.5+ and 5.x (switchable) - **Extraction**: Apache Tika, POI, PDFBox
Created: Sun Apr 12 03:50:13 GMT 2026 - Last Modified: Thu Mar 12 03:39:20 GMT 2026 - 8.1K bytes - Click Count (0) -
src/main/java/org/codelibs/fess/crawler/transformer/FessXpathTransformer.java
} return new URL(currentUrl); } /** * Gets child URL extraction rules from configuration. * * @param responseData the response data from crawling * @param resultData the result data * @return stream of tag-attribute pairs for URL extraction */ @OverrideCreated: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Thu Mar 12 01:46:45 GMT 2026 - 55.3K bytes - Click Count (0) -
.teamcity/scripts/CheckWrapper.java
private static final Pattern ALLOWED_WRAPPER_VERSION = Pattern.compile("^[0-9.]+(-(rc|milestone|m)-[0-9]+)?$"); // Keep the same extraction semantics as the old sed: // sed 's/.*gradle-\(.*\)-[a-z]*\.[a-z]*/\1/' private static final Pattern WRAPPER_VERSION_EXTRACT = Pattern.compile(".*gradle-(.*)-[a-z]*\\.[a-z]*");Created: Wed Apr 01 11:36:16 GMT 2026 - Last Modified: Tue Jan 20 03:53:25 GMT 2026 - 6.4K bytes - Click Count (0) -
src/test/java/org/codelibs/fess/crawler/transformer/AbstractFessFileTransformerTest.java
import org.junit.jupiter.api.Test; import org.junit.jupiter.api.TestInfo; /** * Unit tests for {@link AbstractFessFileTransformer}. * Tests file transformation logic including content extraction and metadata handling. */ public class AbstractFessFileTransformerTest extends UnitFessTestCase { private TestableAbstractFessFileTransformer transformer; @OverrideCreated: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Thu Jan 15 12:54:47 GMT 2026 - 8.1K bytes - Click Count (0) -
src/main/java/org/codelibs/fess/helper/DocumentHelper.java
/** * Helper class for document processing and manipulation in the Fess search system. * This class provides utilities for processing document content, titles, and digests, * handling text normalization, content extraction, and similar document hash encoding/decoding. * It also manages document processing requests and integrates with the crawler system. * */ public class DocumentHelper {Created: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Mon Mar 30 14:27:04 GMT 2026 - 17.4K bytes - Click Count (0) -
okhttp/src/commonJvmAndroid/kotlin/okhttp3/internal/platform/Platform.kt
* * Supported on Android 5.0+. * * Supported on OpenJDK 8 via the JettyALPN-boot library or Conscrypt. * * Supported on OpenJDK 9+ via SSLParameters and SSLSocket features. * * ### Trust Manager Extraction * * Supported on Android 2.3+ and OpenJDK 7+. There are no public APIs to recover the trust * manager that was used to create an [SSLSocketFactory]. * * Not supported by choice on JDK9+ due to access checks. *
Created: Fri Apr 03 11:42:14 GMT 2026 - Last Modified: Tue Feb 03 22:17:59 GMT 2026 - 8.1K bytes - Click Count (0) -
src/main/java/org/codelibs/fess/suggest/util/MapValueExtractor.java
*/ package org.codelibs.fess.suggest.util; import java.util.ArrayList; import java.util.List; import java.util.Map; /** * Utility class for type-safe value extraction from Map objects. * Centralizes map access patterns to reduce code duplication and improve type safety. * * <p>This class provides methods to safely extract typed values from Map<String, Object>
Created: Fri Apr 17 09:08:13 GMT 2026 - Last Modified: Sun Feb 01 12:48:24 GMT 2026 - 9.8K bytes - Click Count (0) -
android/guava/src/com/google/common/collect/FluentIterable.java
* * <ul> * <li>chaining methods which return a new {@code FluentIterable} based in some way on the * contents of the current one (for example {@link #transform}) * <li>element extraction methods which facilitate the retrieval of certain elements (for example * {@link #last}) * <li>query methods which answer questions about the {@code FluentIterable}'s contents (for * example {@link #anyMatch})
Created: Fri Apr 03 12:43:13 GMT 2026 - Last Modified: Thu Apr 02 14:49:41 GMT 2026 - 34.7K bytes - Click Count (0) -
guava/src/com/google/common/collect/FluentIterable.java
* * <ul> * <li>chaining methods which return a new {@code FluentIterable} based in some way on the * contents of the current one (for example {@link #transform}) * <li>element extraction methods which facilitate the retrieval of certain elements (for example * {@link #last}) * <li>query methods which answer questions about the {@code FluentIterable}'s contents (for * example {@link #anyMatch})
Created: Fri Apr 03 12:43:13 GMT 2026 - Last Modified: Thu Apr 02 14:49:41 GMT 2026 - 34.7K bytes - Click Count (0) -
src/main/java/org/codelibs/fess/llm/AbstractLlmClient.java
} return extractJsonStringFallback(json, key); } return ""; } /** * Fallback regex-based extraction for string values. * * @param json the JSON response * @param key the key to extract * @return the extracted string value */Created: Tue Mar 31 13:07:34 GMT 2026 - Last Modified: Sat Mar 21 06:04:58 GMT 2026 - 72K bytes - Click Count (0)