Search Options

Display Count
Sort
Preferred Language
Advanced Search

Results 1 - 10 of 12 for Extraction (0.14 seconds)

  1. CLAUDE.md

    **Fess Crawler** is a Java-based web crawling framework for enterprise content extraction.
    
    ### Essential Info
    
    - **Language**: Java 21+
    - **Build**: Maven 3.x
    - **License**: Apache 2.0
    - **DI**: LastaFlute DI
    - **Repo**: https://github.com/codelibs/fess-crawler
    
    ### Tech Stack
    
    - **HTTP**: Apache HttpComponents 4.5+ and 5.x (switchable)
    - **Extraction**: Apache Tika, POI, PDFBox
    Created: Sun Apr 12 03:50:13 GMT 2026
    - Last Modified: Thu Mar 12 03:39:20 GMT 2026
    - 8.1K bytes
    - Click Count (0)
  2. src/main/java/org/codelibs/fess/crawler/transformer/FessXpathTransformer.java

            }
            return new URL(currentUrl);
        }
    
        /**
         * Gets child URL extraction rules from configuration.
         *
         * @param responseData the response data from crawling
         * @param resultData the result data
         * @return stream of tag-attribute pairs for URL extraction
         */
        @Override
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Thu Mar 12 01:46:45 GMT 2026
    - 55.3K bytes
    - Click Count (0)
  3. .teamcity/scripts/CheckWrapper.java

        private static final Pattern ALLOWED_WRAPPER_VERSION =
            Pattern.compile("^[0-9.]+(-(rc|milestone|m)-[0-9]+)?$");
    
        // Keep the same extraction semantics as the old sed:
        //   sed 's/.*gradle-\(.*\)-[a-z]*\.[a-z]*/\1/'
        private static final Pattern WRAPPER_VERSION_EXTRACT =
            Pattern.compile(".*gradle-(.*)-[a-z]*\\.[a-z]*");
    
    Created: Wed Apr 01 11:36:16 GMT 2026
    - Last Modified: Tue Jan 20 03:53:25 GMT 2026
    - 6.4K bytes
    - Click Count (0)
  4. src/test/java/org/codelibs/fess/crawler/transformer/AbstractFessFileTransformerTest.java

    import org.junit.jupiter.api.Test;
    import org.junit.jupiter.api.TestInfo;
    
    /**
     * Unit tests for {@link AbstractFessFileTransformer}.
     * Tests file transformation logic including content extraction and metadata handling.
     */
    public class AbstractFessFileTransformerTest extends UnitFessTestCase {
    
        private TestableAbstractFessFileTransformer transformer;
    
        @Override
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Thu Jan 15 12:54:47 GMT 2026
    - 8.1K bytes
    - Click Count (0)
  5. src/main/java/org/codelibs/fess/helper/DocumentHelper.java

    /**
     * Helper class for document processing and manipulation in the Fess search system.
     * This class provides utilities for processing document content, titles, and digests,
     * handling text normalization, content extraction, and similar document hash encoding/decoding.
     * It also manages document processing requests and integrates with the crawler system.
     *
     */
    public class DocumentHelper {
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Mon Mar 30 14:27:04 GMT 2026
    - 17.4K bytes
    - Click Count (0)
  6. okhttp/src/commonJvmAndroid/kotlin/okhttp3/internal/platform/Platform.kt

     *
     * Supported on Android 5.0+.
     *
     * Supported on OpenJDK 8 via the JettyALPN-boot library or Conscrypt.
     *
     * Supported on OpenJDK 9+ via SSLParameters and SSLSocket features.
     *
     * ### Trust Manager Extraction
     *
     * Supported on Android 2.3+ and OpenJDK 7+. There are no public APIs to recover the trust
     * manager that was used to create an [SSLSocketFactory].
     *
     * Not supported by choice on JDK9+ due to access checks.
     *
    Created: Fri Apr 03 11:42:14 GMT 2026
    - Last Modified: Tue Feb 03 22:17:59 GMT 2026
    - 8.1K bytes
    - Click Count (0)
  7. src/main/java/org/codelibs/fess/suggest/util/MapValueExtractor.java

     */
    package org.codelibs.fess.suggest.util;
    
    import java.util.ArrayList;
    import java.util.List;
    import java.util.Map;
    
    /**
     * Utility class for type-safe value extraction from Map objects.
     * Centralizes map access patterns to reduce code duplication and improve type safety.
     *
     * <p>This class provides methods to safely extract typed values from Map&lt;String, Object&gt;
    Created: Fri Apr 17 09:08:13 GMT 2026
    - Last Modified: Sun Feb 01 12:48:24 GMT 2026
    - 9.8K bytes
    - Click Count (0)
  8. android/guava/src/com/google/common/collect/FluentIterable.java

     *
     * <ul>
     *   <li>chaining methods which return a new {@code FluentIterable} based in some way on the
     *       contents of the current one (for example {@link #transform})
     *   <li>element extraction methods which facilitate the retrieval of certain elements (for example
     *       {@link #last})
     *   <li>query methods which answer questions about the {@code FluentIterable}'s contents (for
     *       example {@link #anyMatch})
    Created: Fri Apr 03 12:43:13 GMT 2026
    - Last Modified: Thu Apr 02 14:49:41 GMT 2026
    - 34.7K bytes
    - Click Count (0)
  9. guava/src/com/google/common/collect/FluentIterable.java

     *
     * <ul>
     *   <li>chaining methods which return a new {@code FluentIterable} based in some way on the
     *       contents of the current one (for example {@link #transform})
     *   <li>element extraction methods which facilitate the retrieval of certain elements (for example
     *       {@link #last})
     *   <li>query methods which answer questions about the {@code FluentIterable}'s contents (for
     *       example {@link #anyMatch})
    Created: Fri Apr 03 12:43:13 GMT 2026
    - Last Modified: Thu Apr 02 14:49:41 GMT 2026
    - 34.7K bytes
    - Click Count (0)
  10. src/main/java/org/codelibs/fess/llm/AbstractLlmClient.java

                }
                return extractJsonStringFallback(json, key);
            }
            return "";
        }
    
        /**
         * Fallback regex-based extraction for string values.
         *
         * @param json the JSON response
         * @param key the key to extract
         * @return the extracted string value
         */
    Created: Tue Mar 31 13:07:34 GMT 2026
    - Last Modified: Sat Mar 21 06:04:58 GMT 2026
    - 72K bytes
    - Click Count (0)
Back to Top