Search Options

Results per page
Sort
Preferred Languages
Advance

Results 161 - 170 of 198 for Extract (1.12 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TextExtractor.java

    import org.codelibs.fess.crawler.entity.ExtractData;
    import org.codelibs.fess.crawler.exception.CrawlerSystemException;
    import org.codelibs.fess.crawler.exception.ExtractException;
    
    /**
     * Extracts text content from an input stream as plain text.
     */
    public class TextExtractor extends AbstractExtractor {
    
        /**
         * The encoding for text.
         */
        protected String encoding = Constants.UTF_8;
    
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 2K bytes
    - Viewed (0)
  2. src/main/java/org/codelibs/fess/crawler/transformer/FessXpathTransformer.java

            if (StringUtil.isNotBlank(xpath)) {
                return xpath;
            }
            return fessConfig.getCrawlerDocumentHtmlDigestXpath();
        }
    
        /**
         * Extracts the canonical URL from the HTML document.
         *
         * @param responseData the response data from crawling
         * @param document the parsed HTML document
         * @return the canonical URL if found, null otherwise
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Aug 07 03:06:29 UTC 2025
    - 54.4K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/XmlExtractor.java

     * governing permissions and limitations under the License.
     */
    package org.codelibs.fess.crawler.extractor.impl;
    
    import java.util.regex.Pattern;
    
    /**
     * Extracts text content from XML documents.
     */
    public class XmlExtractor extends AbstractXmlExtractor {
    
        /**
         * Creates a new XmlExtractor instance.
         */
        public XmlExtractor() {
            super();
        }
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 2.6K bytes
    - Viewed (0)
  4. ci/official/utilities/extract_resultstore_links.py

    # See the License for the specific language governing permissions and
    # limitations under the License.
    # ==============================================================================
    
    """Extracts ResultStore links from a log containing Bazel invocations.
    
    The links and the invocations' status can then be printed out, or output in the
    form of JUnit-based XML.
    """
    import argparse
    import datetime
    import os
    Registered: Tue Sep 09 12:39:10 UTC 2025
    - Last Modified: Wed Nov 08 17:50:27 UTC 2023
    - 10.9K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/rule/impl/RegexRule.java

     * only one of them (allRequired = false). It also supports a default rule that always matches.
     *
     * <p>
     * The class uses a map of field names to Pattern objects to store the regular expressions.
     * The match method extracts the values of the specified fields from the ResponseData and
     * applies the corresponding regular expressions.
     * </p>
     *
     * <p>
     * Example usage:
     * </p>
     *
     * <pre>
     * {@code
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 6.2K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/BinaryTransformer.java

     * into a ResultData object, and for retrieving the binary data from an AccessResultData object.
     * It extends the AbstractTransformer class.
     *
     * <p>
     * This transformer extracts the binary content from the response body of a web resource,
     * stores it as a byte array in the ResultData, and provides a method to retrieve this data
     * as a ByteArrayInputStream.
     * </p>
     *
     * <p>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 3.8K bytes
    - Viewed (0)
  7. docs/fr/docs/python-types.md

    Mais malheureusement, rien d'utile n'en résulte :
    
    <img src="/img/python-types/image01.png">
    
    ### Ajouter des types
    
    Modifions une seule ligne de la version précédente.
    
    Nous allons changer seulement cet extrait, les paramètres de la fonction, de :
    
    
    ```Python
        first_name, last_name
    ```
    
    à :
    
    ```Python
        first_name: str, last_name: str
    ```
    
    C'est tout.
    
    Ce sont des annotations de types :
    Registered: Sun Sep 07 07:19:17 UTC 2025
    - Last Modified: Sat Nov 09 16:39:20 UTC 2024
    - 10K bytes
    - Viewed (0)
  8. src/main/java/org/codelibs/fess/entity/GeoInfo.java

        /** The OpenSearch query builder for geographic queries. */
        private QueryBuilder builder;
    
        /**
         * Constructs a GeoInfo instance by parsing geographic parameters from the HTTP request.
         * Extracts geo.{field}.point and geo.{field}.distance parameters to create
         * geo-distance queries for each configured geographic field.
         *
         * @param request the HTTP servlet request containing geographic parameters
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Jul 17 08:28:31 UTC 2025
    - 5.7K bytes
    - Viewed (0)
  9. src/test/java/jcifs/smb1/ntlmssp/NtlmMessageTest.java

            byte[] a = new byte[] { 0x01, 0x02, 0x03 };
            assertThrows(IndexOutOfBoundsException.class, () -\u003e NtlmMessage.readULong(a, 0));
        }
    
        @Test
        @DisplayName("readSecurityBuffer correctly extracts data")
        void testReadSecurityBuffer() {
            byte[] buf = new byte[8 + 4];
            byte[] payload = { 0x11, 0x22, 0x33, 0x44 };
            NtlmMessage.writeSecurityBuffer(buf, 0, 8, payload);
    Registered: Sun Sep 07 00:10:21 UTC 2025
    - Last Modified: Thu Aug 14 05:31:44 UTC 2025
    - 6.8K bytes
    - Viewed (0)
  10. src/main/java/org/codelibs/fess/crawler/FessCrawlerThread.java

                        throw new ContainerNotAvailableException(t);
                    }
                    throw t;
                }
            }
        }
    
        /**
         * Extracts anchor URLs from the given object and converts them to RequestData objects.
         * The input object can be either a single string or a list of strings representing URLs.
         *
    Registered: Thu Sep 04 12:52:25 UTC 2025
    - Last Modified: Thu Aug 07 03:06:29 UTC 2025
    - 19.1K bytes
    - Viewed (0)
Back to top