Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 7 of 7 for beta (0.03 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlXpathExtractor.java

     * </p>
     *
     */
    public class HtmlXpathExtractor extends AbstractXmlExtractor {
        /**
         * Regular expression pattern to match the charset attribute in the meta tag of HTML documents.
         * The pattern captures the charset value specified in the content attribute of the meta tag.
         * Example: &lt;meta http-equiv="Content-Type" content="text/html; charset=UTF-8"&gt;
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.3K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlExtractor.java

        /**
         * Gets the pattern used for extracting charset from meta tags.
         *
         * @return the meta charset pattern
         */
        public Pattern getMetaCharsetPattern() {
            return metaCharsetPattern;
        }
    
        /**
         * Sets the pattern used for extracting charset from meta tags.
         *
         * @param metaCharsetPattern the meta charset pattern to set
         */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 9.3K bytes
    - Viewed (0)
  3. fess-crawler-opensearch/src/test/resources/app.xml

    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" 
    	"http://dbflute.org/meta/lastadi10.dtd">
    <components>
        <include path="crawler_opensearch.xml"/>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Nov 07 04:44:10 UTC 2024
    - 216 bytes
    - Viewed (0)
  4. fess-crawler-opensearch/src/main/resources/crawler/opensearch.xml

    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" 
    	"http://dbflute.org/meta/lastadi10.dtd">
    <components namespace="fessCrawler">
    	<component name="esClient"
    		class="org.codelibs.fess.crawler.client.FesenClient">
    	</component>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Nov 07 04:44:10 UTC 2024
    - 293 bytes
    - Viewed (1)
  5. fess-crawler-opensearch/src/main/resources/crawler_opensearch.xml

    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN" 
    	"http://dbflute.org/meta/lastadi10.dtd">
    <components namespace="fessCrawler">
        <include path="crawler/container.xml"/>
        <include path="crawler/client.xml"/>
        <include path="crawler/rule.xml"/>
        <include path="crawler/filter.xml"/>
        <include path="crawler/interval.xml"/>
        <include path="crawler/extractor.xml"/>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Nov 07 04:44:10 UTC 2024
    - 2.2K bytes
    - Viewed (0)
  6. README.md

    ```xml
    <!-- crawler.xml -->
    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE components PUBLIC "-//DBFLUTE//DTD LastaDi 1.0//EN"
        "http://dbflute.org/meta/lastadi10.dtd">
    <components namespace="fessCrawler">
        <component name="crawler" class="org.codelibs.fess.crawler.Crawler" instance="prototype"/>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  7. fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml

        <glob pattern="*.junit"/>
        <glob pattern="*.jx"/>
        <glob pattern="*.manifest"/>
        <glob pattern="*.m4"/>
        <glob pattern="*.mf"/>
        <glob pattern="*.MF"/>
        <glob pattern="*.meta"/>
        <glob pattern="*.mdo"/>
        <glob pattern="*.n3"/>
        <glob pattern="*.pen"/>
        <glob pattern="*.pod"/>
        <glob pattern="*.pom"/>
        <glob pattern="*.project"/>
        <glob pattern="*.rng"/>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Mar 13 08:18:01 UTC 2025
    - 320.1K bytes
    - Viewed (1)
Back to top