Results 41 - 44 of 44 for Sitemaps (1.22 sec)
README.md
controller.setDefaultIntervalTime(1000); }); ``` ### Sitemap Support ```java // Enable sitemap processing container.singleton("sitemapsRule", SitemapsRule.class, rule -> { rule.addRule("url", ".*sitemap.*"); }); // Add sitemap URL crawler.addUrl("https://example.com/sitemap.xml"); ``` ## Data Access and Storage ### Accessing Crawled Data ```java
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/RobotsTxtHelper.java
protected static final Pattern CRAWL_DELAY_RECORD =
        Pattern.compile("^crawl-delay:\\s*([^\\s]+)\\s*$", Pattern.CASE_INSENSITIVE);
/**
 * Pattern for Sitemap record.
 */
protected static final Pattern SITEMAP_RECORD =
        Pattern.compile("^sitemap:\\s*([^\\s]+)\\s*$", Pattern.CASE_INSENSITIVE);
/** Whether robots.txt processing is enabled. */
protected boolean enabled = true;
/**
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 14 12:52:01 UTC 2025 - 11.4K bytes
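The `SITEMAP_RECORD` expression in the RobotsTxtHelper.java snippet can be tried in isolation. The following is a minimal sketch using only `java.util.regex`; the class `SitemapRecordDemo` and the helper `extractSitemaps` are hypothetical names introduced for illustration and are not part of fess-crawler, and the line-by-line loop is an assumption about how such a pattern might be applied, not the helper's actual parsing logic.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class SitemapRecordDemo {
    // Same expression as the SITEMAP_RECORD constant in the snippet.
    static final Pattern SITEMAP_RECORD =
            Pattern.compile("^sitemap:\\s*([^\\s]+)\\s*$", Pattern.CASE_INSENSITIVE);

    // Hypothetical helper: collects sitemap URLs from robots.txt text, one line at a time.
    static List<String> extractSitemaps(String robotsTxt) {
        List<String> urls = new ArrayList<>();
        for (String line : robotsTxt.split("\\r?\\n")) {
            Matcher m = SITEMAP_RECORD.matcher(line.trim());
            if (m.find()) {
                urls.add(m.group(1)); // group 1 is the non-whitespace token after "sitemap:"
            }
        }
        return urls;
    }

    public static void main(String[] args) {
        String robotsTxt = "User-agent: *\n"
                + "Disallow: /private/\n"
                + "SITEMAP: https://example.com/sitemap.xml\n";
        System.out.println(extractSitemaps(robotsTxt));
    }
}
```

Because the pattern is compiled with `Pattern.CASE_INSENSITIVE`, `Sitemap:`, `sitemap:` and `SITEMAP:` all match, and the `^` anchor means a commented-out line such as `# sitemap: ...` does not.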
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapAlternateLink.java
 * governing permissions and limitations under the License.
 */
package org.codelibs.fess.crawler.entity;
import java.io.Serializable;
/**
 * Represents an alternate link entry within a sitemap URL.
 * This class encapsulates the properties of an alternate link as defined in the hreflang annotation.
 *
 * <p>
 * The hreflang attribute tells search engines about alternate versions of a page in different languages
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Nov 13 13:34:36 UTC 2025 - 2.6K bytes
fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml
<match value="allow:" type="stringignorecase" offset="0"/>
<match value="disallow:" type="stringignorecase" offset="0"/>
<match value="sitemap:" type="stringignorecase" offset="0"/>
<match value="\nuser-agent:" type="stringignorecase" offset="0:1000"/>
<match value="\nallow:" type="stringignorecase" offset="0:1000"/>
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Oct 16 07:46:32 UTC 2025 - 320.2K bytes