Results 1 - 10 of 28 for sitemaps (0.04 sec)
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/SitemapsHelper.java
/**
 * Helper class for parsing and validating sitemaps.
 * It supports XML sitemaps, XML sitemap indexes, and text sitemaps,
 * and can handle GZIP compressed sitemaps.
 * The class provides methods to check if an input stream is a valid sitemap,
 * and to parse an input stream into a {@link SitemapSet} object.
 * It uses SAX parser for XML sitemaps and XML sitemap indexes,
 * and handles potential exceptions during parsing.
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14.7K bytes - Viewed (0) -
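The Javadoc for `SitemapsHelper` mentions SAX parsing and GZIP handling. The following is a minimal sketch of that general technique using only the JDK, not the actual `SitemapsHelper` code. The class name `SitemapLocSketch` and the method `parseLocs` are hypothetical names introduced for illustration, and detecting compression via the two GZIP magic bytes is an assumption about how such input could be recognized.

```java
import java.io.BufferedInputStream;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.GZIPInputStream;

import javax.xml.parsers.SAXParserFactory;

import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical illustration class; not part of fess-crawler.
class SitemapLocSketch {

    /** Collects the text of every <loc> element from an XML sitemap or sitemap index. */
    static List<String> parseLocs(final InputStream in) throws Exception {
        final BufferedInputStream bin = new BufferedInputStream(in);
        // Peek at the first two bytes to look for the GZIP magic number (0x1f 0x8b).
        bin.mark(2);
        final int b1 = bin.read();
        final int b2 = bin.read();
        bin.reset();
        final InputStream source = (b1 == 0x1f && b2 == 0x8b) ? new GZIPInputStream(bin) : bin;

        final List<String> locs = new ArrayList<>();
        final SAXParserFactory factory = SAXParserFactory.newInstance();
        // Namespace awareness lets the handler match on the local name "loc".
        factory.setNamespaceAware(true);
        factory.newSAXParser().parse(source, new DefaultHandler() {
            private StringBuilder buf; // non-null only while inside a <loc> element

            @Override
            public void startElement(final String uri, final String localName, final String qName,
                    final Attributes atts) {
                if ("loc".equals(localName)) {
                    buf = new StringBuilder();
                }
            }

            @Override
            public void characters(final char[] ch, final int start, final int length) {
                if (buf != null) {
                    buf.append(ch, start, length);
                }
            }

            @Override
            public void endElement(final String uri, final String localName, final String qName) {
                if ("loc".equals(localName) && buf != null) {
                    locs.add(buf.toString().trim());
                    buf = null;
                }
            }
        });
        return locs;
    }
}
```

Because both `<urlset>` and `<sitemapindex>` documents carry their URLs in `<loc>` elements, the same handler covers both cases in this sketch. A production parser would also need to distinguish the two document types and handle text sitemaps, as the Javadoc describes.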
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java
}
/**
 * Test sitemaps add and remove operations
 */
public void test_sitemaps() {
    // Initial state
    assertNull(crawlerContext.removeSitemaps());
    // Add sitemaps
    String[] sitemaps = new String[] { "http://example.com/sitemap.xml", "http://test.com/sitemap.xml" };
    crawlerContext.addSitemaps(sitemaps);
    // Remove and verify
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 25.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/CrawlerThread.java
}
/**
 * Adds sitemaps from robots.txt to the crawling queue.
 * @param urlQueue The URL queue to add sitemaps to.
 */
protected void addSitemapsFromRobotsTxt(final UrlQueue<?> urlQueue) {
    final String[] sitemaps = crawlerContext.removeSitemaps();
    if (sitemaps != null) {
        for (final String childUrl : sitemaps) {
            try {
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 20.4K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java
}
/**
 * Adds a sitemap URL to the list of sitemaps.
 *
 * @param url The URL of the sitemap to be added
 */
public void addSitemap(final String url) {
    if (!sitemapList.contains(url)) {
        sitemapList.add(url);
    }
}
/**
 * Returns an array of sitemap URLs.
 *
 * @return an array of sitemap URLs
 */
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10K bytes - Viewed (0) -
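The `addSitemap` snippet above deduplicates sitemap URLs as they are added. A rough sketch of how such a list might be filled from the `Sitemap:` lines of a robots.txt body is shown here. It is not the `RobotsTxt` class itself: `RobotsSitemapSketch`, `parse`, and the accessor name `getSitemaps` are hypothetical, and treating `#` as the start of a comment is an assumption based on common robots.txt convention.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of fess-crawler.
class RobotsSitemapSketch {

    private final List<String> sitemapList = new ArrayList<>();

    /** Adds a sitemap URL unless it is already present (mirrors the snippet above). */
    void addSitemap(final String url) {
        if (!sitemapList.contains(url)) {
            sitemapList.add(url);
        }
    }

    /** Returns the collected sitemap URLs in insertion order (assumed accessor name). */
    String[] getSitemaps() {
        return sitemapList.toArray(new String[0]);
    }

    /** Scans a robots.txt body for "Sitemap:" directives, matched case-insensitively. */
    static RobotsSitemapSketch parse(final String robotsTxt) {
        final String directive = "sitemap:";
        final RobotsSitemapSketch result = new RobotsSitemapSketch();
        for (final String line : robotsTxt.split("\\r?\\n")) {
            // Assumption: everything after '#' is a comment and can be dropped.
            final int hash = line.indexOf('#');
            final String content = (hash >= 0 ? line.substring(0, hash) : line).trim();
            if (content.regionMatches(true, 0, directive, 0, directive.length())) {
                final String url = content.substring(directive.length()).trim();
                if (!url.isEmpty()) {
                    result.addSitemap(url);
                }
            }
        }
        return result;
    }
}
```

A `List` with a `contains` check keeps insertion order and matches the snippet; for very large lists a `LinkedHashSet` would give the same ordering with faster lookups.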
fess-crawler-lasta/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
file.delete();
file.mkdirs();
file.deleteOnExit();
fileTransformer.setPath(file.getAbsolutePath());
crawler.addUrl(url + "sitemaps.xml");
crawler.crawlerContext.setMaxAccessCount(maxCount);
crawler.crawlerContext.setNumOfThread(numOfThread);
crawler.urlFilter.addInclude(url + ".*");
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 12.8K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
file.delete();
file.mkdirs();
file.deleteOnExit();
fileTransformer.setPath(file.getAbsolutePath());
crawler.addUrl(url + "sitemaps.xml");
crawler.crawlerContext.setMaxAccessCount(maxCount);
crawler.crawlerContext.setNumOfThread(numOfThread);
crawler.urlFilter.addInclude(url + ".*");
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 19.1K bytes - Viewed (0) -
docs/pt/docs/tutorial/dependencies/index.md
* all relational databases * NoSQL databases * external packages * external APIs * authentication and authorization systems * API usage monitoring systems * response data injection systems * etc. ## Simple and Powerful
Registered: Sun Sep 07 07:19:17 UTC 2025 - Last Modified: Mon Nov 18 02:25:44 UTC 2024 - 10.3K bytes - Viewed (0) -
README.md
controller.setDefaultIntervalTime(1000); }); ``` ### Sitemap Support ```java // Enable sitemap processing container.singleton("sitemapsRule", SitemapsRule.class, rule -> { rule.addRule("url", ".*sitemap.*"); }); // Add sitemap URL crawler.addUrl("https://example.com/sitemap.xml"); ``` ## Data Access and Storage ### Accessing Crawled Data ```java
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
docs/pt/docs/alternatives.md
## Previous tools ### <a href="https://www.djangoproject.com/" class="external-link" target="_blank">Django</a> It is the most popular and widely trusted framework. It is used to build systems like _Instagram_. It is tightly coupled to relational databases (like MySQL or PostgreSQL), so using a NoSQL database (like Couchbase, MongoDB, Cassandra, etc.) as the main storage engine is not very easy.
Registered: Sun Sep 07 07:19:17 UTC 2025 - Last Modified: Sat Nov 09 16:39:20 UTC 2024 - 25.5K bytes - Viewed (0) -
docs/es/docs/alternatives.md
Registered: Sun Sep 07 07:19:17 UTC 2025 - Last Modified: Mon Dec 30 18:26:57 UTC 2024 - 25.4K bytes - Viewed (0)