- Sort Score
- Result 10 results
- Languages All
Results 1 - 3 of 3 for JS (0.02 sec)
-
README.md
### URL Filtering ```java // Include patterns crawler.urlFilter.addInclude("https://example.com/.*"); crawler.urlFilter.addInclude(".*\\.pdf$"); // Exclude patterns crawler.urlFilter.addExclude(".*\\.js$"); crawler.urlFilter.addExclude(".*login.*"); ``` ## Supported Protocols and Formats ### Protocols - **HTTP/HTTPS**: Full web crawling support with cookies, authentication, redirects
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/filter/UrlFilterTest.java
urlFilter.addExclude(".*\\.(css|js)$"); urlFilter.addExclude(".*\\/admin\\/.*"); urlFilter.addExclude(".*#.*"); assertTrue(urlFilter.match("https://example.com/page.html")); assertFalse(urlFilter.match("https://example.com/style.css")); assertFalse(urlFilter.match("https://example.com/script.js")); assertFalse(urlFilter.match("https://example.com/admin/login"));Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Wed Sep 03 14:42:53 UTC 2025 - 19K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractorTest.java
} public void test_getTika_js() { final InputStream in = ResourceUtil.getResourceAsStream("extractor/program/test.js"); final Map<String, String> params = new HashMap<String, String>(); params.put("Content-Type", "text/plain"); params.put("resourceName", "test.js"); final ExtractData extractData = tikaExtractor.getText(in, params); final String content = extractData.getContent();Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 30.6K bytes - Viewed (0)