- Sort Score
- Result 10 results
- Languages All
Results 1 - 10 of 10 for fixtures (0.14 sec)
-
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlXpathExtractor.java
} propertyMap.put(key, value); } /** * Gets the map of parser features. * * @return the feature map */ public Map<String, String> getFeatureMap() { return featureMap; } /** * Sets the map of parser features. * * @param featureMap the feature map to set */Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10.3K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java
* </ol> * <p> * The class also provides methods for configuring features and properties of the * underlying DOM parser, as well as defining rules for extracting child URLs * from specific HTML tags and attributes. * </p> * * <p> * <b>Configuration:</b> * </p> * <ul> * <li><b>featureMap:</b> A map of features to be set on the DOM parser.</li>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 28.5K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/transformer/impl/XpathTransformerTest.java
{ xpathTransformer = new XpathTransformer(); xpathTransformer.setName("xpathTransformer"); Map<String, String> featureMap = newHashMap(); featureMap.put("http://xml.org/sax/features/namespaces", "false"); xpathTransformer.setFeatureMap(featureMap); Map<String, String> propertyMap = newHashMap(); xpathTransformer.setPropertyMap(propertyMap);Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Mar 15 06:52:00 UTC 2025 - 12.5K bytes - Viewed (0) -
README.md
### Key Features - **Multi-Protocol Support**: HTTP/HTTPS, File System, FTP, SMB/CIFS, Cloud Storage (MinIO, S3) - **Comprehensive Content Extraction**: Office documents, PDFs, archives, images, audio/video files
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java
* * <p>The robots.txt protocol is implemented according to the standard specification, * supporting pattern matching for user agents, path-based access control, and crawl delay settings.</p> * * <p>Key features:</p> * <ul> * <li>Supports multiple user-agent directives with pattern matching</li> * <li>Handles Allow and Disallow rules for path-based access control</li> * <li>Manages crawl delay settings per user agent</li>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/PdfExtractor.java
* to prevent hanging on problematic PDF files. It also extracts metadata from the PDF * document and includes it in the extraction result. * * <p>Features: * <ul> * <li>Text extraction from PDF pages</li> * <li>Embedded document extraction</li> * <li>Annotation extraction (file attachments)</li> * <li>Metadata extraction</li>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 12.7K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformerTest.java
super.setUp(); htmlTransformer = new HtmlTransformer(); htmlTransformer.setName("htmlTransformer"); Map<String, String> featureMap = newHashMap(); featureMap.put("http://xml.org/sax/features/namespaces", "false"); htmlTransformer.setFeatureMap(featureMap); Map<String, String> propertyMap = newHashMap(); htmlTransformer.setPropertyMap(propertyMap);
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Mar 15 06:52:00 UTC 2025 - 13.8K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerTest.java
@Override protected void setUp() throws Exception { super.setUp(); final Map<String, String> featureMap = newHashMap(); featureMap.put("http://xml.org/sax/features/namespaces", "false"); final Map<String, String> propertyMap = newHashMap(); final Map<String, String> childUrlRuleMap = newHashMap(); childUrlRuleMap.put("//A", "href");Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 19.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/storage/StorageClient.java
* <li>readTimeout - Read timeout in milliseconds (default: 10000)</li> * </ul> * * <p>The client supports URLs in the format: {@code storage://bucket-name/object-path} * * <p>Features: * <ul> * <li>Automatic initialization of MinIO client</li> * <li>Support for HEAD and GET operations</li> * <li>Content length validation</li> * <li>MIME type detection</li>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 17.9K bytes - Viewed (2) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/XmlTransformer.java
*/ protected boolean includeAware; /** * A map of attributes. */ protected final Map<String, Object> attributeMap = new HashMap<>(); /** * A map of features. */ protected final Map<String, String> featureMap = new HashMap<>(); /** * A map of field rules. */ protected Map<String, String> fieldRuleMap = new LinkedHashMap<>();Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 23.9K bytes - Viewed (0)