Results 11 - 14 of 14 for Wain (0.01 sec)
fess-crawler/src/main/java/org/codelibs/fess/crawler/util/TextUtil.java
/**
 * Utility class for text normalization and processing.
 *
 * This class provides methods to normalize text by reading characters from a provided Reader
 * and processing them according to specific rules. The main functionality is encapsulated
 * within the nested {@link TextNormalizeContext} class.
 *
 * <p>The text normalization process includes:
 * <ul>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 12K bytes - Viewed (0)
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/FileTransformer.java
 * {@link org.codelibs.fess.crawler.exception.CrawlerSystemException} in case of errors.
 * </p>
 *
 * <p>
 * The {@link #storeData(ResponseData, ResultData)} method is the main entry point for storing
 * the content of a crawled resource. The {@link #getData(AccessResultData)} method retrieves
 * the stored file path as a File object.
 * </p>
 */
public class FileTransformer extends HtmlTransformer {
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 11.7K bytes - Viewed (0)
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java
 * <li><b>invalidUrlPattern:</b> A regular expression pattern used to identify
 * invalid URLs.</li>
 * </ul>
 *
 * <p>
 * <b>Usage:</b>
 * </p>
 * <p>
 * The {@code transform} method is the main entry point for transforming an HTML
 * response. It takes a {@link ResponseData} object as input and returns a
 * {@link ResultData} object containing the extracted data and child URLs.
 * </p>
 */
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 28.5K bytes - Viewed (0)
fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml
Notes:
 * Tika supports a wider range of match types than Freedesktop does
 * Glob patterns must be unique, if there's a clash assign to the most popular format
 * The main mime type should be the canonical one, use aliases for any other widely used forms
 * Where there's a hierarchy in the types, list it via a parent
 * Highly specific magic matches get a high priority
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Mar 13 08:18:01 UTC 2025 - 320.1K bytes - Viewed (1)