Results 1 - 10 of 12 for General (0.1 sec)
fess-crawler/src/main/java/org/codelibs/fess/crawler/Constants.java
/** * The feature for external general entities in XML. */ public static final String FEATURE_EXTERNAL_GENERAL_ENTITIES = "http://xml.org/sax/features/external-general-entities"; /** * Feature for external parameter entities in XML. */
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 3.6K bytes - Viewed (0) -
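The feature URI in the Constants.java snippet above is the standard SAX feature name for external general entities. As a minimal sketch of how such a constant is typically used (the class and method names here are hypothetical and are not taken from fess-crawler), a JAXP parser factory can be asked to turn the feature off so that external general entities are not resolved:

```java
import javax.xml.parsers.SAXParserFactory;

public class ExternalEntitiesSketch {
    // Same value as FEATURE_EXTERNAL_GENERAL_ENTITIES in the snippet above.
    static final String FEATURE_EXTERNAL_GENERAL_ENTITIES =
            "http://xml.org/sax/features/external-general-entities";

    // Hypothetical helper: builds a SAX parser factory with external
    // general entities disabled.
    static SAXParserFactory newHardenedFactory() throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setFeature(FEATURE_EXTERNAL_GENERAL_ENTITIES, false);
        return factory;
    }

    public static void main(String[] args) throws Exception {
        SAXParserFactory factory = newHardenedFactory();
        // Reads the feature back from the configured factory.
        System.out.println(factory.getFeature(FEATURE_EXTERNAL_GENERAL_ENTITIES));
    }
}
```

Whether fess-crawler applies the constant in exactly this way is not visible in the snippet; the sketch only shows the standard `SAXParserFactory.setFeature` call.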
fess-crawler/src/main/java/org/codelibs/fess/crawler/log/LogType.java
/** Indicates processing a child URL due to an exception. */ PROCESS_CHILD_URL_BY_EXCEPTION, /** Indicates an access exception during crawling. */ CRAWLING_ACCESS_EXCEPTION, /** Indicates a general exception during crawling. */ CRAWLING_EXCEPTION, /** Indicates no URL is available in the queue. */ NO_URL_IN_QUEUE, /** Indicates the start of a crawler thread. */ START_THREAD,
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 2.4K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapUrl.java
* both sources differently. */ private String lastmod; /** * How frequently the page is likely to change. This value provides general * information to search engines and may not correlate exactly to how often * they crawl the page. Valid values are: * <ul> * <li>always</li> * <li>hourly</li> * <li>daily</li>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 6.5K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/RobotsTxt.java
} /** * Returns the most specific directive matching the given user agent. * The method finds the longest matching user agent pattern in the directives, * excluding the general "*" pattern which matches all bots. * * @param userAgent the user agent string to match against directives, * can be null (treated as empty string)
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10K bytes - Viewed (0) -
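The Javadoc in the RobotsTxt.java snippet above describes picking the longest matching user agent pattern while skipping the general "*" pattern. A rough sketch of that idea follows. It is not the fess-crawler implementation: the class and method names are hypothetical, and "matching" is assumed here to mean a case-insensitive substring test, which the snippet does not confirm.

```java
import java.util.List;
import java.util.Locale;
import java.util.Optional;

public class LongestAgentMatchSketch {
    // Hypothetical helper: returns the longest pattern contained in the
    // user agent, ignoring the catch-all "*" pattern.
    static Optional<String> mostSpecificPattern(List<String> patterns, String userAgent) {
        // A null user agent is treated as an empty string, as the Javadoc states.
        String ua = (userAgent == null ? "" : userAgent).toLowerCase(Locale.ROOT);
        String best = null;
        for (String pattern : patterns) {
            if ("*".equals(pattern)) {
                continue; // the general pattern is excluded from this search
            }
            String p = pattern.toLowerCase(Locale.ROOT);
            // Assumption: a pattern matches when it occurs inside the user agent.
            if (ua.contains(p) && (best == null || p.length() > best.length())) {
                best = p;
            }
        }
        return Optional.ofNullable(best);
    }

    public static void main(String[] args) {
        List<String> patterns = List.of("*", "bot", "fessbot");
        System.out.println(mostSpecificPattern(patterns, "FessBot/1.0"));
    }
}
```

A caller would presumably fall back to the "*" directive when this lookup returns an empty result, though the snippet does not show that step.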
fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml
* The main mime type should be the canonical one, use aliases for any other widely used forms * Where there's a hierarchy in the types, list it via a parent * Highly specific magic matches get a high priority * General magic matches which could trigger a false-positive need a low one * The priority for containers normally need to be higher than for the things they contain, so they don't accidently get detected
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Mar 13 08:18:01 UTC 2025 - 320.1K bytes - Viewed (1) -
README.md
[](https://opensource.org/licenses/Apache-2.0) [](https://maven-badges.herokuapp.com/maven-central/org.codelibs.fess/fess-crawler-parent) ## Overview
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
pom.xml
</configuration> </execution> </executions> </plugin> </plugins> </pluginManagement> </build> <repositories> <repository> <id>snapshots.central.sonatype.com</id> <url>https://central.sonatype.com/repository/maven-snapshots</url> <releases> <enabled>false</enabled> </releases> <snapshots> <enabled>true</enabled> </snapshots> </repository>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 2.8K bytes - Viewed (0) -
pom.xml
<plugin> <groupId>org.jacoco</groupId> <artifactId>jacoco-maven-plugin</artifactId> </plugin> </plugins> </build> <repositories> <repository> <id>snapshots.central.sonatype.com</id> <url>https://central.sonatype.com/repository/maven-snapshots</url> <releases> <enabled>false</enabled> </releases> <snapshots> <enabled>true</enabled> </snapshots> </repository>
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Sat Sep 06 03:14:57 UTC 2025 - 4.3K bytes - Viewed (0) -
README.md
[](https://opensource.org/licenses/Apache-2.0) [](https://maven-badges.herokuapp.com/maven-central/org.codelibs.fess/fess-suggest)
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Sun Aug 31 03:31:14 UTC 2025 - 12.1K bytes - Viewed (1) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/FileTransformer.java
* <li>Handle duplicated file paths by appending a counter.</li> * <li>Store the file path in the result data.</li> * <li>Retrieve the stored file as a File object.</li> * </ul> * * <p> * The class uses several configurable properties to customize the file storage behavior, * such as the base path, replacement strings for special characters in URLs,
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 11.7K bytes - Viewed (0)