- Sort Score
- Result 10 results
- Languages All
Results 1 - 10 of 38 for phases (0.05 sec)
-
README.md
- **HTTP/HTTPS**: Full web crawling support with cookies, authentication, redirects - **File System**: Local and network file system access - **FTP**: FTP server crawling with authentication - **SMB/CIFS**: Windows network shares - **Storage**: Cloud storage systems (MinIO, S3-compatible) ### Content Formats #### Office Documents - Microsoft Office (Word, Excel, PowerPoint) - OpenOffice/LibreOffice documents - RTF, WordPerfect
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Aug 31 05:32:52 UTC 2025 - 15.3K bytes - Viewed (0) -
pom.xml
<build> <pluginManagement> <plugins> <plugin> <groupId>org.apache.maven.plugins</groupId> <artifactId>maven-shade-plugin</artifactId> <executions> <execution> <phase>package</phase> <goals> <goal>shade</goal> </goals> <configuration> <artifactSet> <includes> <include>org.dbflute:dbflute-runtime</include> </includes>
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 2.8K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapUrl.java
* when making decisions, they may crawl pages marked "hourly" less * frequently than that, and they may crawl pages marked "yearly" more * frequently than that. Crawlers may periodically crawl pages marked * "never" so that they can handle unexpected changes to those pages. */ private String changefreq; /**
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 6.5K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/SitemapsHelper.java
* * @param in Input stream for a sitemap * @return a sitemap set */ public SitemapSet parse(final InputStream in) { return parse(in, true); } /** * Parses a sitemap from the given input stream. * @param in the input stream to parse * @param recursive whether to recursively parse compressed files * @return the parsed sitemap set */Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 14.7K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/suggest/index/contents/ContentsParser.java
import org.codelibs.fess.suggest.normalizer.Normalizer; /** * Interface for parsing content and creating suggest items. */ public interface ContentsParser { /** * Parses the given search words and creates a SuggestItem. * * @param words the array of search words * @param readings the array of readings corresponding to the search wordsRegistered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Sat Mar 15 06:51:20 UTC 2025 - 4.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/log/LogType.java
* Each enum constant represents a specific event or state in the crawler's execution. */ public enum LogType { /** Indicates the start of a crawling process. */ START_CRAWLING, /** Indicates the cleanup phase of crawling. */ CLEANUP_CRAWLING, /** Indicates an unsupported URL was encountered when crawling started. */ UNSUPPORTED_URL_AT_CRAWLING_STARTED, /** Indicates checking the last modified date of a resource. */
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 2.4K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/util/XmlUtil.java
* <li>{@link #escapeXml(String)}: Escapes special characters in an XML string.</li> * <li>{@link #stripInvalidXMLCharacters(String)}: Strips invalid XML characters from a string.</li> * <li>{@link #getDataMap(AccessResultData)}: Parses XML content from {@link AccessResultData} and returns a map of the data.</li> * </ul> * * <h2>Usage Example:</h2> * <pre> * {@code * String escapedXml = XmlUtil.escapeXml("<tag>value</tag>");
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 9.4K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/RobotsTxtHelper.java
// Default constructor } /** * Parses a robots.txt file from the given input stream using UTF-8 encoding. * @param stream the input stream to parse * @return the parsed RobotsTxt object, or null if disabled */ public RobotsTxt parse(final InputStream stream) { return parse(stream, Constants.UTF_8); } /**Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 7.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb1/SmbAuthenticationHolder.java
* with a specific path prefix. When a path is provided, it iterates through the stored * authentications to find the one whose path prefix matches the beginning of the given path. * This allows for different SMB shares to use different authentication credentials.</p> * */ public class SmbAuthenticationHolder { private final Map<String, SmbAuthentication> authMap = new HashMap<>(); /**Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 2.2K bytes - Viewed (0) -
LICENSE
"control" means (i) the power, direct or indirect, to cause the direction or management of such entity, whether by contract or otherwise, or (ii) ownership of fifty percent (50%) or more of the outstanding shares, or (iii) beneficial ownership of such entity. "You" (or "Your") shall mean an individual or Legal Entity exercising permissions granted by this License.
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Mon Jan 11 04:30:09 UTC 2021 - 11.1K bytes - Viewed (0)