Search Options

Results per page
Sort
Preferred Languages
Advance

Results 21 - 30 of 47 for using (0.01 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/UrlConvertHelper.java

     * based on a map of target strings and their corresponding replacements. It allows
     * adding new conversion rules, setting the entire conversion map, and converting
     * URLs using these rules.</p>
     *
     * <p>The conversion is performed by iterating through the conversion map and applying
     * each replacement rule sequentially. The order of the rules in the map is preserved
     * during the conversion process.</p>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 3.1K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/PdfExtractor.java

    import org.codelibs.fess.crawler.extractor.Extractor;
    import org.codelibs.fess.crawler.extractor.ExtractorFactory;
    import org.codelibs.fess.crawler.helper.MimeTypeHelper;
    
    /**
     * PdfExtractor extracts text content from PDF files using Apache PDFBox.
     * It supports password-protected PDFs and can extract embedded documents and annotations.
     *
     * <p>The extractor runs text extraction in a separate thread with a configurable timeout
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 12.7K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/impl/MimeTypeHelperImpl.java

        protected boolean useFilenameOnOctetStream = true;
    
        /**
         * Creates a new MimeTypeHelperImpl instance.
         * Initializes the MimeTypes instance using the default configuration.
         * @throws CrawlerSystemException if the MIME types configuration cannot be loaded
         */
        public MimeTypeHelperImpl() {
            try {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 6.5K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/ApiExtractor.java

    import org.codelibs.fess.crawler.exception.ExtractException;
    
    import com.google.common.base.Charsets;
    
    import jakarta.annotation.PostConstruct;
    import jakarta.annotation.PreDestroy;
    
    /**
     * Extract a text by using external http server.
     */
    public class ApiExtractor extends AbstractExtractor {
    
        private static final Logger logger = LogManager.getLogger(ApiExtractor.class);
    
        /** The URL of the API endpoint. */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 12.2K bytes
    - Viewed (0)
  5. src/main/java/org/codelibs/fess/suggest/settings/SuggestSettings.java

     *
     * <p>Default settings and array settings can be customized using:</p>
     * <ul>
     *   <li>{@link #defaultSettings()}</li>
     *   <li>{@link #defaultArraySettings()}</li>
     * </ul>
     *
     * <p>Index settings can be loaded from a JSON file using:</p>
     * <ul>
     *   <li>{@link #loadIndexSettings()}</li>
     * </ul>
     *
     * <p>A builder for SuggestSettings can be obtained using:</p>
     * <ul>
     *   <li>{@link #builder()}</li>
     * </ul>
     *
    Registered: Fri Sep 19 09:08:11 UTC 2025
    - Last Modified: Thu Aug 07 02:41:28 UTC 2025
    - 18.7K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/container/StandardCrawlerContainer.java

     *   <li>Managing singleton instances with lifecycle hooks</li>
     *   <li>Creating prototype instances on demand</li>
     *   <li>Dependency injection using {@code @Resource} annotation</li>
     *   <li>Lifecycle management using {@code @PostConstruct} and {@code @PreDestroy} annotations</li>
     * </ul>
     *
     * <p>Components can be registered in two ways:
     * <ul>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14.3K bytes
    - Viewed (0)
  7. README.md

    - **CrawlerContext**: Execution context and configuration
    - **CrawlerThread**: Individual crawler thread implementation
    
    #### Client Architecture
    - **HcHttpClient**: HTTP/HTTPS client using Apache HttpComponents
    - **FileSystemClient**: File system access
    - **FtpClient**: FTP protocol support
    - **SmbClient**: SMB/CIFS network shares
    - **StorageClient**: Cloud storage integration
    
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  8. src/main/java/org/codelibs/fess/suggest/util/SuggestUtil.java

            final ReadingConverterChain chain = new ReadingConverterChain();
            chain.addConverter(new KatakanaToAlphabetConverter());
            return chain;
        }
    
        /**
         * Creates a default normalizer using the provided client and suggest settings.
         * The normalizer chain includes an AnalyzerNormalizer.
         *
         * @param client the client to be used for creating the normalizer
    Registered: Fri Sep 19 09:08:11 UTC 2025
    - Last Modified: Mon Sep 01 13:33:03 UTC 2025
    - 17.4K bytes
    - Viewed (1)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlExtractor.java

                });
                return extractData;
            } finally {
                xpathAPI.remove();
            }
        }
    
        /**
         * Extracts strings from a document using the specified XPath expression.
         *
         * @param document the DOM document to extract strings from
         * @param path the XPath expression to evaluate
         * @return an array of strings extracted from the document
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 9.3K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorFactory.java

                            }
                        }
                    }
                    throw new ExtractException("Failed to extract the content using available extractors.");
                }
    
                @Override
                public int getWeight() {
                    return extractors[0].getWeight();
                }
            };
        }
    
        /**
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 7.3K bytes
    - Viewed (0)
Back to top