Results 11 - 20 of 41 for Reader (0.03 sec)
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlExtractor.java
super(); } @Override protected ExtractData createExtractData(final String content) { final DOMParser parser = getDomParser(); try (final Reader reader = new StringReader(content)) { parser.parse(new InputSource(reader)); } catch (final Exception e) { logger.warn("Failed to parse the content.", e); return new ExtractData(extractString(content)); }
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 9.3K bytes - Viewed (0) -
README.md
// Index suggestions from existing Elasticsearch documents DocumentReader reader = new ESSourceReader( client, suggester.settings(), "content-index", // source index "document" // document type ); suggester.indexer() .indexFromDocument(reader, 2, 100) // threads=2, batch=100 .getResponse(); ``` ### Index from Query Logs ```java
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Sun Aug 31 03:31:14 UTC 2025 - 12.1K bytes - Viewed (1) -
src/main/java/org/codelibs/fess/suggest/index/contents/querylog/QueryLogReader.java
*/ package org.codelibs.fess.suggest.index.contents.querylog; import java.io.Closeable; /** * The {@code QueryLogReader} interface provides methods to read query logs and close the reader. * It extends the {@code Closeable} interface, ensuring that resources can be released when no longer needed. */ public interface QueryLogReader extends Closeable { /** * Reads a query log.
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Fri Jul 04 14:00:23 UTC 2025 - 1.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractor.java
if (!dfos.isInMemory()) { tempFile = dfos.getFile(); } try (Reader reader = new InputStreamReader(getContentStream(dfos), enc)) { if (normalizeText) { return TextUtil.normalizeText(reader) .initialCapacity(initialBufferSize) .maxAlphanumTermSize(maxAlphanumTermSize)
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Thu Aug 07 02:55:08 UTC 2025 - 30.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/RobotsTxtHelper.java
final BufferedReader reader = new BufferedReader(new InputStreamReader(new BOMInputStream(stream), charsetName)); String line; final RobotsTxt robotsTxt = new RobotsTxt(); final List<Directive> currentDirectiveList = new ArrayList<>(); boolean isGroupRecordStarted = false; while ((line = reader.readLine()) != null) {
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 7.7K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/suggest/index/SuggestIndexer.java
throw new SuggestIndexException("Failed to index from query_string.", e); } } /** * Indexes documents from a query log reader asynchronously. * @param queryLogReader The query log reader. * @param docPerReq The number of documents to process per request. * @param requestInterval The interval between requests.
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Thu Aug 07 02:41:28 UTC 2025 - 34.8K bytes - Viewed (0) -
src/test/java/org/codelibs/fess/suggest/SuggesterTest.java
CountDownLatch latch = new CountDownLatch(1); AtomicInteger numObInputDoc = new AtomicInteger(0); ESSourceReader reader = new ESSourceReader(client, suggester.settings(), indexName); reader.setScrollSize(1000); suggester.indexer().indexFromDocument(() -> reader, 1000, () -> ThreadUtil.sleep(100)).then(response -> { numObInputDoc.set(response.getNumberOfInputDocs());
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Thu Aug 07 02:41:28 UTC 2025 - 37.2K bytes - Viewed (0) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sat Sep 06 04:15:37 UTC 2025 - 25.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/EmlExtractor.java
final ExtractData data = new ExtractData(content != null ? content : StringUtil.EMPTY); final Enumeration<Header> headers = message.getAllHeaders(); while (headers.hasMoreElements()) { final Header header = headers.nextElement(); data.putValue(header.getName(), header.getValue()); } putValue(data, "Content-ID", message.getContentID());
Registered: Sun Sep 21 03:50:09 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 12.6K bytes - Viewed (0) -
src/etc/header.txt
Shinsuke Sugaya <******@****.***> 1390632593 +0900
Registered: Fri Sep 19 09:08:11 UTC 2025 - Last Modified: Sat Jan 25 06:49:53 UTC 2014 - 586 bytes - Viewed (0)