Results 51 - 60 of 148 for getText (0.11 sec)
fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/TextTransformer.java
params.put(ExtractData.CONTENT_TYPE, responseData.getMimeType()); String content = null; try (final InputStream in = responseData.getResponseBody()) { content = extractor.getText(in, params).getContent(); } catch (final Exception e) { throw new CrawlingAccessException("Could not extract data.", e); } final ResultData resultData = new ResultData();
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 6.5K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/suggest/util/SuggestUtil.java
*/ public static String createBulkLine(final String index, final String type, final SuggestItem item) { if (item == null || item.getId() == null || item.getText() == null) { throw new SuggesterException("Invalid SuggestItem: item, id, or text is null"); } final Map<String, Object> firstLineMap = new HashMap<>();
Registered: Sat Dec 20 13:04:59 UTC 2025 - Last Modified: Sun Nov 23 11:21:40 UTC 2025 - 17.5K bytes - Viewed (1) -
fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/EXTRACTOR_TESTS_README.md
- Consistency across multiple calls **Test Count**: 11 tests **Key Scenarios**: - ✅ Validates non-null streams - ✅ Throws CrawlerSystemException for null - ✅ Called during getText execution - ✅ Does not consume or modify stream - ✅ Consistent behavior across multiple calls - ✅ Works with various InputStream types --- ### 5. TextExtractorEnhancedTest.java
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Wed Nov 19 08:55:01 UTC 2025 - 5.7K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorBuilder.java
} return new ExtractData(StringUtil.EMPTY); } else { try (InputStream is = getContentInputStream(out)) { return extractor.getText(is, params); } } } catch (final CrawlingAccessException e) { throw e; } catch (final Exception e) {
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Jul 06 02:13:03 UTC 2025 - 10.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/AbstractXmlExtractor.java
/** * Returns the pattern used to identify tags in the content. * @return The tag pattern. */ protected abstract Pattern getTagPattern(); @Override public ExtractData getText(final InputStream in, final Map<String, String> params) { if (in == null) { throw new CrawlerSystemException("XML input stream is null. Cannot extract text from null input."); } try {
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 12:19:14 UTC 2025 - 8.6K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/ApiExtractor.java
* @param params additional parameters * @return the extracted data * @throws ExtractException if extraction fails */ @Override public ExtractData getText(final InputStream in, final Map<String, String> params) { if (logger.isDebugEnabled()) { logger.debug("Accessing {}", url); } // start
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Mon Nov 24 03:59:47 UTC 2025 - 12.2K bytes - Viewed (0) -
src/main/java/org/codelibs/fess/suggest/entity/SuggestItem.java
id = SuggestUtil.createSuggestTextId(this.text); } /** * Returns the text of the suggest item. * @return The text. */ public String getText() { return text; } /** * Returns the readings of the suggest item. * @return The readings. */ public String[][] getReadings() { return readings; }
Registered: Sat Dec 20 13:04:59 UTC 2025 - Last Modified: Thu Aug 07 02:41:28 UTC 2025 - 25.1K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/CsvExtractor.java
public CsvExtractor() { super(); } @Override public int getWeight() { return 2; // Higher priority than TikaExtractor (weight=1) } @Override public ExtractData getText(final InputStream in, final Map<String, String> params) { validateInputStream(in); final Charset charset = getCharset(params); final List<String[]> rows = new ArrayList<>();
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Thu Dec 11 08:38:29 UTC 2025 - 12.8K bytes - Viewed (0) -
fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/JsonExtractor.java
super(); } @Override public int getWeight() { return 2; // Higher priority than TikaExtractor (weight=1) } @Override public ExtractData getText(final InputStream in, final Map<String, String> params) { validateInputStream(in); try { final JsonNode rootNode = objectMapper.readTree(in);
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Sun Nov 23 03:46:53 UTC 2025 - 9.7K bytes - Viewed (0) -
CLAUDE.md
4. **Add tests**: Unit + integration ### Adding a Content Extractor 1. **Implement `Extractor`**: ```java public class MyExtractor extends AbstractExtractor { @Override public ExtractData getText(InputStream in, Map<String, String> params) { ExtractData data = new ExtractData(); // Extract text data.setContent(extractedText); return data; } } ```
Registered: Sat Dec 20 11:21:39 UTC 2025 - Last Modified: Fri Nov 28 17:31:34 UTC 2025 - 10.7K bytes - Viewed (0)