Results 51 - 60 of 148 for getText (0.11 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/TextTransformer.java

            params.put(ExtractData.CONTENT_TYPE, responseData.getMimeType());
            String content = null;
            try (final InputStream in = responseData.getResponseBody()) {
                content = extractor.getText(in, params).getContent();
            } catch (final Exception e) {
                throw new CrawlingAccessException("Could not extract data.", e);
            }
    
            final ResultData resultData = new ResultData();
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 6.5K bytes
    - Viewed (0)
  2. src/main/java/org/codelibs/fess/suggest/util/SuggestUtil.java

         */
        public static String createBulkLine(final String index, final String type, final SuggestItem item) {
            if (item == null || item.getId() == null || item.getText() == null) {
                throw new SuggesterException("Invalid SuggestItem: item, id, or text is null");
            }
    
            final Map<String, Object> firstLineMap = new HashMap<>();
    Registered: Sat Dec 20 13:04:59 UTC 2025
    - Last Modified: Sun Nov 23 11:21:40 UTC 2025
    - 17.5K bytes
    - Viewed (1)
  3. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/EXTRACTOR_TESTS_README.md

    - Consistency across multiple calls
    
    **Test Count**: 11 tests
    
    **Key Scenarios**:
    - ✅ Validates non-null streams
    - ✅ Throws CrawlerSystemException for null
    - ✅ Called during getText execution
    - ✅ Does not consume or modify stream
    - ✅ Consistent behavior across multiple calls
    - ✅ Works with various InputStream types
    
    ---
    
    ### 5. TextExtractorEnhancedTest.java
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Wed Nov 19 08:55:01 UTC 2025
    - 5.7K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/ExtractorBuilder.java

                    }
                    return new ExtractData(StringUtil.EMPTY);
                } else {
                    try (InputStream is = getContentInputStream(out)) {
                        return extractor.getText(is, params);
                    }
                }
            } catch (final CrawlingAccessException e) {
                throw e;
            } catch (final Exception e) {
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 10.1K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/AbstractXmlExtractor.java

        /**
         * Returns the pattern used to identify tags in the content.
         * @return The tag pattern.
         */
        protected abstract Pattern getTagPattern();
    
        @Override
        public ExtractData getText(final InputStream in, final Map<String, String> params) {
            if (in == null) {
                throw new CrawlerSystemException("XML input stream is null. Cannot extract text from null input.");
            }
            try {
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 12:19:14 UTC 2025
    - 8.6K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/ApiExtractor.java

         * @param params additional parameters
         * @return the extracted data
         * @throws ExtractException if extraction fails
         */
        @Override
        public ExtractData getText(final InputStream in, final Map<String, String> params) {
            if (logger.isDebugEnabled()) {
                logger.debug("Accessing {}", url);
            }
    
            // start
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 12.2K bytes
    - Viewed (0)
  7. src/main/java/org/codelibs/fess/suggest/entity/SuggestItem.java

            id = SuggestUtil.createSuggestTextId(this.text);
        }
    
        /**
         * Returns the text of the suggest item.
         * @return The text.
         */
        public String getText() {
            return text;
        }
    
        /**
         * Returns the readings of the suggest item.
         * @return The readings.
         */
        public String[][] getReadings() {
            return readings;
        }
    Registered: Sat Dec 20 13:04:59 UTC 2025
    - Last Modified: Thu Aug 07 02:41:28 UTC 2025
    - 25.1K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/CsvExtractor.java

        public CsvExtractor() {
            super();
        }
    
        @Override
        public int getWeight() {
            return 2; // Higher priority than TikaExtractor (weight=1)
        }
    
        @Override
        public ExtractData getText(final InputStream in, final Map<String, String> params) {
            validateInputStream(in);
    
            final Charset charset = getCharset(params);
            final List<String[]> rows = new ArrayList<>();
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Dec 11 08:38:29 UTC 2025
    - 12.8K bytes
    - Viewed (0)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/JsonExtractor.java

            super();
        }
    
        @Override
        public int getWeight() {
            return 2; // Higher priority than TikaExtractor (weight=1)
        }
    
        @Override
        public ExtractData getText(final InputStream in, final Map<String, String> params) {
            validateInputStream(in);
    
            try {
                final JsonNode rootNode = objectMapper.readTree(in);
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 03:46:53 UTC 2025
    - 9.7K bytes
    - Viewed (0)
  10. CLAUDE.md

    4. **Add tests**: Unit + integration
    
    ### Adding a Content Extractor
    
    1. **Implement `Extractor`**:
    ```java
    public class MyExtractor extends AbstractExtractor {
        @Override
        public ExtractData getText(InputStream in, Map<String, String> params) {
            ExtractData data = new ExtractData();
            // Extract text
            data.setContent(extractedText);
            return data;
        }
    }
    ```
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)
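
The extractor hits above share one shape: `getText(InputStream, Map<String, String>)` rejects a null stream, reads the input, and returns the extracted content, while `getWeight()` signals priority (the `CsvExtractor` and `JsonExtractor` snippets return 2 to outrank `TikaExtractor` at 1). The sketch below illustrates that shape in a self-contained form. It does not use the fess-crawler classes: `SketchData`, `SketchExtractor`, `PlainTextSketchExtractor`, and the `charset` parameter key are hypothetical stand-ins, and `IllegalArgumentException` is used in place of the `CrawlerSystemException` seen in the results.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-ins so the sketch compiles without fess-crawler on the classpath.
final class ExtractorSketch {

    /** Stand-in for the crawler's ExtractData; holds only the extracted content. */
    static final class SketchData {
        private final String content;

        SketchData(final String content) {
            this.content = content;
        }

        String getContent() {
            return content;
        }
    }

    /** Stand-in for the extractor contract visible in the snippets. */
    interface SketchExtractor {
        SketchData getText(InputStream in, Map<String, String> params);

        default int getWeight() {
            return 1;
        }
    }

    /** Plain-text extractor: validate the stream, pick a charset, read everything. */
    static final class PlainTextSketchExtractor implements SketchExtractor {
        @Override
        public int getWeight() {
            return 2; // assumption: mirrors the weight used by the CSV and JSON hits
        }

        @Override
        public SketchData getText(final InputStream in, final Map<String, String> params) {
            // Fail fast on null input before touching the stream.
            if (in == null) {
                throw new IllegalArgumentException("Input stream is null. Cannot extract text.");
            }
            // "charset" is a hypothetical parameter key; default to UTF-8 when absent.
            final String name = params == null ? null : params.get("charset");
            final Charset charset = name == null ? StandardCharsets.UTF_8 : Charset.forName(name);
            try {
                return new SketchData(new String(in.readAllBytes(), charset));
            } catch (final IOException e) {
                throw new UncheckedIOException("Could not extract data.", e);
            }
        }
    }

    public static void main(final String[] args) throws IOException {
        final SketchExtractor extractor = new PlainTextSketchExtractor();
        final Map<String, String> params = new HashMap<>();
        // The caller owns the stream, as in the TextTransformer hit.
        try (InputStream in = new ByteArrayInputStream("sample text".getBytes(StandardCharsets.UTF_8))) {
            System.out.println(extractor.getText(in, params).getContent());
        }
    }
}
```

As in the `TextTransformer` and `ExtractorBuilder` hits, the caller opens the stream in a try-with-resources block, so the extractor itself never closes it.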