Results 31 - 40 of 43 for readers (0.03 sec)

  1. src/main/java/org/codelibs/fess/suggest/index/contents/document/DocumentReader.java

     * should also handle resource cleanup when the {@link #close()} method is called.</p>
     */
    public interface DocumentReader extends Closeable {
        /**
         * Reads a document and returns its contents as a map.
         *
         * @return a map containing the document's data, or null if there are no more documents to read.
         */
        Map<String, Object> read();
    
        @Override
    Registered: Fri Sep 19 09:08:11 UTC 2025
    - Last Modified: Fri Jul 04 14:00:23 UTC 2025
    - 1.4K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/XpathTransformer.java

            }
            resultData.setEncoding(charsetName);
        }
    
        /**
         * Returns the result data header.
         * @return The result data header.
         */
        protected String getResultDataHeader() {
            // TODO: Support other XML header types
            return "<?xml version=\"1.0\"?>\n<doc>\n";
        }
    
        /**
         * Returns the result data body for a single value.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 13.1K bytes
    - Viewed (0)
  3. README.md

    mvn clean install -pl fess-crawler
    
    # Generate test coverage report
    mvn jacoco:report
    ```
    
    ### Code Quality
    
    ```bash
    # Format code
    mvn formatter:format
    
    # Update license headers
    mvn license:format
    
    # Run static analysis
    mvn spotbugs:check
    ```
    
    ### Running Tests
    
    ```bash
    # Run all tests
    mvn test
    
    # Run specific test class
    mvn test -Dtest=CrawlerTest
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/util/TemporaryFileInputStream.java

            }
        }
    
        /**
         * Marks the current position in this input stream. A subsequent call to the reset method repositions this stream at the last marked position so that subsequent reads re-read the same bytes.
         * This method delegates to {@link FileInputStream#mark(int)}.
         *
         * @param readlimit the maximum limit of bytes that can be read before the mark position becomes invalid
         */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 4.3K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/BinaryTransformer.java

     * as a ByteArrayInputStream.
     * </p>
     *
     * <p>
     * The transform method takes a ResponseData object, checks if it has a response body,
     * and then reads the body into a byte array. This byte array is then set as the data
     * in the ResultData object.
     * </p>
     *
     * <p>
     * The getData method takes an AccessResultData object, checks if the transformer name matches,
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 3.8K bytes
    - Viewed (0)
  6. fess-crawler-opensearch/src/main/java/org/codelibs/fess/crawler/client/FesenClient.java

            client.execute(action, request, listener);
        }
    
        @Override
        public Client filterWithHeader(final Map<String, String> headers) {
            client.filterWithHeader(headers);
            return this;
        }
    
        /**
         * Deletes documents matching the specified query using scroll and bulk delete.
         *
         * @param index The index to delete from.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 25.3K bytes
    - Viewed (0)
  7. fess-crawler/src/test/java/org/codelibs/fess/crawler/CrawlerContextTest.java

                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                }
            });
    
            Thread reader = new Thread(new Runnable() {
                @Override
                public void run() {
                    try {
                        statusSetLatch.countDown();
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sat Sep 06 04:15:37 UTC 2025
    - 25.6K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java

     *   <li>Storing the HTML content as data in the {@link ResultData}.</li>
     *   <li>Extracting child URLs from the HTML content based on configured rules.</li>
     *   <li>Handling redirect URLs specified in the response headers.</li>
     * </ol>
     * <p>
     * The class also provides methods for configuring features and properties of the
     * underlying DOM parser, as well as defining rules for extracting child URLs
     * from specific HTML tags and attributes.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 28.5K bytes
    - Viewed (0)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapUrl.java

         * Datetime format. This format allows you to omit the time portion, if
         * desired, and use YYYY-MM-DD.
         *
         * Note that this tag is separate from the If-Modified-Since (304) header
         * the server can return, and search engines may use the information from
         * both sources differently.
         */
        private String lastmod;
    
        /**
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 6.5K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/XmlTransformer.java

                    logger.debug("Failed to retrieval a cache.", e);
                }
                return new XPathAPI();
            }
        }
    
        /**
         * Returns the header for the result data.
         * @return The result data header.
         */
        protected String getResultDataHeader() {
            // TODO support other type
            return "<?xml version=\"1.0\"?>\n<doc>\n";
        }
    
        /**
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 23.9K bytes
    - Viewed (0)
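
The Javadoc in result 1 describes a contract where `read()` returns one document per call and `null` once there are no more, with cleanup done in `close()`. The following is a minimal sketch of how a caller might consume such a reader. It assumes a local stand-in interface that mirrors the snippet. The `close()` signature is a guess, because the snippet is cut off after `@Override`. `ListDocumentReader` and `readAll` are hypothetical names introduced only for illustration and are not part of the indexed project.

```java
import java.io.Closeable;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Map;

public class DocumentReaderSketch {

    /** Local stand-in mirroring the interface in result 1; not the real project type. */
    public interface DocumentReader extends Closeable {
        /** Returns the next document, or null when there are no more documents. */
        Map<String, Object> read();

        /** Assumption: close() declares no checked exception (the snippet is truncated here). */
        @Override
        void close();
    }

    /** Hypothetical in-memory reader used only to exercise the contract. */
    public static class ListDocumentReader implements DocumentReader {
        private final Iterator<Map<String, Object>> iterator;
        private boolean closed;

        public ListDocumentReader(final List<Map<String, Object>> documents) {
            this.iterator = documents.iterator();
        }

        @Override
        public Map<String, Object> read() {
            // null signals "no more documents", as described in the Javadoc.
            return !closed && iterator.hasNext() ? iterator.next() : null;
        }

        @Override
        public void close() {
            closed = true;
        }

        public boolean isClosed() {
            return closed;
        }
    }

    /** Drains the reader until read() returns null, then closes it. */
    public static List<Map<String, Object>> readAll(final DocumentReader reader) {
        final List<Map<String, Object>> documents = new ArrayList<>();
        // try-with-resources closes the reader even if read() throws.
        try (DocumentReader r = reader) {
            Map<String, Object> document;
            while ((document = r.read()) != null) {
                documents.add(document);
            }
        }
        return documents;
    }

    public static void main(final String[] args) {
        final ListDocumentReader reader = new ListDocumentReader(List.of(
                Map.<String, Object>of("text", "first"),
                Map.<String, Object>of("text", "second")));
        final List<Map<String, Object>> documents = readAll(reader);
        System.out.println(documents.size() + " documents read, closed=" + reader.isClosed());
    }
}
```

Using `null` as the end-of-input marker keeps the interface to a single method, at the cost of requiring every caller to check for it. The loop above keeps that check in one place.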