Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 17 for Processing (0.04 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/ResponseData.java

            this.lastModified = lastModified;
        }
    
        /**
         * Gets the processing status of this response.
         *
         * @return the processing status
         */
        public int getStatus() {
            return status;
        }
    
        /**
         * Sets the processing status of this response.
         *
         * @param status the processing status to set
         */
        public void setStatus(final int status) {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 11.6K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/helper/impl/LogHelperImpl.java

     *   <li>Handling unsupported URLs</li>
     *   <li>Checking last modified dates</li>
     *   <li>Getting content</li>
     *   <li>Handling redirects</li>
     *   <li>Processing responses</li>
     *   <li>Handling exceptions during crawling and child URL processing</li>
     *   <li>Handling cases where no URL is in the queue</li>
     *   <li>Handling cases where no response processor or rule is found</li>
     *   <li>Handling system errors</li>
     * </ul>
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 14K bytes
    - Viewed (0)
  3. README.md

    ## Key Features
    
    - **Smart Query Suggestions**: Real-time auto-completion and search suggestions
    - **Multi-language Support**: Built-in support for Japanese text processing with Kuromoji analyzer
    - **Popular Words Analytics**: Track and analyze frequently searched terms
    - **Flexible Text Processing**: Configurable converters and normalizers for text transformation
    - **OpenSearch Integration**: Seamless integration with OpenSearch/Elasticsearch clusters
    Registered: Fri Sep 19 09:08:11 UTC 2025
    - Last Modified: Sun Aug 31 03:31:14 UTC 2025
    - 12.1K bytes
    - Viewed (1)
  4. README.md

    - **SmbClient**: SMB/CIFS network shares
    - **StorageClient**: Cloud storage integration
    
    #### Content Processing Pipeline
    - **Extractors**: Content extraction from various formats
    - **Transformers**: Data transformation and enrichment
    - **Filters**: URL filtering with regex patterns
    - **Rules**: Content processing rules and validation
    
    ## Building and Testing
    
    ### Build Commands
    
    ```bash
    # Build all modules
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/util/TextUtil.java

    import org.apache.logging.log4j.Logger;
    import org.codelibs.core.lang.StringUtil;
    
    /**
     * Utility class for text normalization and processing.
     *
     * This class provides methods to normalize text by reading characters from a provided Reader
     * and processing them according to specific rules. The main functionality is encapsulated
     * within the nested {@link TextNormalizeContext} class.
     *
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 12K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb1/SmbClient.java

                } catch (final Exception e) {
                    if (logger.isDebugEnabled()) {
                        logger.debug("Exception on SID processing.", e);
                    }
                }
            }
        }
    
        /**
         * Preprocesses the URI before processing the request.
         *
         * @param uri the URI to preprocess
         * @return the preprocessed URI
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Sep 18 09:30:45 UTC 2025
    - 23K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/EmlExtractor.java

            }
        }
    
        /**
         * Gets the mail properties used for email processing.
         *
         * @return the mail properties
         */
        public Properties getMailProperties() {
            return mailProperties;
        }
    
        /**
         * Sets the mail properties used for email processing.
         *
         * @param mailProperties the mail properties to set
         */
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 12.6K bytes
    - Viewed (0)
  8. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/smb/SmbClient.java

                } catch (final Exception e) {
                    if (logger.isDebugEnabled()) {
                        logger.debug("Exception on SID processing.", e);
                    }
                }
            }
        }
    
        /**
         * Preprocesses the URI before processing the request.
         *
         * @param uri the URI to preprocess
         * @return the preprocessed URI
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Sep 18 09:30:45 UTC 2025
    - 22.5K bytes
    - Viewed (3)
  9. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TikaExtractor.java

     * <p>
     * The {@link ContentWriter} functional interface is used to abstract the process of writing content to a writer.
     * </p>
     *
     * <p>
     * The class uses temporary files for processing large input streams and ensures that these files are deleted after
     * processing.
     * </p>
     *
     */
    public class TikaExtractor extends PasswordBasedExtractor {
    
        private static final Logger logger = LogManager.getLogger(TikaExtractor.class);
    
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 30.7K bytes
    - Viewed (0)
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/CrawlerThread.java

    import jakarta.annotation.Resource;
    
    /**
     * The {@code CrawlerThread} class represents a thread that executes the crawling process.
     * It is responsible for fetching URLs from the queue, accessing the content,
     * processing the response, and extracting child URLs.
     *
     * <p>
     * This class implements the {@link Runnable} interface, allowing it to be executed in a separate thread.
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Thu Aug 07 02:55:08 UTC 2025
    - 20.4K bytes
    - Viewed (0)
Back to top