Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 138 for Limits (0.04 sec)

  1. CLAUDE.md

    - Thread-local storage via `CrawlingParameterUtil`
    
    **Resource Management**:
    - `AutoCloseable` throughout
    - `DeferredFileOutputStream` for large responses (temp files for >1MB)
    - Connection pooling with limits
    - Background temp file deletion via `FileUtil.deleteInBackground()`
    
    **Fault Tolerance**:
    - `FaultTolerantClient` wrapper (retry, circuit breaker)
    - Graceful degradation (e.g., robots.txt parsing continues on errors)
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/CsvExtractor.java

     *   <li>Header row detection and extraction</li>
     *   <li>Column name to data value association</li>
     *   <li>Quoted field handling</li>
     *   <li>Column names as metadata</li>
     *   <li>Configurable encoding and row limits</li>
     * </ul>
     */
    public class CsvExtractor extends AbstractExtractor {
        /** Logger instance for this class. */
        private static final Logger logger = LogManager.getLogger(CsvExtractor.class);
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Dec 11 08:38:29 UTC 2025
    - 12.8K bytes
    - Viewed (0)
  3. okhttp/src/commonJvmAndroid/kotlin/okhttp3/Dispatcher.kt

            field = maxRequests
          }
          promoteAndExecute()
        }
    
      /**
       * The maximum number of requests for each host to execute concurrently. This limits requests by
       * the URL's host name. Note that concurrent requests to a single IP address may still exceed this
       * limit: multiple hostnames may share an IP address or be routed through the same HTTP proxy.
       *
    Registered: Fri Dec 26 11:42:13 UTC 2025
    - Last Modified: Tue Oct 07 14:16:22 UTC 2025
    - 9.9K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/fs/FileSystemClient.java

        public static final String FS_FILE_GROUPS = "fsFileGroups";
    
        /** Character encoding for files */
        protected String charset = Constants.UTF_8;
    
        /** Helper for managing content length limits */
        @Resource
        protected ContentLengthHelper contentLengthHelper;
    
        /** Flag to track initialization status */
        protected AtomicBoolean isInit = new AtomicBoolean(false);
    
        /**
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 12:19:14 UTC 2025
    - 15.1K bytes
    - Viewed (0)
  5. src/main/java/org/codelibs/fess/job/CrawlJob.java

     * handles timeout scenarios, and ensures proper cleanup of resources.</p>
     *
     * <p>Key features:</p>
     * <ul>
     *   <li>Concurrent crawler process management with configurable limits</li>
     *   <li>Selective crawling based on configuration IDs</li>
     *   <li>Document expiration handling</li>
     *   <li>Hot thread monitoring for performance analysis</li>
     *   <li>Process isolation with separate JVM</li>
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 19.6K bytes
    - Viewed (0)
  6. src/main/java/org/codelibs/fess/ds/callback/FileListIndexUpdateCallbackImpl.java

     * This callback processes file events (create, modify, delete) and manages document indexing and deletion
     * operations in the search engine. It supports recursive crawling with configurable depth and access count limits.
     *
     * <p>The implementation uses an executor service for concurrent processing of file operations and maintains
     * a cache of URLs to be deleted for batch processing. It handles redirect following and child URL discovery
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 29.7K bytes
    - Viewed (3)
  7. scripts/people.py

        while discussion_edges:
            for discussion_edge in discussion_edges:
                discussion_nodes.append(discussion_edge.node)
            last_edge = discussion_edges[-1]
            # Handle GitHub secondary rate limits, requests per minute
            time.sleep(settings.sleep_interval)
            discussion_edges = get_graphql_question_discussion_edges(
                settings=settings, after=last_edge.cursor
            )
    Registered: Sun Dec 28 07:19:09 UTC 2025
    - Last Modified: Wed Dec 17 21:25:59 UTC 2025
    - 12.3K bytes
    - Viewed (0)
  8. CHANGELOG/CHANGELOG-1.34.md

      - Added validation to enforce the hugepage aggregated container limits to be smaller than or equal to pod-level limits. This was already enforced with the defaulted requests from the specified limits, however it did not make it clear about both hugepage requests and limits. ([#131089](https://github.com/kubernetes/kubernetes/pull/131089), [@KevinTMtz](https://github.com/KevinTMtz)) [SIG Apps, Node and Testing]
    Registered: Fri Dec 26 09:05:12 UTC 2025
    - Last Modified: Wed Dec 10 01:13:50 UTC 2025
    - 333.3K bytes
    - Viewed (1)
  9. src/main/java/org/codelibs/fess/helper/DataIndexHelper.java

         *
         * <p>The method:</p>
         * <ul>
         *   <li>Creates crawler threads for each data configuration</li>
         *   <li>Manages concurrent execution based on thread count limits</li>
         *   <li>Monitors thread completion and handles cleanup</li>
         *   <li>Records execution timing and statistics</li>
         * </ul>
         *
         * @param sessionId unique identifier for this crawling session
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 19K bytes
    - Viewed (0)
  10. src/main/java/org/codelibs/fess/helper/DocumentHelper.java

                return StringUtil.EMPTY; // empty
            }
        }
    
        /**
         * Processes and normalizes document content.
         * Applies text normalization including duplicate term removal, size limits,
         * and space character handling. May preserve original content based on configuration.
         *
         * @param crawlingConfig the crawling configuration containing processing parameters
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 17.4K bytes
    - Viewed (0)
Back to top