Results 1 - 10 of 23 for blocks (0.05 sec)

  1. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/MarkdownExtractorTest.java

            final ExtractData extractData = markdownExtractor.getText(in, null);
            CloseableUtil.closeQuietly(in);
    
            final String content = extractData.getContent();
    
            // Code blocks should be converted to plain text
            assertTrue(content.contains("public class Example") || content.contains("Example"));
        }
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 6.4K bytes
    - Viewed (0)
  2. src/main/java/org/codelibs/fess/helper/DataIndexHelper.java

             */
            public boolean isRunning() {
                return running;
            }
    
            /**
             * Waits for the crawling thread to terminate.
             * This method blocks until the thread completes its execution.
             * Interrupted exceptions are caught and logged at debug level.
             */
            public void awaitTermination() {
                try {
                    join();
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 19K bytes
    - Viewed (0)
  3. src/main/java/org/codelibs/fess/helper/IntervalControlHelper.java

        protected List<IntervalRule> ruleList = new ArrayList<>();
    
        /**
         * Checks the crawler status and waits if the crawler is not running.
         * This method blocks until the crawler is running again.
         */
        public void checkCrawlerStatus() {
            while (!crawlerRunning) {
                ThreadUtil.sleepQuietly(crawlerWaitMillis);
            }
        }
    
        /**
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sun Nov 23 12:34:02 UTC 2025
    - 10K bytes
    - Viewed (0)
  4. CLAUDE.md

    **DI**: LastaFlute container with `@Resource` and XML config
    
    ### Core Principles
    
    **Thread Safety**:
    - `AtomicLong` for counters (`CrawlerContext.accessCount`)
    - `volatile` for status flags
    - Synchronized blocks for critical sections
    - Thread-local storage via `CrawlingParameterUtil`
    
    **Resource Management**:
    - `AutoCloseable` throughout
    - `DeferredFileOutputStream` for large responses (temp files for >1MB)
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)
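The thread-safety bullets in the CLAUDE.md excerpt (atomic counters, `volatile` status flags, synchronized blocks) can be illustrated with a minimal sketch. `CrawlStateSketch` and all of its members are hypothetical names chosen for illustration; this is not the Fess `CrawlerContext` and makes no claim about how that class is written.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical class for illustration only; not taken from the Fess code base.
class CrawlStateSketch {
    // Lock-free counter: incrementAndGet() is atomic across threads.
    private final AtomicLong accessCount = new AtomicLong();
    // volatile makes a write by one thread visible to reads on other threads.
    private volatile boolean running = true;
    // Plain ArrayList is not thread-safe, so every access is guarded by a lock.
    private final List<String> visited = new ArrayList<>();

    long recordAccess() { return accessCount.incrementAndGet(); }
    long accessCount() { return accessCount.get(); }
    boolean isRunning() { return running; }
    void stop() { running = false; }

    void addVisited(String url) {
        synchronized (visited) { // critical section
            visited.add(url);
        }
    }

    int visitedCount() {
        synchronized (visited) {
            return visited.size();
        }
    }

    // Runs several workers against one shared instance and waits for all of them.
    static CrawlStateSketch runWorkers(int threads, int perThread) {
        CrawlStateSketch state = new CrawlStateSketch();
        List<Thread> workers = new ArrayList<>();
        for (int t = 0; t < threads; t++) {
            Thread worker = new Thread(() -> {
                for (int i = 0; i < perThread && state.isRunning(); i++) {
                    state.recordAccess();
                    state.addVisited("url-" + i);
                }
            });
            workers.add(worker);
            worker.start();
        }
        for (Thread worker : workers) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            }
        }
        state.stop();
        return state;
    }
}
```

With a plain `long` counter or an unguarded list, concurrent workers could lose updates; the atomic counter and the lock are what keep both totals exact here.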
  5. src/main/java/org/codelibs/fess/query/QueryCommand.java

                return QueryBuilders.matchPhraseQuery(f, text);
            }
    
            final UnicodeBlock block = UnicodeBlock.of(text.codePointAt(0));
            if (block == UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS //
                    || block == UnicodeBlock.HIRAGANA //
                    || block == UnicodeBlock.KATAKANA //
                    || block == UnicodeBlock.HANGUL_SYLLABLES //
            ) {
                return QueryBuilders.prefixQuery(f, text);
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sun Nov 23 11:39:05 UTC 2025
    - 11.6K bytes
    - Viewed (0)
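The `UnicodeBlock` check in the QueryCommand.java excerpt can be tried in isolation. The sketch below reproduces only the block test; the class and method names are hypothetical, and it does not build any query.

```java
import java.lang.Character.UnicodeBlock;

// Hypothetical helper for illustration; only the block comparison mirrors the excerpt.
class CjkCheckSketch {
    // True when the first code point falls in one of the four blocks named in the excerpt.
    static boolean startsWithCjk(String text) {
        if (text == null || text.isEmpty()) {
            return false;
        }
        // codePointAt(0) handles a leading surrogate pair, unlike charAt(0).
        UnicodeBlock block = UnicodeBlock.of(text.codePointAt(0));
        return block == UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS
                || block == UnicodeBlock.HIRAGANA
                || block == UnicodeBlock.KATAKANA
                || block == UnicodeBlock.HANGUL_SYLLABLES;
    }
}
```

For example, `CjkCheckSketch.startsWithCjk("\u691C\u7D22")` is true because U+691C lies in CJK Unified Ideographs, while `CjkCheckSketch.startsWithCjk("search")` is false because `s` is in Basic Latin.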
  6. fess-crawler/src/test/resources/extractor/markdown/test.md

    The extractor should handle:
    
    - YAML front matter extraction
    - Heading structure
    - **Bold text** and *italic text*
    - Lists and other formatting
    
    ### Code Examples
    
    Here is some inline `code` and a code block:
    
    ```java
    public class Example {
        public static void main(String[] args) {
            System.out.println("Hello, World!");
        }
    }
    ```
    
    ## Links
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 03:46:53 UTC 2025
    - 767 bytes
    - Viewed (0)
  7. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            // Disallow: /*.pdf$ - should block .pdf files but not .pdf with query params
            assertFalse(robotsTxt.allows("/document.pdf", "WildcardBot"));
            assertFalse(robotsTxt.allows("/files/report.pdf", "WildcardBot"));
            assertTrue(robotsTxt.allows("/document.pdf?download=true", "WildcardBot")); // $ means exact end
    
            // Disallow: /admin/*.php - should block PHP files in admin directory
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
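The wildcard behaviour asserted in the RobotsTxtHelperTest.java excerpt (`*` matches any run of characters, a trailing `$` anchors the end of the URL) can be sketched by translating a rule into a regular expression. This is an assumption-laden illustration, not the matching code used by `RobotsTxtHelper`; `RobotsPatternSketch` and its methods are hypothetical names.

```java
import java.util.regex.Pattern;

// Hypothetical translator for illustration; not the fess-crawler implementation.
class RobotsPatternSketch {
    // Converts a robots.txt path rule into a regex that must match the whole path.
    static Pattern compile(String rule) {
        boolean anchored = rule.endsWith("$");
        String body = anchored ? rule.substring(0, rule.length() - 1) : rule;
        String[] parts = body.split("\\*", -1); // -1 keeps trailing empty parts
        StringBuilder regex = new StringBuilder();
        for (int i = 0; i < parts.length; i++) {
            if (i > 0) {
                regex.append(".*"); // each '*' matches any run of characters
            }
            regex.append(Pattern.quote(parts[i])); // literal text, escaped
        }
        if (!anchored) {
            regex.append(".*"); // without '$' the rule is a prefix match
        }
        return Pattern.compile(regex.toString());
    }

    // True when the rule applies to the given path (including any query string).
    static boolean matches(String rule, String path) {
        return compile(rule).matcher(path).matches();
    }
}
```

Under this sketch, `/*.pdf$` applies to `/document.pdf` but not to `/document.pdf?download=true`, which is the same distinction the test assertions draw.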
  8. src/test/java/org/codelibs/core/io/FileUtilTest.java

            final Path baseDir = tempFolder.getRoot().toPath();
            final Path traversalPath = baseDir.resolve("../../../etc/passwd");
    
            assertFalse("Path traversal should be blocked", FileUtil.isPathSafe(traversalPath, baseDir));
        }
    
        /**
         * Test isPathSafe with File objects
         *
         * @throws Exception
         */
        @Test
    Registered: Sat Dec 20 08:55:33 UTC 2025
    - Last Modified: Sat Nov 22 11:21:59 UTC 2025
    - 10.3K bytes
    - Viewed (0)
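The FileUtilTest.java excerpt checks that a `../../../etc/passwd` path is rejected. One common way to implement such a check is to normalize both paths and test containment; the sketch below shows that approach under the assumption that symbolic links do not need to be resolved. It is not the source of `FileUtil.isPathSafe`, and `PathSafetySketch.isInside` is a hypothetical name.

```java
import java.nio.file.Path;

// Hypothetical containment check for illustration; not the codelibs FileUtil code.
class PathSafetySketch {
    // True when candidate, after removing "." and ".." segments, stays under baseDir.
    static boolean isInside(Path candidate, Path baseDir) {
        Path base = baseDir.toAbsolutePath().normalize();
        Path target = candidate.toAbsolutePath().normalize();
        // Path.startsWith compares whole name elements, so "/tmp/base-evil"
        // does not count as being inside "/tmp/base".
        return target.startsWith(base);
    }
}
```

Because `normalize()` is purely lexical, a stricter variant would call `toRealPath()` on existing paths to follow symbolic links before comparing.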
  9. src/main/java/org/codelibs/fess/helper/ProcessHelper.java

                    throw new JobNotFoundException("Process for " + sessionId + " is not running.");
                }
            }
    
            // Perform I/O operations outside synchronized block to avoid blocking other threads
            try {
                final OutputStream out = process.getOutputStream();
                IOUtils.write(command + "\n", out, Constants.CHARSET_UTF_8);
                out.flush();
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Nov 20 06:54:47 UTC 2025
    - 10.9K bytes
    - Viewed (0)
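The comment in the ProcessHelper.java excerpt describes doing I/O outside the synchronized block. A minimal sketch of that pattern follows: the shared map is read under the lock, and the potentially slow write happens after the lock is released. `CommandSenderSketch`, its map of streams, and its method names are hypothetical and do not reflect how `ProcessHelper` stores its processes.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical class for illustration; not the Fess ProcessHelper.
class CommandSenderSketch {
    private final Map<String, OutputStream> streams = new HashMap<>();

    void register(String sessionId, OutputStream out) {
        synchronized (streams) {
            streams.put(sessionId, out);
        }
    }

    void send(String sessionId, String command) {
        final OutputStream out;
        synchronized (streams) {
            // Only the map lookup needs the lock.
            out = streams.get(sessionId);
            if (out == null) {
                throw new IllegalStateException("Process for " + sessionId + " is not running.");
            }
        }
        // The write may block on a slow consumer, so it runs without holding
        // the lock and other threads can still register or look up sessions.
        try {
            out.write((command + "\n").getBytes(StandardCharsets.UTF_8));
            out.flush();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

The trade-off is that two threads sending to the same session are no longer serialized by the map lock; if ordering per session matters, a per-stream lock would be one option.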
  10. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/MarkdownExtractor.java

     *
     * <p>Features:
     * <ul>
     *   <li>YAML front matter metadata extraction</li>
     *   <li>Heading structure extraction</li>
     *   <li>Link URL extraction</li>
     *   <li>Code block content extraction</li>
     *   <li>Clean text conversion from Markdown</li>
     *   <li>Configurable encoding</li>
     * </ul>
     */
    public class MarkdownExtractor extends AbstractExtractor {
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 03:46:53 UTC 2025
    - 8.2K bytes
    - Viewed (0)