Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 4 of 4 for MarkdownExtractor (0.1 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/MarkdownExtractor.java

     *   <li>Clean text conversion from Markdown</li>
     *   <li>Configurable encoding</li>
     * </ul>
     */
    public class MarkdownExtractor extends AbstractExtractor {
        /** Logger instance for this class. */
        private static final Logger logger = LogManager.getLogger(MarkdownExtractor.class);
    
        /** Default encoding for Markdown files. */
        protected String encoding = Constants.UTF_8;
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 03:46:53 UTC 2025
    - 8.2K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/MarkdownExtractorTest.java

            markdownExtractor = container.getComponent("markdownExtractor");
        }
    
        public void test_getText() {
            final InputStream in = ResourceUtil.getResourceAsStream("extractor/markdown/test.md");
            final ExtractData extractData = markdownExtractor.getText(in, null);
            CloseableUtil.closeQuietly(in);
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 6.4K bytes
    - Viewed (0)
  3. fess-crawler/src/test/resources/extractor/markdown/test.md

    ---
    title: Sample Markdown Document
    author: John Doe
    date: 2025-01-15
    tags:
      - crawler
      - extractor
      - markdown
    ---
    
    # Introduction
    
    This is a sample Markdown document for testing the MarkdownExtractor.
    
    ## Features
    
    The extractor should handle:
    
    - YAML front matter extraction
    - Heading structure
    - **Bold text** and *italic text*
    - Lists and other formatting
    
    ### Code Examples
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 03:46:53 UTC 2025
    - 767 bytes
    - Viewed (0)
  4. fess-crawler-lasta/src/main/resources/crawler/extractor.xml

    		<property name="hasHeader">true</property>
    		<property name="autoDetectDelimiter">true</property>
    		<property name="extractColumnMetadata">true</property>
    	</component>
    	<component name="markdownExtractor"
    		class="org.codelibs.fess.crawler.extractor.impl.MarkdownExtractor">
    		<property name="extractFrontMatter">true</property>
    		<property name="extractHeadings">true</property>
    		<property name="extractLinks">false</property>
    	</component>
    	<!--
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 03:46:53 UTC 2025
    - 50.1K bytes
    - Viewed (0)
Back to top