Results 1 - 10 of 1,167 for xpath (0.64 sec)

  1. src/main/java/org/codelibs/fess/crawler/transformer/FessXpathTransformer.java

            final String xpath = xpathConfigMap.get(XPath.DEFAULT_LANG);
            if (StringUtil.isNotBlank(xpath)) {
                return xpath;
            }
            return fessConfig.getCrawlerDocumentHtmlLangXpath();
        }
    
        /**
         * Gets the XPath expression for extracting content.
         *
         * @param fessConfig the Fess configuration
         * @param xpathConfigMap the XPath configuration map
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Dec 12 13:58:40 UTC 2025
    - 54.6K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java

        }
    
        /**
         * Checks if a path is valid for crawling (not a JavaScript, mailto, or other invalid URL).
         *
         * @param path the path to validate
         * @return true if the path is valid, false otherwise
         */
        protected boolean isValidPath(final String path) {
            if (StringUtil.isBlank(path)) {
                return false;
            }
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Nov 29 07:42:33 UTC 2025
    - 30.5K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlExtractor.java

        }
    
        /**
         * Adds a metadata field with its corresponding XPath expression for extraction.
         *
         * @param name the name of the metadata field
         * @param xpath the XPath expression to extract the metadata value
         */
        public void addMetadata(final String name, final String xpath) {
            metadataXpathMap.put(name, xpath);
        }
    
        /*
         * (non-Javadoc)
         *
         * @see
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Oct 04 08:47:19 UTC 2025
    - 9.3K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/HtmlXpathExtractor.java

        }
    
        /**
         * Gets the XPath expression for selecting target nodes.
         *
         * @return the target node path
         */
        public String getTargetNodePath() {
            return targetNodePath;
        }
    
        /**
         * Sets the XPath expression for selecting target nodes.
         *
         * @param targetNodePath the target node path to set
         */
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Oct 04 08:47:19 UTC 2025
    - 10.4K bytes
    - Viewed (0)
  5. src/main/resources/fess_config.properties

    dc:title=title:string\n\
    
    # html
    
    # XPath to extract main content from HTML documents.
    crawler.document.html.content.xpath=//BODY
    # XPath to extract language attribute from HTML documents.
    crawler.document.html.lang.xpath=//HTML/@lang
    # XPath to extract digest (description) from HTML documents.
    crawler.document.html.digest.xpath=//META[@name='description']/@content
    # XPath to extract canonical URL from HTML documents.
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Dec 11 09:47:03 UTC 2025
    - 54.8K bytes
    - Viewed (0)
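
The properties in the snippet above hold XPath expressions. As a rough illustration of what such expressions select, the sketch below evaluates them with the JDK's built-in `javax.xml.xpath` API against a small well-formed sample. It is not how Fess itself applies these settings: the class name `XPathConfigSketch`, the `evaluate` helper, and the sample markup are assumptions made for this example, and the sample uses upper-case element names only so that the configured expressions match it.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

public class XPathConfigSketch {

    // Hypothetical sample document (an assumption, not taken from Fess).
    static final String SAMPLE =
            "<HTML lang=\"en\"><HEAD>"
            + "<META name=\"description\" content=\"A short summary\"/>"
            + "</HEAD><BODY>Hello, crawler</BODY></HTML>";

    // Parses the XML text and returns the string value of the XPath expression.
    static String evaluate(final String xml, final String expression) throws Exception {
        final Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xml)));
        final XPath xpath = XPathFactory.newInstance().newXPath();
        return xpath.evaluate(expression, doc);
    }

    public static void main(final String[] args) throws Exception {
        // Expressions copied from the property values shown in the snippet.
        System.out.println(evaluate(SAMPLE, "//BODY"));
        System.out.println(evaluate(SAMPLE, "//HTML/@lang"));
        System.out.println(evaluate(SAMPLE, "//META[@name='description']/@content"));
    }
}
```

Evaluating to a string returns the text content of the first matching node, which is why an attribute expression such as `//HTML/@lang` yields just the attribute value.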
  6. README.md

    // Configure for file system crawling
    container.singleton("fsClient", FileSystemClient.class);
    
    // Add file URL
    crawler.addUrl("file:///path/to/directory");
    crawler.urlFilter.addInclude("file:///path/to/directory/.*");
    ```
    
    ## Configuration
    
    ### XML Configuration
    
    Fess Crawler uses XML-based configuration with LastaFlute DI. Place configuration files in your classpath:
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  7. src/main/java/org/codelibs/fess/mylasta/direction/FessConfig.java

        /** The key of the configuration. e.g. //HTML/@lang */
        String CRAWLER_DOCUMENT_HTML_LANG_XPATH = "crawler.document.html.lang.xpath";
    
        /** The key of the configuration. e.g. //META[@name='description']/@content */
        String CRAWLER_DOCUMENT_HTML_DIGEST_XPATH = "crawler.document.html.digest.xpath";
    
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sat Dec 13 02:21:17 UTC 2025
    - 525.7K bytes
    - Viewed (2)
  8. docs/en/docs/tutorial/path-params.md

    ## Order matters { #order-matters }
    
    When creating *path operations*, you can find situations where you have a fixed path.
    
    Like `/users/me`, let's say that it's to get data about the current user.
    
    And then you can also have a path `/users/{user_id}` to get data about a specific user by some user ID.
    
    Registered: Sun Dec 28 07:19:09 UTC 2025
    - Last Modified: Wed Dec 17 20:41:43 UTC 2025
    - 9.2K bytes
    - Viewed (0)
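
The snippet above stops before explaining why the order of `/users/me` and `/users/{user_id}` matters. A minimal sketch of the underlying idea, assuming routes are tried in declaration order and the first match wins, might look like the following. It is written in Java for consistency with the other results on this page and is not the framework's implementation; `RouteOrderSketch`, `firstMatch`, and the regular expressions are hypothetical names and patterns chosen for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Pattern;

public class RouteOrderSketch {

    // Returns the template of the first route whose pattern matches the path, or null.
    static String firstMatch(final Map<String, Pattern> routes, final String path) {
        for (final Map.Entry<String, Pattern> route : routes.entrySet()) {
            if (route.getValue().matcher(path).matches()) {
                return route.getKey();
            }
        }
        return null;
    }

    // Fixed path declared before the parameterized one.
    static Map<String, Pattern> fixedFirst() {
        final Map<String, Pattern> routes = new LinkedHashMap<>();
        routes.put("/users/me", Pattern.compile("/users/me"));
        routes.put("/users/{user_id}", Pattern.compile("/users/[^/]+"));
        return routes;
    }

    // Parameterized path declared first, so it also captures "/users/me".
    static Map<String, Pattern> parameterFirst() {
        final Map<String, Pattern> routes = new LinkedHashMap<>();
        routes.put("/users/{user_id}", Pattern.compile("/users/[^/]+"));
        routes.put("/users/me", Pattern.compile("/users/me"));
        return routes;
    }

    public static void main(final String[] args) {
        System.out.println(firstMatch(fixedFirst(), "/users/me"));
        System.out.println(firstMatch(parameterFirst(), "/users/me"));
    }
}
```

With the fixed path first, `/users/me` resolves to its own route. With the parameterized path first, the same request is captured as a user ID of `me`.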
  9. docs/es/docs/tutorial/path-params.md

    ```JSON
    {
      "model_name": "alexnet",
      "message": "Deep Learning FTW!"
    }
    ```
    
    ## Path parameters containing paths { #path-parameters-containing-paths }
    
    Let's say you have a *path operation* with a path `/files/{file_path}`.
    
    But you need `file_path` itself to contain a *path*, like `home/johndoe/myfile.txt`.
    
    So, the URL for that file would be something like: `/files/home/johndoe/myfile.txt`.
    
    Registered: Sun Dec 28 07:19:09 UTC 2025
    - Last Modified: Wed Dec 17 20:41:43 UTC 2025
    - 9.8K bytes
    - Viewed (0)
  10. docs/de/docs/tutorial/path-params.md

    ### Path convertor { #path-convertor }
    
    Using an option directly from Starlette, you can declare a *path parameter* that should contain a path by defining a URL like this:
    
    ```
    /files/{file_path:path}
    ```
    
    In this case, the name of the parameter is `file_path`. The last part, `:path`, says that the parameter should be a *path*.
    
    Registered: Sun Dec 28 07:19:09 UTC 2025
    - Last Modified: Wed Dec 17 20:41:43 UTC 2025
    - 10.5K bytes
    - Viewed (0)
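
To make the difference between `{file_path}` and `{file_path:path}` concrete, here is a small sketch of a template matcher in which a plain parameter matches a single segment and a `:path` parameter matches the rest of the URL, slashes included. It only illustrates the idea and does not reproduce Starlette's converter code; `PathConverterSketch` and `extract` are hypothetical names, and the helper assumes a template with exactly one parameter.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PathConverterSketch {

    // Returns the value captured by the template's single parameter, or null if the path does not match.
    static String extract(final String template, final String path) {
        // Locate "{name}" or "{name:path}" in the template.
        final Matcher param = Pattern.compile("\\{\\w+(:path)?\\}").matcher(template);
        if (!param.find()) {
            return null;
        }
        // ":path" may span several segments; a plain parameter stops at the next slash.
        final String capture = param.group(1) != null ? "(.+)" : "([^/]+)";
        final String regex = Pattern.quote(template.substring(0, param.start()))
                + capture
                + Pattern.quote(template.substring(param.end()));
        final Matcher m = Pattern.compile(regex).matcher(path);
        return m.matches() ? m.group(1) : null;
    }

    public static void main(final String[] args) {
        final String url = "/files/home/johndoe/myfile.txt";
        System.out.println(extract("/files/{file_path:path}", url));
        System.out.println(extract("/files/{file_path}", url));
    }
}
```

In this sketch the `:path` template captures `home/johndoe/myfile.txt`, while the plain template does not match the URL at all because its parameter cannot cross a slash.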