Results 1 - 8 of 8 for nofollow (1.69 sec)

  1. src/main/java/org/codelibs/fess/crawler/transformer/FessXpathTransformer.java

                        }
                        if (content.contains(ROBOTS_TAG_NOFOLLOW)) {
                            nofollow = true;
                        }
                    }
                    if (noindex && nofollow) {
                        logger.info("META(robots=noindex,nofollow): {}", responseData.getUrl());
                        throw new ChildUrlsException(Collections.emptySet(), "#processMetaRobots");
                    }
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Dec 12 13:58:40 UTC 2025
    - 54.6K bytes
    - Viewed (0)
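
The excerpt above sets `noindex` and `nofollow` flags from the content of a robots meta tag and stops child-URL extraction when both are set. A minimal, self-contained sketch of that flag logic follows. The class `RobotsMetaSketch`, its nested `Flags` type, and the `parse` method are hypothetical names for illustration, not part of Fess, and the sketch matches comma-separated tokens rather than using `contains` as the original does.

```java
import java.util.Locale;

public class RobotsMetaSketch {

    /** Hypothetical holder for the two flags read from a robots meta tag. */
    public static final class Flags {
        public final boolean noindex;
        public final boolean nofollow;

        Flags(boolean noindex, boolean nofollow) {
            this.noindex = noindex;
            this.nofollow = nofollow;
        }

        /** True when the page should be neither indexed nor followed. */
        public boolean blocksEverything() {
            return noindex && nofollow;
        }
    }

    /**
     * Parses the content attribute of a robots meta tag, e.g. "noindex, nofollow".
     * Assumption: directives are comma-separated and case-insensitive.
     */
    public static Flags parse(String content) {
        boolean noindex = false;
        boolean nofollow = false;
        if (content != null) {
            for (String token : content.toLowerCase(Locale.ROOT).split(",")) {
                String directive = token.trim();
                if (directive.equals("noindex")) {
                    noindex = true;
                } else if (directive.equals("nofollow")) {
                    nofollow = true;
                }
            }
        }
        return new Flags(noindex, nofollow);
    }

    public static void main(String[] args) {
        Flags flags = parse("NOINDEX, nofollow");
        System.out.println("noindex=" + flags.noindex + " nofollow=" + flags.nofollow);
    }
}
```

A caller could check `blocksEverything()` before collecting child URLs, mirroring the `noindex && nofollow` branch in the excerpt.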
  2. src/main/resources/fess_config.properties

    crawler.document.html.canonical.xpath=//LINK[@rel='canonical'][1]/@href
    # HTML tags to prune (remove) during document processing.
    crawler.document.html.pruned.tags=noscript,script,style,header,footer,aside,nav,a[rel=nofollow]
    # Maximum length of digest extracted from HTML documents.
    crawler.document.html.max.digest.length=120
    # Default language for HTML documents.
    crawler.document.html.default.lang=
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Dec 11 09:47:03 UTC 2025
    - 54.8K bytes
    - Viewed (0)
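
The `crawler.document.html.pruned.tags` value above mixes plain tag names with one attribute-qualified entry, `a[rel=nofollow]`. As a rough illustration of how such a value could be split into rules, here is a sketch. The class `PrunedTagSketch`, the `Rule` type, and the accepted `tag[attr=value]` grammar are assumptions made for this example and may differ from how Fess parses the setting.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PrunedTagSketch {

    /** Hypothetical rule: a tag name plus an optional attribute filter. */
    public static final class Rule {
        public final String tag;
        public final String attrName;   // null when the entry has no [attr=value] part
        public final String attrValue;  // null when the entry has no [attr=value] part

        Rule(String tag, String attrName, String attrValue) {
            this.tag = tag;
            this.attrName = attrName;
            this.attrValue = attrValue;
        }
    }

    // Assumed grammar: "tag" or "tag[attr=value]".
    private static final Pattern ENTRY =
            Pattern.compile("^([A-Za-z0-9]+)(?:\\[([A-Za-z0-9_-]+)=([^\\]]*)\\])?$");

    /** Splits a comma-separated property value into rules, skipping empty entries. */
    public static List<Rule> parse(String value) {
        List<Rule> rules = new ArrayList<>();
        for (String raw : value.split(",")) {
            String entry = raw.trim();
            if (entry.isEmpty()) {
                continue;
            }
            Matcher m = ENTRY.matcher(entry);
            if (!m.matches()) {
                throw new IllegalArgumentException("Unsupported entry: " + entry);
            }
            // Groups 2 and 3 are null when the attribute part is absent.
            rules.add(new Rule(m.group(1), m.group(2), m.group(3)));
        }
        return rules;
    }

    public static void main(String[] args) {
        String value = "noscript,script,style,header,footer,aside,nav,a[rel=nofollow]";
        for (Rule r : parse(value)) {
            System.out.println(r.tag + (r.attrName == null ? "" : " where " + r.attrName + "=" + r.attrValue));
        }
    }
}
```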
  3. src/main/java/org/codelibs/fess/mylasta/direction/FessConfig.java

        String CRAWLER_DOCUMENT_HTML_CANONICAL_XPATH = "crawler.document.html.canonical.xpath";
    
        /** The key of the configuration. e.g. noscript,script,style,header,footer,aside,nav,a[rel=nofollow] */
        String CRAWLER_DOCUMENT_HTML_PRUNED_TAGS = "crawler.document.html.pruned.tags";
    
        /** The key of the configuration. e.g. 120 */
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sat Dec 13 02:21:17 UTC 2025
    - 525.7K bytes
    - Viewed (2)
  4. README.md

        $ mvn dbflute:download # (one time command)
        $ mvn dbflute:freegen
        $ mvn license:format
    
    ### Integration Tests
    
    Integration tests require a running Fess server with OpenSearch. Follow these steps:
    
    #### 1. Build Fess
    
        $ mvn antrun:run  # Download OpenSearch plugins (if not done)
        $ mvn package     # Build the package
    
    #### 2. Start Fess Server
    
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sat Dec 20 00:28:33 UTC 2025
    - 7.8K bytes
    - Viewed (2)
  5. src/main/java/org/codelibs/fess/thumbnail/impl/BaseThumbnailGenerator.java

        protected Map<String, String> filePathMap = new HashMap<>();
    
        /** The name of this thumbnail generator. */
        protected String name;
    
        /** Maximum number of redirects to follow. */
        protected int maxRedirectCount = 10;
    
        /** Availability status of this generator. */
        protected Boolean available = null;
    
        /**
         * Registers this thumbnail generator with the thumbnail manager.
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 13.4K bytes
    - Viewed (0)
  6. src/main/java/org/codelibs/fess/ds/callback/FileListIndexUpdateCallbackImpl.java

        /** Maximum size of the delete URL cache before batch deletion is triggered. */
        protected int maxDeleteDocumentCacheSize;
    
        /** Maximum number of redirects to follow when processing URLs. */
        protected int maxRedirectCount;
    
        /** Executor service for concurrent processing of file operations. */
        private final ExecutorService executor;
    
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Fri Nov 28 16:29:12 UTC 2025
    - 29.7K bytes
    - Viewed (3)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java

            resultData.setTransformerName(getName());
    
            try {
                // data
                storeData(responseData, resultData);
    
                if (isHtml(responseData) && !responseData.isNoFollow()) {
                    // urls
                    storeChildUrls(responseData, resultData);
                }
            } finally {
                xpathAPI.remove();
            }
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sat Nov 29 07:42:33 UTC 2025
    - 30.5K bytes
    - Viewed (0)
  8. CLAUDE.md

        // Process
    }  // Temp files auto-deleted
    ```
    
    ---
    
    ## Best Practices for AI Assistants
    
    ### When Adding Features
    
    1. Read existing code first (use symbol overview tools)
    2. Follow existing patterns
    3. Add tests
    4. Handle resources properly (try-with-resources)
    5. Consider thread safety
    6. Update JavaDoc
    
    ### When Fixing Bugs
    
    1. Write failing test first
    2. Understand root cause
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
    - Viewed (0)