Results 1 - 10 of 75 for nofollow (0.06 sec)

  1. src/main/java/org/codelibs/fess/crawler/transformer/FessXpathTransformer.java

                        }
                        if (content.contains(ROBOTS_TAG_NOFOLLOW)) {
                            nofollow = true;
                        }
                    }
                    if (noindex && nofollow) {
                        logger.info("META(robots=noindex,nofollow): {}", responseData.getUrl());
                        throw new ChildUrlsException(Collections.emptySet(), "#processMetaRobots");
                    }
    Registered: Thu Oct 31 13:40:30 UTC 2024
    - Last Modified: Thu Oct 24 13:01:38 UTC 2024
    - 42.9K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/ResponseData.java

        }
    
        public void setStatus(final int status) {
            this.status = status;
        }
    
        public void setNoFollow(final boolean value) {
            noFollow = value;
        }
    
        public boolean isNoFollow() {
            return noFollow;
        }
    
        public void addMetaData(final String name, final Object value) {
            metaDataMap.put(name, value);
        }
    
    Registered: Sun Nov 10 03:50:12 UTC 2024
    - Last Modified: Sat Oct 12 01:40:57 UTC 2024
    - 5.8K bytes
    - Viewed (0)
  3. src/test/java/org/codelibs/fess/crawler/transformer/FessXpathTransformerTest.java

                };
            };
    
            final ResponseData responseData = new ResponseData();
            responseData.addMetaData("X-Robots-Tag", "nofollow");
    
            transformer.processXRobotsTag(responseData, new ResultData());
            assertTrue(responseData.isNoFollow());
        }
    
        public void test_processMetaRobots_no() throws Exception {
            final String data = "<html><body>foo</body></html>";
    Registered: Thu Oct 31 13:40:30 UTC 2024
    - Last Modified: Thu Oct 24 13:01:38 UTC 2024
    - 41.3K bytes
    - Viewed (0)
  4. src/main/resources/fess_config.properties

    crawler.document.html.digest.xpath=//META[@name='description']/@content
    crawler.document.html.canonical.xpath=//LINK[@rel='canonical'][1]/@href
    crawler.document.html.pruned.tags=noscript,script,style,header,footer,aside,nav,a[rel=nofollow]
    crawler.document.html.max.digest.length=120
    crawler.document.html.default.lang=
    crawler.document.html.default.include.index.patterns=
    Registered: Thu Oct 31 13:40:30 UTC 2024
    - Last Modified: Tue Oct 01 14:13:38 UTC 2024
    - 30.9K bytes
    - Viewed (0)
  5. src/main/java/org/codelibs/fess/mylasta/direction/FessConfig.java

        String CRAWLER_DOCUMENT_HTML_CANONICAL_XPATH = "crawler.document.html.canonical.xpath";
    
        /** The key of the configuration. e.g. noscript,script,style,header,footer,aside,nav,a[rel=nofollow] */
        String CRAWLER_DOCUMENT_HTML_PRUNED_TAGS = "crawler.document.html.pruned.tags";
    
        /** The key of the configuration. e.g. 120 */
    Registered: Thu Oct 31 13:40:30 UTC 2024
    - Last Modified: Fri Oct 11 21:11:58 UTC 2024
    - 468.5K bytes
    - Viewed (1)
  6. CONTRIBUTING.md

    ### Security vulnerabilities
    
    Do not report security vulnerabilities to the public issue tracker. Follow our [Security Vulnerability Disclosure Policy](https://github.com/gradle/gradle/security/policy).
    
    ### Follow the Code of Conduct
    
    Contributors must follow the Code of Conduct outlined at [https://gradle.org/conduct/](https://gradle.org/conduct/).
    
    ### Additional help
    
    Registered: Wed Nov 06 11:36:14 UTC 2024
    - Last Modified: Tue Nov 05 15:15:33 UTC 2024
    - 15.6K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/transformer/impl/HtmlTransformer.java

            resultData.setTransformerName(getName());
    
            try {
                // data
                storeData(responseData, resultData);
    
                if (isHtml(responseData) && !responseData.isNoFollow()) {
                    // urls
                    storeChildUrls(responseData, resultData);
                }
            } finally {
                xpathAPI.remove();
            }
    
    Registered: Sun Nov 10 03:50:12 UTC 2024
    - Last Modified: Thu Oct 24 12:16:00 UTC 2024
    - 19.4K bytes
    - Viewed (0)
  8. architecture/standards/0006-use-of-provider-apis-in-gradle.md

    The Gradle codebase has evolved over time and has a mixture of simple getter/setter methods, Provider API and things in between.  It can be hard to follow best practices because Gradle provided types are reused in unexpected ways and extended by third party plugins.
    
    Given these constraints, we cannot always follow best practices.  This ADR proposes the way we should handle the use of Provider APIs in the gradle/gradle codebase.
    
    ## Decision
    
    Registered: Wed Nov 06 11:36:14 UTC 2024
    - Last Modified: Tue Oct 15 20:00:57 UTC 2024
    - 10K bytes
    - Viewed (0)
  9. apache-maven/src/assembly/maven/lib/jline-native/README.txt

    This directory contains JLine native libraries extracted from JLine JAR.
    
    You can add your own build for platforms not natively supported by JLine.
    See here [1] on how to compile for your platform and here [2] how libraries
    follow JLine's directory and filename conventions.
    
    [1] https://github.com/jline/jline3/tree/master/native
    Registered: Sun Nov 03 03:35:11 UTC 2024
    - Last Modified: Thu Oct 24 12:01:35 UTC 2024
    - 428 bytes
    - Viewed (0)
  10. helm/minio/templates/NOTES.txt

    Read more about port forwarding here: http://kubernetes.io/docs/user-guide/kubectl/kubectl_port-forward/
    
    You can now access MinIO server on http://localhost:9000. Follow the below steps to connect to MinIO server with mc client:
    
      1. Download the MinIO mc client - https://min.io/docs/minio/linux/reference/minio-mc.html#quickstart
    
    Registered: Sun Nov 03 19:28:11 UTC 2024
    - Last Modified: Thu Oct 10 15:48:31 UTC 2024
    - 2.7K bytes
    - Viewed (0)