Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 5 of 5 for Particles (0.14 sec)

  1. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapNews.java

    /**
     * Represents a news entry within a sitemap URL.
     * This class encapsulates the properties of a news article as defined in the Google News Sitemap extension.
     *
     * <p>
     * The news extension allows you to provide additional information about news articles on your site.
     * This can help Google News index your articles and display them in Google News search results.
     * </p>
     *
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 13:34:36 UTC 2025
    - 4.8K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/builder/RequestDataBuilderTest.java

            // Real-world example: crawling a web page
            RequestData data = RequestDataBuilder.newRequestData().get().url("https://example.com/article/12345").weight(1.0f).build();
    
            assertNotNull(data);
            assertEquals(Method.GET, data.getMethod());
            assertEquals("https://example.com/article/12345", data.getUrl());
        }
    
        public void test_realWorldUsageExample2() {
            // Real-world example: posting data
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 10.9K bytes
    - Viewed (0)
  3. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            // Disallow: /*?*id=* - should block URLs with ?...id=...
            assertFalse(robotsTxt.allows("/page?id=123", "MultiWildcardBot"));
            assertFalse(robotsTxt.allows("/article?name=test&id=456", "MultiWildcardBot"));
            assertTrue(robotsTxt.allows("/page?name=test", "MultiWildcardBot")); // no id=
    
            // Test DollarBot - literal $ in middle of pattern
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
  4. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/SitemapsHelperTest.java

                            + "        xmlns:news=\"http://www.google.com/schemas/sitemap-news/0.9\">\n" + "  <url>\n"
                            + "    <loc>http://www.example.com/news/article.html</loc>\n" + "    <news:news>\n" + "      <news:publication>\n"
                            + "        <news:name>Example Times</news:name>\n" + "        <news:language>en</news:language>\n"
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 36.7K bytes
    - Viewed (0)
  5. fess-crawler/src/main/resources/org/codelibs/fess/crawler/mime/tika-mimetypes.xml

          <match value="%\ BibTeX\ standard\ bibliography\ " type="string" offset="0"/>
          <match value="%%%\ \ @BibTeX-style-file{" type="string" offset="73"/>
          <match value="@article{" type="stringignorecase" offset="0"/>
          <match value="@book{" type="stringignorecase" offset="0"/>
          <match value="@inbook{" type="stringignorecase" offset="0"/>
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Oct 16 07:46:32 UTC 2025
    - 320.2K bytes
    - Viewed (2)
Back to top