Results 1 - 10 of 17 for "about" (0.42 sec)

  1. src/main/java/org/codelibs/fess/exception/DataStoreCrawlingException.java

            super(message, cause);
            this.url = url;
            this.abort = abort;
        }
    
        /**
         * Gets the URL where the crawling error occurred.
         *
         * @return the URL associated with this exception
         */
        public String getUrl() {
            return url;
        }
    
        /**
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Wed Nov 19 08:04:23 UTC 2025
    - 2.7K bytes
    - Viewed (0)
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/ExtractorResourceManagementTest.java

                fail("Expected exception for invalid Word document");
            } catch (final ExtractException e) {
                // ExtractException with improved message
                assertTrue("Error message should contain context about Word document or extraction",
                        e.getMessage().contains("Word") || e.getMessage().contains("extract"));
            } catch (final RuntimeException e) {
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 10.4K bytes
    - Viewed (0)
  3. README.md

    Fess is based on [OpenSearch](https://github.com/opensearch-project/OpenSearch), but knowledge/experience about OpenSearch is _not_ required. Fess provides an easy to use Administration GUI to configure the system via your browser.
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Sat Dec 20 00:28:33 UTC 2025
    - 7.8K bytes
    - Viewed (2)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapImage.java

     * This class encapsulates the properties of an image as defined in the Google Image Sitemap extension.
     *
     * <p>
     * The image extension allows you to provide additional information about images on your pages.
     * This can help Google index your images and display them in Google Images search results.
     * </p>
     *
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 13:34:36 UTC 2025
    - 3.8K bytes
    - Viewed (0)
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapAlternateLink.java

    /**
     * Represents an alternate link entry within a sitemap URL.
     * This class encapsulates the properties of an alternate link as defined in the hreflang annotation.
     *
     * <p>
     * The hreflang attribute tells search engines about alternate versions of a page in different languages
     * or for different regions. This helps search engines serve the correct language or regional URL
     * to searchers.
     * </p>
     *
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 13:34:36 UTC 2025
    - 2.6K bytes
    - Viewed (0)
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapNews.java

     * This class encapsulates the properties of a news article as defined in the Google News Sitemap extension.
     *
     * <p>
     * The news extension allows you to provide additional information about news articles on your site.
     * This can help Google News index your articles and display them in Google News search results.
     * </p>
     *
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 13:34:36 UTC 2025
    - 4.8K bytes
    - Viewed (0)
  7. fess-crawler/src/main/java/org/codelibs/fess/crawler/entity/SitemapVideo.java

    /**
     * Represents a video entry within a sitemap URL.
     * This class encapsulates the properties of a video as defined in the Video Sitemap extension.
     *
     * <p>
     * The video extension allows you to provide additional information about videos on your pages.
     * This can help Google index your videos and display them in Google Video search results.
     * </p>
     *
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 13 13:34:36 UTC 2025
    - 8.9K bytes
    - Viewed (0)
  8. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            // Allow: /public/*.html - should allow HTML files in public directory
            assertTrue(robotsTxt.allows("/public/index.html", "WildcardBot"));
            assertTrue(robotsTxt.allows("/public/about.html", "WildcardBot"));
    
            // Test EndPathBot - end-of-path ($) patterns
            // Disallow: /fish$ - should block exactly /fish but not /fishing
            assertFalse(robotsTxt.allows("/fish", "EndPathBot"));
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
  9. fess-crawler/src/test/java/org/codelibs/fess/net/protocol/gcs/HandlerTest.java

            // But importantly, there should be no race condition errors
            assertEquals(threadCount, failureCount.get());
            assertEquals(0, successCount.get());
    
            // All exceptions should be about project ID being blank
            for (Exception e : exceptions) {
                assertTrue(e instanceof IOException);
                assertTrue(e.getMessage().contains("GCS_PROJECT_ID is blank"));
            }
        }
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Dec 11 08:38:29 UTC 2025
    - 14.1K bytes
    - Viewed (0)
  10. fess-crawler/src/test/java/org/codelibs/fess/net/protocol/storage/HandlerTest.java

            // But importantly, there should be no race condition errors
            assertEquals(threadCount, failureCount.get());
            assertEquals(0, successCount.get());
    
            // All exceptions should be about endpoint being blank
            for (Exception e : exceptions) {
                assertTrue(e instanceof IOException);
                assertTrue(e.getMessage().contains("endpoint is blank"));
            }
        }
    
        /**
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Dec 11 08:38:29 UTC 2025
    - 14.4K bytes
    - Viewed (0)
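
The RobotsTxtHelperTest snippet in result 8 exercises two robots.txt pattern features: the `*` wildcard (`/public/*.html`) and the `$` end-of-path anchor (`/fish$`). The sketch below illustrates how such patterns could be matched against a path. It is not the Fess crawler's implementation: the class `RobotsPatternSketch` and its methods `toRegex` and `matches` are hypothetical names introduced here, and the sketch assumes the common convention that a rule without a trailing `$` is a prefix match.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; not taken from the Fess code base.
public class RobotsPatternSketch {

    // Converts a robots.txt path rule into a regular expression.
    // '*' matches any run of characters; a trailing '$' anchors the end of the path.
    static Pattern toRegex(final String rule) {
        final boolean anchored = rule.endsWith("$");
        final String body = anchored ? rule.substring(0, rule.length() - 1) : rule;
        final StringBuilder regex = new StringBuilder();
        for (final char c : body.toCharArray()) {
            if (c == '*') {
                regex.append(".*");
            } else {
                // Quote every other character so it is matched literally.
                regex.append(Pattern.quote(String.valueOf(c)));
            }
        }
        if (!anchored) {
            // Assumption: unanchored rules behave as prefix matches.
            regex.append(".*");
        }
        return Pattern.compile(regex.toString());
    }

    // Returns true when the given path is covered by the rule.
    static boolean matches(final String rule, final String path) {
        return toRegex(rule).matcher(path).matches();
    }

    public static void main(final String[] args) {
        System.out.println(matches("/public/*.html", "/public/about.html")); // true
        System.out.println(matches("/fish$", "/fish"));                      // true
        System.out.println(matches("/fish$", "/fishing"));                   // false
        System.out.println(matches("/fish", "/fishing"));                    // true
    }
}
```

With a `Disallow: /fish$` rule, a match means the path is blocked, which is consistent with the snippet's assertion that `/fish` is not allowed for `EndPathBot` while `/fishing` is unaffected.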