Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 38 for Unicode (0.23 sec)

  1. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/TextExtractorEnhancedTest.java

            assertTrue("Should contain line 9999", result.getContent().contains("Line 9999"));
        }
    
        /**
         * Test extraction with various Unicode characters.
         */
        public void test_getText_unicodeContent_extractsCorrectly() {
            final String unicodeContent = "Hello 世界 مرحبا мир שלום";
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 8.9K bytes
    - Viewed (0)
  2. fess-crawler/src/test/resources/org/codelibs/fess/crawler/helper/robots_malformed.txt

    Disallow: /test/
    
    # Case 9: Special characters in paths
    User-agent: SpecialCharBot
    Disallow: /path with spaces/
    Disallow: /path%20encoded/
    Disallow: /path?query=value
    Disallow: /path#fragment
    Allow: /unicode/日本語/
    
    # Case 10: Multiple User-agents in sequence
    User-agent: Bot1
    User-agent: Bot2
    User-agent: Bot3
    Disallow: /shared/
    
    # Case 11: Sitemap with various formats
    Sitemap: http://example.com/sitemap.xml
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 14 12:52:01 UTC 2025
    - 2.6K bytes
    - Viewed (0)
  3. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/EXTRACTOR_TESTS_README.md

    - Improved error messages with encoding
    - Various encoding support
    - Edge cases (empty, large, Unicode)
    - Special characters handling
    
    **Test Count**: 12 tests
    
    **Key Scenarios**:
    - ✅ Default UTF-8 encoding extraction
    - ✅ Custom encoding support
    - ✅ Error messages include encoding information
    - ✅ Large content handling
    - ✅ Unicode and special characters
    - ✅ Empty and whitespace-only content
    
    ---
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Wed Nov 19 08:55:01 UTC 2025
    - 5.7K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/client/http/conn/IdnDnsResolver.java

    import org.apache.http.conn.DnsResolver;
    
    /**
     * The {@code IdnDnsResolver} class implements the {@code DnsResolver} interface
     * to provide DNS resolution with support for Internationalized Domain Names (IDN).
     * It converts Unicode domain names to ASCII Compatible Encoding (ACE) using the
     * {@link java.net.IDN} class.
     *
     * <p>This class allows setting a custom flag for the IDN conversion and a custom
     * encoding for URL decoding.</p>
     *
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Nov 06 07:24:43 UTC 2025
    - 3.9K bytes
    - Viewed (0)
  5. fess-crawler/src/test/java/org/codelibs/fess/crawler/util/CharUtilTest.java

            assertFalse(CharUtil.isUrlChar('\u00A0')); // non-breaking space
            assertFalse(CharUtil.isUrlChar('\u00FF')); // ÿ
        }
    
        public void test_isUrlChar_unicode() {
            // Test Unicode characters (not valid URL chars without encoding)
            assertFalse(CharUtil.isUrlChar('\u3042')); // あ (Hiragana)
            assertFalse(CharUtil.isUrlChar('\u4E00')); // 一 (CJK)
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 5.6K bytes
    - Viewed (0)
  6. CLAUDE.md

    **Technology Stack:**
    - Java 21+, Maven
    - OpenSearch (provided dependency)
    - Apache Lucene (query parsing, text analysis)
    - ICU4J (Unicode text processing)
    - JUnit 4 (testing)
    
    **Repository:** https://github.com/codelibs/fess-suggest
    
    ---
    
    ## Architecture
    
    ### Package Structure
    
    ```
    org.codelibs.fess.suggest/
    Registered: Sat Dec 20 13:04:59 UTC 2025
    - Last Modified: Mon Nov 24 03:40:05 UTC 2025
    - 8.9K bytes
    - Viewed (0)
  7. src/test/java/org/codelibs/curl/CurlRequestTest.java

            // Body with unicode characters
            String unicodeBody = "{\"message\":\"こんにちは世界\"}";
            request.body(unicodeBody);
    
            assertEquals(unicodeBody, request.body());
        }
    
        @Test
        public void testParamWithUnicodeCharacters() {
            CurlRequest request = new CurlRequest(Method.GET, "https://example.com");
    
            // Param with unicode characters
    Registered: Sat Dec 20 09:13:53 UTC 2025
    - Last Modified: Mon Nov 24 03:10:07 UTC 2025
    - 20.5K bytes
    - Viewed (0)
  8. ADDING_NEW_LANGUAGE.md

    - Restart Fess application
    
    ### Translations not showing
    
    - Clear browser cache
    - Force language with `?browser_lang=[locale]` parameter
    - Check property file encoding (should be UTF-8 or use Unicode escapes `\uXXXX`)
    - Verify property keys match exactly with base English files
    
    ### Build errors after regeneration
    
    - Check property file syntax (no unescaped special characters)
    Registered: Sat Dec 20 09:19:18 UTC 2025
    - Last Modified: Thu Nov 06 11:36:30 UTC 2025
    - 10.4K bytes
    - Viewed (0)
  9. fess-crawler/src/test/java/org/codelibs/fess/crawler/helper/RobotsTxtHelperTest.java

            try {
                robotsTxt = robotsTxtHelper.parse(in, "UTF-8");
            } finally {
                CloseableUtil.closeQuietly(in);
            }
    
            // Should handle unicode content
            assertNotNull(robotsTxt);
            assertFalse(robotsTxt.allows("/test/", "TestBot"));
        }
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 20.6K bytes
    - Viewed (0)
  10. src/test/java/org/codelibs/fess/suggest/util/SuggestUtilTest.java

            assertEquals(expected, id); // Should not be truncated
        }
    
        @Test
        public void testCreateSuggestTextIdWithUnicodeCharacters() {
            // Test with various Unicode characters that may encode differently
            String unicodeText = "日本語テスト🎌🗾こんにちは世界";
            String id = SuggestUtil.createSuggestTextId(unicodeText);
            assertNotNull(id);
            assertTrue(id.length() <= 445);
        }
    
    Registered: Sat Dec 20 13:04:59 UTC 2025
    - Last Modified: Mon Nov 24 03:40:05 UTC 2025
    - 26.7K bytes
    - Viewed (0)
Back to top