Results 1 - 4 of 4 for punctuation (0.07 sec)

  1. src/test/java/org/codelibs/fess/suggest/analysis/SuggestAnalyzerTest.java

                assertEquals("WORD" + i, ((TestAnalyzeToken) tokens.get(i)).getReading());
            }
        }
    
        @Test
        public void testAnalyzeWithPunctuation() {
            // Test analyze with various punctuation
            String text = "Hello, world! How are you? I'm fine.";
            String field = "content";
            String lang = "en";
    
            List<AnalyzeToken> tokens = analyzer.analyze(text, field, lang);
    
    Registered: Fri Sep 19 09:08:11 UTC 2025
    - Last Modified: Mon Sep 01 13:33:03 UTC 2025
    - 15.7K bytes
    - Viewed (0)
  2. src/bytes/bytes.go

    // Title treats s as UTF-8-encoded bytes and returns a copy with all Unicode letters that begin
    // words mapped to their title case.
    //
    // Deprecated: The rule Title uses for word boundaries does not handle Unicode
    // punctuation properly. Use golang.org/x/text/cases instead.
    func Title(s []byte) []byte {
    	// Use a closure here to remember state.
    	// Hackish but effective. Depends on Map scanning in order and calling
    	// the closure once per rune.
    Registered: Tue Sep 09 11:13:09 UTC 2025
    - Last Modified: Wed Sep 03 14:04:47 UTC 2025
    - 35.5K bytes
    - Viewed (0)
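
The deprecation note in the `bytes.Title` snippet says its word-boundary rule mishandles Unicode punctuation. A minimal sketch of that behavior, using only the standard library, is below. The helper `titleOf` is a hypothetical name introduced for illustration. The replacement the comment recommends, `golang.org/x/text/cases`, is not shown because it lives outside the standard library.

```go
package main

import (
	"bytes"
	"fmt"
)

// titleOf is a hypothetical helper that wraps the deprecated bytes.Title
// purely to demonstrate its word-boundary behavior.
func titleOf(s string) string {
	return string(bytes.Title([]byte(s)))
}

func main() {
	// An ASCII comma is treated as a word separator,
	// so both words are title-cased: prints Hello,World
	fmt.Println(titleOf("hello,world"))

	// U+2014 is Unicode punctuation but not whitespace, so it is not
	// treated as a separator and the second word stays lowercase.
	fmt.Println(titleOf("hello\u2014world"))
}
```

The second call is the kind of input the deprecation note warns about: the word after the Unicode punctuation mark is not recognized as a new word.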
  3. okhttp/src/jvmTest/kotlin/okhttp3/HttpUrlTest.kt

        // six-per-em space
        assertThat(parse("http://h/\u2006").encodedPath).isEqualTo("/%E2%80%86")
        // figure space
        assertThat(parse("http://h/\u2007").encodedPath).isEqualTo("/%E2%80%87")
        // punctuation space
        assertThat(parse("http://h/\u2008").encodedPath).isEqualTo("/%E2%80%88")
        // thin space
        assertThat(parse("http://h/\u2009").encodedPath).isEqualTo("/%E2%80%89")
        // hair space
    Registered: Fri Sep 05 11:42:10 UTC 2025
    - Last Modified: Mon Aug 04 07:38:48 UTC 2025
    - 69.9K bytes
    - Viewed (0)
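
The assertions in the `HttpUrlTest.kt` snippet expect each Unicode space to appear in the encoded path as the percent-encoded bytes of its UTF-8 form. The sketch below reproduces those expected values with Go's `net/url` package instead of OkHttp, so it illustrates the encoding rule rather than OkHttp's parser. `encodeSegment` is a hypothetical helper name.

```go
package main

import (
	"fmt"
	"net/url"
)

// encodeSegment is a hypothetical helper: it percent-encodes a single path
// segment with net/url, standing in for the OkHttp parser used in the test.
func encodeSegment(seg string) string {
	return "/" + url.PathEscape(seg)
}

func main() {
	// Six-per-em, figure, punctuation, and thin space, as in the snippet.
	for _, r := range []rune{0x2006, 0x2007, 0x2008, 0x2009} {
		fmt.Printf("U+%04X -> %s\n", r, encodeSegment(string(r)))
	}
}
```

Each of these code points occupies three bytes in UTF-8 (`E2 80 8x`), which is why every encoded path in the snippet has three `%XX` groups.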
  4. RELEASE.md

        *   Added additional `standardize` and `split` modes to `TextVectorization`:
            *   `standardize="lower"` will lowercase inputs.
            *   `standardize="string_punctuation"` will remove all punctuation.
            *   `split="character"` will split on every unicode character.
        *   Added an `output_mode` argument to the `Discretization` and `Hashing`
            layers with the same semantics as other preprocessing layers. All
    Registered: Tue Sep 09 12:39:10 UTC 2025
    - Last Modified: Mon Aug 18 20:54:38 UTC 2025
    - 740K bytes
    - Viewed (2)
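
The `RELEASE.md` snippet names three text-preprocessing behaviors: lowercasing, punctuation removal, and per-character splitting. The sketch below shows what each could look like on a plain string. It is not the `TextVectorization` implementation. The function names are hypothetical, and treating "punctuation" as the ASCII punctuation set is an assumption.

```go
package main

import (
	"fmt"
	"strings"
	"unicode/utf8"
)

// asciiPunct is an assumed definition of "punctuation" for this sketch.
const asciiPunct = "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~"

// stripPunctuation (hypothetical) drops every rune found in asciiPunct.
func stripPunctuation(s string) string {
	return strings.Map(func(r rune) rune {
		if strings.ContainsRune(asciiPunct, r) {
			return -1 // a negative value tells strings.Map to drop the rune
		}
		return r
	}, s)
}

// splitCharacters (hypothetical) returns one string per Unicode code point.
func splitCharacters(s string) []string {
	out := make([]string, 0, utf8.RuneCountInString(s))
	for _, r := range s {
		out = append(out, string(r))
	}
	return out
}

func main() {
	text := "Hello, world!"
	fmt.Println(strings.ToLower(text))   // lowercase the input
	fmt.Println(stripPunctuation(text))  // remove punctuation
	fmt.Println(splitCharacters("h\u00e9llo")) // split per code point
}
```

Iterating with `range` over the string splits on code points rather than bytes, so a multi-byte character such as U+00E9 stays in one piece.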