PDF - Code Search

src/main/assemblies/files/generate-thumbnail

  if [[ -z "${im_cmd}" ]] ; then
    echo "ImageMagick (convert or magick) does not exist."
    exit 1
  fi
  check_command pdftoppm
  check_command unoconv
  tmp_pdf_file=/tmp/thumbnail.$$.pdf
  unoconv -e PageRange=1-1 -o ${tmp_pdf_file} -f pdf "${target_file}"
  if [[ ! -f ${tmp_pdf_file} ]] ; then
    echo "unoconv does not work."
    exit 1
  fi
  tmp_png_prefix=/tmp/thumbnail.png.$$

Created: Tue Mar 31 13:07:34 GMT 2026

- Last Modified: Thu Dec 04 08:02:36 GMT 2025

- 3.9K bytes

- Click Count (0)

github.com/codelibs/fess

src/main/resources/crawler/rule.xml

		</postConstruct>
		<postConstruct name="addRule">
			<arg>"mimeType"</arg>
			<!-- Supported MIME type -->
			<arg>
  "(application/xml"
+ "|application/xhtml\+xml"
+ "|application/rdf\+xml"
+ "|application/pdf"
+ "|application/x-freemind"
+ "|text/xml"
+ "|text/xml-external-parsed-entity)"
			</arg>
		</postConstruct>
	</component>

	<component name="fsFileRule" class="org.codelibs.fess.crawler.rule.impl.RegexRule" >

Created: Tue Mar 31 13:07:34 GMT 2026

- Last Modified: Sun Mar 29 08:21:02 GMT 2026

- 4.6K bytes

- Click Count (0)

github.com/codelibs/fess

src/main/resources/fess_thumbnail.xml

		<property name="commandList">
			["${path}/generate-thumbnail",
			"pdf",
			"${url}",
			"${outputFile}"]
		</property>
		<property name="generatorList">
			["${path}/generate-thumbnail"]
		</property>
		<postConstruct name="addCondition">
			<arg>"mimetype"</arg>
			<arg>"application/pdf"
			</arg>
		</postConstruct>
		<postConstruct name="register"></postConstruct>
	</component>

Created: Tue Mar 31 13:07:34 GMT 2026

- Last Modified: Wed Feb 04 14:24:39 GMT 2026

- 6K bytes

- Click Count (0)

github.com/codelibs/fess

src/main/java/org/codelibs/fess/helper/FileTypeHelper.java

        }
    }

    /**
     * Adds or updates a MIME type to file type mapping.
     *
     * @param mimetype the MIME type to map (e.g., "application/pdf")
     * @param filetype the file type classification (e.g., "pdf")
     */
    public void add(final String mimetype, final String filetype) {
        mimetypeMap.put(mimetype, filetype);
    }

    /**

Created: Tue Mar 31 13:07:34 GMT 2026

- Last Modified: Thu Jul 17 08:28:31 GMT 2025

- 4.4K bytes

- Click Count (0)

github.com/codelibs/jcifs

src/test/resources/jcifs/smb1/util/mime.map

application/msword             doc              # Microsoft Word
application/octet-stream       bin exe ani      # Binary File
application/oda                oda
application/pagemaker          pm5 pt5 pm       # PageMaker
application/pdf                pdf              # Adobe Acrobat
application/postscript         ai eps ps        # Postscript File
application/rtf                rtf              # Rich Text File
application/toolbook           tbk              # Toolbook

Created: Sun Apr 05 00:10:12 GMT 2026

- Last Modified: Thu Aug 14 05:31:44 GMT 2025

- 5.9K bytes

- Click Count (0)

github.com/codelibs/jcifs

src/main/java/jcifs/smb1/util/mime.map

application/msword             doc              # Microsoft Word
application/octet-stream       bin exe ani      # Binary File
application/oda                oda
application/pagemaker          pm5 pt5 pm       # PageMaker
application/pdf                pdf              # Adobe Acrobat
application/postscript         ai eps ps        # Postscript File
application/rtf                rtf              # Rich Text File
application/toolbook           tbk              # Toolbook

Created: Sun Apr 05 00:10:12 GMT 2026

- Last Modified: Fri Mar 22 20:39:42 GMT 2019

- 5.9K bytes

- Click Count (0)

github.com/minio/minio

docs/compression/README.md

```

Default config includes most common highly compressible content extensions and mime-types.

```bash
~ mc admin config set myminio compression extensions=".pdf" mime_types="application/pdf"
```

To show help on setting compression config values.

```bash
~ mc admin config set myminio compression
```

To enable compression for all content, no matter the extension and content type

Created: Sun Apr 05 19:28:12 GMT 2026

- Last Modified: Tue Aug 12 18:20:36 GMT 2025

- 5.2K bytes

- Click Count (0)

github.com/square/okhttp

okhttp/src/jvmTest/kotlin/okhttp3/MultipartBodyTest.kt

      |Content-Type: application/pdf; charset=utf-8
      |
      |Jesse’s Resumé
      |--AaB03x--
      |
      """.trimMargin().replace("\n", "\r\n")
    val body =
      MultipartBody
        .Builder("AaB03x")
        .setType(MultipartBody.FORM)
        .addFormDataPart(
          "attachment",
          "resumé.pdf",

Created: Fri Apr 03 11:42:14 GMT 2026

- Last Modified: Wed Mar 19 19:25:20 GMT 2025

- 10.5K bytes

- Click Count (0)

github.com/codelibs/fess-crawler

README.md

crawler.crawlerContext.setDefaultIntervalTime(1000); // 1 second
```

### URL Filtering

```java
// Include patterns
crawler.urlFilter.addInclude("https://example.com/.*");
crawler.urlFilter.addInclude(".*\\.pdf$");

// Exclude patterns  
crawler.urlFilter.addExclude(".*\\.js$");
crawler.urlFilter.addExclude(".*login.*");
```

## Supported Protocols and Formats

### Protocols

Created: Sun Apr 12 03:50:13 GMT 2026

- Last Modified: Sun Aug 31 05:32:52 GMT 2025

- 15.3K bytes

- Click Count (0)

github.com/codelibs/fess

src/main/webapp/WEB-INF/orig/view/advance.jsp

						<option value="html" <c:if test="${as.filetype.contains('html')}">selected</c:if>><la:message
								key="labels.advance_search_filetype_html"
							/></option>
						<option value="pdf" <c:if test="${as.filetype.contains('pdf')}">selected</c:if>><la:message
								key="labels.advance_search_filetype_pdf"
							/></option>
						<option value="word" <c:if test="${as.filetype.contains('word')}">selected</c:if>><la:message

Created: Tue Mar 31 13:07:34 GMT 2026

- Last Modified: Mon Feb 23 08:03:44 GMT 2026

- 14.2K bytes

- Click Count (0)

Search Options