Search Options

Results per page
Sort
Preferred Languages
Advance

Results 1 - 10 of 26 for archives (0.03 sec)

  1. README.md

    #### PDFs and Images
    - PDF documents (text and metadata extraction)
    - Images (JPEG, PNG, GIF, TIFF, BMP)
    - Image metadata (EXIF, IPTC, XMP)
    
    #### Archives and Compressed Files
    - ZIP, TAR, GZ archives
    - LHA compression format
    - Nested archive extraction
    
    #### Web and Markup
    - HTML, XHTML with XPath support
    - XML documents
    - JSON and structured data
    
    #### Media Files
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
    - Viewed (0)
  2. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/ZipExtractor.java

    import java.io.BufferedInputStream;
    import java.io.InputStream;
    import java.util.HashMap;
    import java.util.Map;
    
    import org.apache.commons.compress.archivers.ArchiveInputStream;
    import org.apache.commons.compress.archivers.ArchiveStreamFactory;
    import org.apache.commons.compress.archivers.zip.ZipArchiveEntry;
    import org.apache.logging.log4j.LogManager;
    import org.apache.logging.log4j.Logger;
    import org.codelibs.fess.crawler.entity.ExtractData;
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 4.5K bytes
    - Viewed (0)
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TarExtractor.java

    import java.io.InputStream;
    import java.util.HashMap;
    import java.util.Map;
    
    import org.apache.commons.compress.archivers.ArchiveInputStream;
    import org.apache.commons.compress.archivers.ArchiveStreamFactory;
    import org.apache.commons.compress.archivers.tar.TarArchiveEntry;
    import org.apache.logging.log4j.LogManager;
    import org.apache.logging.log4j.Logger;
    import org.codelibs.core.io.CloseableUtil;
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 5K bytes
    - Viewed (0)
  4. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/LhaExtractor.java

    import jp.gr.java_conf.dangan.util.lha.LhaFile;
    import jp.gr.java_conf.dangan.util.lha.LhaHeader;
    
    /**
     * Extractor implementation for LHA (LZH) archive files.
     * This extractor can extract text content from files within LHA archives
     * by using appropriate extractors for each contained file type.
     *
     * @author shinsuke
     */
    public class LhaExtractor extends AbstractExtractor {
    Registered: Sun Sep 21 03:50:09 UTC 2025
    - Last Modified: Sun Jul 06 02:13:03 UTC 2025
    - 5.8K bytes
    - Viewed (0)
  5. .teamcity/performance-test-durations.json

        "linux" : 349,
        "windows" : 702,
        "macOs" : 228
      } ]
    }, {
      "scenario" : "org.gradle.performance.regression.java.JavaIncrementalExecutionPerformanceTest.assemble for non-abi change with file system sensitive archives",
      "durations" : [ {
        "testProject" : "largeJavaMultiProject",
        "linux" : 508,
        "windows" : 918,
        "macOs" : 263
      }, {
        "testProject" : "largeMonolithicJavaProject",
        "linux" : 1577
    Registered: Wed Sep 10 11:36:15 UTC 2025
    - Last Modified: Mon Sep 08 06:58:31 UTC 2025
    - 25.9K bytes
    - Viewed (0)
  6. .github/workflows/codeql-analysis.yml

            DEVELOCITY_ACCESS_KEY: ${{ secrets.DEVELOCITY_ACCESS_KEY }}
            # Potential stop-gap solution for ReadTimeout issues with the Gradle Build Cache
            # https://gradle.slack.com/archives/CHDLT99C6/p1636477584059200
            GRADLE_OPTS: -Dhttp.keepAlive=false
    
        - name: Compile with Gradle without publishing a Build Scan
          if: ${{ matrix.language == 'java' && github.repository_owner != 'gradle' }}
    Registered: Wed Sep 10 11:36:15 UTC 2025
    - Last Modified: Thu Aug 21 23:08:02 UTC 2025
    - 4K bytes
    - Viewed (0)
  7. ci/official/utilities/setup_macos.sh

        git clone https://github.com/pyenv/pyenv.git "$PYENV_ROOT"
      fi
      echo "Upgraded pyenv version: $(pyenv --version)"
    fi
    
    # Scheduled nightly and release builds upload build artifacts (Pip packages,
    # Libtensorflow archives) to GCS buckets. TFCI Mac VMs need to authenticate as
    # a service account that has the right permissions to be able to do so.
    set +x
    if [[ -n "${GOOGLE_APPLICATION_CREDENTIALS:-}" ]]; then
    Registered: Tue Sep 09 12:39:10 UTC 2025
    - Last Modified: Mon Jun 23 23:03:02 UTC 2025
    - 6.8K bytes
    - Viewed (0)
  8. .teamcity/performance-tests-ci.json

          }
        } ]
      }, {
        "testId" : "org.gradle.performance.regression.java.JavaIncrementalExecutionPerformanceTest.assemble for non-abi change with file system sensitive archives",
        "groups" : [ {
          "testProject" : "largeJavaMultiProject",
          "coverage" : {
            "per_commit" : [ "linux", "windows", "macOs" ]
          }
        }, {
    Registered: Wed Sep 10 11:36:15 UTC 2025
    - Last Modified: Thu Jul 10 02:18:03 UTC 2025
    - 33.4K bytes
    - Viewed (0)
  9. doc/godebug.md

    There is no plan to remove any of these settings.
    
    ### Go 1.20
    
    Go 1.20 introduced support for rejecting insecure paths in tar and zip archives,
    controlled by the [`tarinsecurepath` setting](/pkg/archive/tar/#Reader.Next)
    and the [`zipinsecurepath` setting](/pkg/archive/zip/#NewReader).
    These default to `tarinsecurepath=1` and `zipinsecurepath=1`,
    preserving the behavior of earlier versions of Go.
    Registered: Tue Sep 09 11:13:09 UTC 2025
    - Last Modified: Tue Jul 08 18:30:38 UTC 2025
    - 22.9K bytes
    - Viewed (0)
  10. .bazelrc

    # hope to provide a better mechanism for control over symbol exporting, and
    # then tackle this issue again.
    #
    # TODO: Remove the following two lines once TF doesn't depend on Bazel wrapping
    # all library archives in -whole_archive -no_whole_archive.
    build --noincompatible_remove_legacy_whole_archive
    build --features=-force_no_whole_archive
    build --host_features=-force_no_whole_archive
    
    Registered: Tue Sep 09 12:39:10 UTC 2025
    - Last Modified: Fri Aug 22 21:03:34 UTC 2025
    - 56K bytes
    - Viewed (0)
Back to top