Results 1 - 10 of 42 for Archives (0.48 sec)

  1. src/main/java/jcifs/util/SecureKeyManager.java

                });
    
                // Keep only the specified number of versions
                for (int i = keepVersions; i < archives.size(); i++) {
                    removeSessionKey(archives.get(i));
                    removed++;
                }
            }
    
            if (removed > 0) {
                log.info("Cleaned up {} archived key versions", removed);
            }
        }
    
        private int extractVersion(String archivedKey) {
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Sat Aug 30 05:58:03 UTC 2025
    - 21.5K bytes
  2. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/EXTRACTOR_TESTS_README.md

    - ✅ Whitespace and empty string handling
    
    ---
    
    ### 3. ArchiveExtractorErrorHandlingTest.java
    **Purpose**: Test improved error handling in archive extractors.
    
    **Key Test Areas**:
    - Enhanced error messages
    - Partial extraction (continues on failure)
    - Invalid archive handling
    - Empty archive handling
    - Mixed valid/invalid entries
    
    **Covered Extractors**:
    - ZipExtractor
    - TarExtractor
    
    **Test Count**: 12 tests
    
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Wed Nov 19 08:55:01 UTC 2025
    - 5.7K bytes
  3. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/ZipExtractor.java

    import java.io.BufferedInputStream;
    import java.io.InputStream;
    import java.util.HashMap;
    import java.util.Map;
    
    import org.apache.commons.compress.archivers.ArchiveInputStream;
    import org.apache.commons.compress.archivers.ArchiveStreamFactory;
    import org.apache.commons.compress.archivers.zip.ZipArchiveEntry;
    import org.apache.logging.log4j.LogManager;
    import org.apache.logging.log4j.Logger;
    import org.codelibs.fess.crawler.entity.ExtractData;
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Dec 11 08:38:29 UTC 2025
    - 4.8K bytes
  4. fess-crawler/src/test/java/org/codelibs/fess/crawler/extractor/impl/ArchiveExtractorErrorHandlingTest.java

                assertTrue("Error message should indicate failure",
                        e.getMessage().contains("Failed to extract") || e.getMessage().contains("No entries could be processed"));
            }
        }
    
        /**
         * Test that TarExtractor handles invalid archive gracefully.
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Mon Nov 24 03:59:47 UTC 2025
    - 12.6K bytes
  5. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/TarExtractor.java

    import java.io.InputStream;
    import java.util.HashMap;
    import java.util.Map;
    
    import org.apache.commons.compress.archivers.ArchiveInputStream;
    import org.apache.commons.compress.archivers.ArchiveStreamFactory;
    import org.apache.commons.compress.archivers.tar.TarArchiveEntry;
    import org.apache.logging.log4j.LogManager;
    import org.apache.logging.log4j.Logger;
    import org.codelibs.fess.crawler.entity.ExtractData;
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Thu Dec 11 08:38:29 UTC 2025
    - 5.1K bytes
  6. fess-crawler/src/main/java/org/codelibs/fess/crawler/extractor/impl/LhaExtractor.java

    import jp.gr.java_conf.dangan.util.lha.LhaFile;
    import jp.gr.java_conf.dangan.util.lha.LhaHeader;
    
    /**
     * Extractor implementation for LHA (LZH) archive files.
     * This extractor can extract text content from files within LHA archives
     * by using appropriate extractors for each contained file type.
     *
     * @author shinsuke
     */
    public class LhaExtractor extends AbstractExtractor {
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Nov 23 12:19:14 UTC 2025
    - 5.9K bytes
  7. README.md

    #### PDFs and Images
    - PDF documents (text and metadata extraction)
    - Images (JPEG, PNG, GIF, TIFF, BMP)
    - Image metadata (EXIF, IPTC, XMP)
    
    #### Archives and Compressed Files
    - ZIP, TAR, GZ archives
    - LHA compression format
    - Nested archive extraction
    
    #### Web and Markup
    - HTML, XHTML with XPath support
    - XML documents
    - JSON and structured data
    
    #### Media Files
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Sun Aug 31 05:32:52 UTC 2025
    - 15.3K bytes
  8. src/main/java/jcifs/smb1/util/mime.map

    application/hep                hep              # Hummingbird Host Explorer Profiles
    application/lotus-123          wks              # Lotus 123
    application/mac-binhex40       hqx              # Macintosh binhexed archives
    application/mspowerpoint       ppt              # Microsoft Powerpoint
    application/msword             doc              # Microsoft Word
    application/octet-stream       bin exe ani      # Binary File
    Registered: Sat Dec 20 13:44:44 UTC 2025
    - Last Modified: Fri Mar 22 20:39:42 UTC 2019
    - 5.9K bytes
  9. CLAUDE.md

    - **File**: Local/network file systems
    - **FTP**: With authentication
    - **SMB/CIFS**: Windows shares (SMB1/SMB2+)
    - **Storage**: MinIO/S3-compatible
    
    ### Content Formats
    
    Office (Word, Excel, PowerPoint), PDF, Archives (ZIP, TAR, GZ), HTML, XML, JSON, Media (audio/video metadata), Images (EXIF/IPTC/XMP)
    
    ---
    
    ## Architecture
    
    ### Module Structure
    
    ```
    fess-crawler-parent/
    ├── fess-crawler/              # Core framework
    Registered: Sat Dec 20 11:21:39 UTC 2025
    - Last Modified: Fri Nov 28 17:31:34 UTC 2025
    - 10.7K bytes
  10. LICENSE

          file or class name and description of purpose be included on the
          same "printed page" as the copyright notice for easier
          identification within third-party archives.
    
       Copyright [yyyy] [name of copyright owner]
    
       Licensed under the Apache License, Version 2.0 (the "License");
       you may not use this file except in compliance with the License.
    Registered: Sat Dec 20 08:55:33 UTC 2025
    - Last Modified: Fri Apr 18 13:54:00 UTC 2025
    - 11.1K bytes