Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to identify a zip file in java?

I want to identify my archive whether it is zip or rar. But the problem I get runtime error before I can validate my file. I want to create custom notification:

public class ZipValidator {
  public void validate(Path pathToFile) throws IOException {
    try {
      ZipFile zipFile = new ZipFile(pathToFile.toFile());
      String zipname = zipFile.getName();
    } catch (InvalidZipException e) {
      throw new InvalidZipException("Not a zip file");
    }
  }
}

At the moment I have runtime error:

java.util.zip.ZipException: error in opening zip file

like image 695
Jack Avatar asked Nov 26 '15 08:11

Jack


4 Answers

I'd suggest to open a plain InputStream an reading the first few bytes (magic bytes) and not to rely on the file extension as this can be easily spoofed. Also, you can omit the overhead creating and parsing the files.

For RAR the first bytes should be 52 61 72 21 1A 07.

For ZIP it should be one of:

  • 50 4B 03 04
  • 50 4B 05 06 (empty archive)
  • 50 4B 07 08 (spanned archive).

Source: https://en.wikipedia.org/wiki/List_of_file_signatures

Another point, just looked at your code:

Why do you catch die InvalidZipException, throw it away and construct a new one? This way you lose all the information from the original exception, making it hard to debug and understand what exactly went wrong. Either don't catch it at all or, if you have to wrap it, do it right:

} catch (InvalidZipException e) {
  throw new InvalidZipException("Not a zip file", e);
}
like image 60
bratkartoffel Avatar answered Nov 11 '22 00:11

bratkartoffel


Merging the answers of nanda & bratkartoffel.

private static boolean isArchive(File f) {
    int fileSignature = 0;
    try (RandomAccessFile raf = new RandomAccessFile(f, "r")) {
        fileSignature = raf.readInt();
    } catch (IOException e) {
        // handle if you like
    }
    return fileSignature == 0x504B0304 || fileSignature == 0x504B0506 || fileSignature == 0x504B0708;
}
like image 26
Fabian Braun Avatar answered Nov 11 '22 00:11

Fabian Braun


RandomAccessFile raf = new RandomAccessFile(f, "r");

long n = raf.readInt();

raf.close();

if (n == 0x504B0304)

    System.out.println("Should be a zip file");

else

    System.out.println("Not a zip file");

You can see it in the following link. http://www.coderanch.com/t/381509/java/java/check-file-zip-file-java

like image 4
nanda Avatar answered Nov 10 '22 22:11

nanda


Exception is thrown in line

ZipFile zipFile = new ZipFile(pathToFile.toFile());

That's because if a non-ZipFile is given as parameter for the ZipFileconstructor the ZipException is thrown. So you have to check before generating a new ZipFile Object if your file path points to a correct ZipFile. One solution might be to check the extension of the file path like so

 PathMatcher matcher = FileSystems.getDefault().getPathMatcher("glob:*.zip");
 boolean extensionCorrect = matcher.matches(path); 
like image 2
hatze Avatar answered Nov 10 '22 23:11

hatze