Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

XSSFWorkbook takes a lot of time to load

I am using the following code:

File file = new File("abc.xlsx");
InputStream st = new FileInputStream(file);
XSSFWorkbook wb = new XSSFWorkbook(st);

The xlsx file itself has 25,000 rows and each row has content in 500 columns. During debugging, I saw that the third row where I create a XSSFWorkbook, it takes a lot of time (1 hour!) to complete this statement.

Is there a better way to access the values of the original xlsx file?

like image 667
London guy Avatar asked Jun 22 '12 10:06

London guy


2 Answers

First up, don't load a XSSFWorkbook from an InputStream when you have a file! Using an InputStream requires buffering of everything into memory, which eats up space and takes time. Since you don't need to do that buffering, don't!

If you're running with the latest nightly builds of POI, then it's very easy. Your code becomes:

File file = new File("C:\\D\\Data Book.xlsx");
OPCPackage opcPackage = OPCPackage.open(file);
XSSFWorkbook workbook = new XSSFWorkbook(opcPackage);

Otherwise, it's very similar:

File file = new File("C:\\D\\Data Book.xlsx");
OPCPackage opcPackage = OPCPackage.open(file.getAbsolutePath());
XSSFWorkbook workbook = new XSSFWorkbook(opcPackage);
like image 127
Gagravarr Avatar answered Nov 10 '22 09:11

Gagravarr


Consider using the Streaming version of POI. This will load a subset of the file into memory as needed. It is the recommended method when dealing with large files.

POI SXSSF

like image 1
John B Avatar answered Nov 10 '22 09:11

John B