Why is using BufferedInputStream to read a file byte by byte faster than using FileInputStream?

Tags:

I was trying to read a file into an array by using FileInputStream, and an ~800KB file took about 3 seconds to read into memory. I then tried the same code except with the FileInputStream wrapped into a BufferedInputStream and it took about 76 milliseconds. Why is reading a file byte by byte done so much faster with a BufferedInputStream even though I'm still reading it byte by byte? Here's the code (the rest of the code is entirely irrelevant). Note that this is the "fast" code. You can just remove the BufferedInputStream if you want the "slow" code:

InputStream is = null;      try {         is = new BufferedInputStream(new FileInputStream(file));          int[] fileArr = new int[(int) file.length()];          for (int i = 0, temp = 0; (temp = is.read()) != -1; i++) {             fileArr[i] = temp;         }

BufferedInputStream is over 30 times faster. Far more than that. So, why is this, and is it possible to make this code more efficient (without using any external libraries)?

613

asked Sep 03 '13 19:09

ZimZim

2 Answers

In FileInputStream, the method read() reads a single byte. From the source code:

/**  * Reads a byte of data from this input stream. This method blocks  * if no input is yet available.  *  * @return     the next byte of data, or <code>-1</code> if the end of the  *             file is reached.  * @exception  IOException  if an I/O error occurs.  */ public native int read() throws IOException;

This is a native call to the OS which uses the disk to read the single byte. This is a heavy operation.

With a BufferedInputStream, the method delegates to an overloaded read() method that reads 8192 amount of bytes and buffers them until they are needed. It still returns only the single byte (but keeps the others in reserve). This way the BufferedInputStream makes less native calls to the OS to read from the file.

For example, your file is 32768 bytes long. To get all the bytes in memory with a FileInputStream, you will require 32768 native calls to the OS. With a BufferedInputStream, you will only require 4, regardless of the number of read() calls you will do (still 32768).

As to how to make it faster, you might want to consider Java 7's NIO FileChannel class, but I have no evidence to support this.

Note: if you used FileInputStream's read(byte[], int, int) method directly instead, with a byte[>8192] you wouldn't need a BufferedInputStream wrapping it.

answered Sep 28 '22 06:09

Sotirios Delimanolis

A BufferedInputStream wrapped around a FileInputStream, will request data from the FileInputStream in big chunks (512 bytes or so by default, I think.) Thus if you read 1000 characters one at a time, the FileInputStream will only have to go to the disk twice. This will be much faster!

answered Sep 28 '22 06:09

usha

Related questions
                            
                                Java 8 pass method as parameter
                            
                                Does C# have a way of giving me an immutable Dictionary?
                            
                                File upload along with other object in Jersey restful web service
                            
                                Is String Literal Pool a collection of references to the String Object, Or a collection of Objects
                            
                                How to remove a task from ScheduledExecutorService?
                            
                                How to declare a constant in Java?
                            
                                Difference between Intent.ACTION_GET_CONTENT and Intent.ACTION_PICK
                            
                                Lambda this reference in java
                            
                                How is values() implemented for Java 6 enums?
                            
                                How to convert Joda-Time DateTime to java.util.Date and vice versa?
                            
                                In Java, can & be faster than &&?
                            
                                Strange array return type
                            
                                What's the point of Guava checkNotNull
                            
                                Replacing if else statement with pattern
                            
                                Where is the application.properties file in a Spring Boot project?
                            
                                Java 8: How to create a ZonedDateTime from an Epoch value?
                            
                                long timestamp to LocalDateTime
                            
                                Gson and deserializing an array of objects with arrays in it
                            
                                Difference between RxJava API and the Java 9 Flow API
                            
                                Should you check if the map containsKey before using ConcurrentMap's putIfAbsent

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why is using BufferedInputStream to read a file byte by byte faster than using FileInputStream?

Tags:

java

inputstream

file-io

fileinputstream

ZimZim

People also ask

2 Answers

Sotirios Delimanolis

usha

Recent Activity

Donate For Us