Performance / stability of a Memory Mapped file - Native or MappedByteBuffer - vs. plain ol' FileOutputStream

Tags:

I support a legacy Java application that uses flat files (plain text) for persistence. Due to the nature of the application, the size of these files can reach 100s MB per day, and often the limiting factor in application performance is file IO. Currently, the application uses a plain ol' java.io.FileOutputStream to write data to disk.

Recently, we've had several developers assert that using memory-mapped files, implemented in native code (C/C++) and accessed via JNI, would provide greater performance. However, FileOutputStream already uses native methods for its core methods (i.e. write(byte[])), so it appears a tenuous assumption without hard data or at least anecdotal evidence.

I have several questions on this:

Is this assertion really true? Will memory mapped files always provide faster IO compared to Java's FileOutputStream?
Does the class MappedByteBuffer accessed from a FileChannel provide the same functionality as a native memory mapped file library accessed via JNI? What is MappedByteBuffer lacking that might lead you to use a JNI solution?
What are the risks of using memory-mapped files for disk IO in a production application? That is, applications that have continuous uptime with minimal reboots (once a month, max). Real-life anecdotes from production applications (Java or otherwise) preferred.

Question #3 is important - I could answer this question myself partially by writing a "toy" application that perf tests IO using the various options described above, but by posting to SO I'm hoping for real-world anecdotes / data to chew on.

[EDIT] Clarification - each day of operation, the application creates multiple files that range in size from 100MB to 1 gig. In total, the application might be writing out multiple gigs of data per day.

293

asked Feb 11 '09 15:02

noahlz

2 Answers

Memory mapped I/O will not make your disks run faster(!). For linear access it seems a bit pointless.

A NIO mapped buffer is the real thing (usual caveat about any reasonable implementation).

As with other NIO direct allocated buffers, the buffers are not normal memory and wont get GCed as efficiently. If you create many of them you may find that you run out of memory/address space without running out of Java heap. This is obviously a worry with long running processes.

114

answered Oct 17 '22 05:10

Tom Hawtin - tackline

You might be able to speed things up a bit by examining how your data is being buffered during writes. This tends to be application specific as you would need an idea of the expected data writing patterns. If data consistency is important, there will be tradeoffs here.

If you are just writing out new data to disk from your application, memory mapped I/O probably won't help much. I don't see any reason you would want to invest time in some custom coded native solution. It just seems like too much complexity for your application, from what you have provided so far.

If you are sure you really need better I/O performance - or just O performance in your case, I would look into a hardware solution such as a tuned disk array. Throwing more hardware at the problem is often times more cost effective from a business point of view than spending time optimizing software. It is also usually quicker to implement and more reliable.

In general, there are a lot of pitfalls in over optimization of software. You will introduce new types of problems to your application. You might run into memory issues/ GC thrashing which would lead to more maintenance/tuning. The worst part is that many of these issues will be hard to test before going into production.

If it were my app, I would probably stick with the FileOutputStream with some possibly tuned buffering. After that I'd use the time honored solution of throwing more hardware at it.

answered Oct 17 '22 05:10

Gary

Related questions
                            
                                @NamedNativeQuery with @SqlResultSetMapping for non-entity
                            
                                Android Studio 3.1.3 Gradle Sync Error. Could Not Download Gradle-Core.jar
                            
                                Could not delete path
                            
                                Enum of directions with opposites [duplicate]
                            
                                When to use Optional.orElse() rather than Optional.orElseGet() [duplicate]
                            
                                Spring boot: 404 error when calling JSP using controller
                            
                                What is the difference in properties java.runtime.version and java.version
                            
                                What does the jlink option compress do?
                            
                                How to automate shadow DOM elements using selenium?
                            
                                VarHandle get/setOpaque
                            
                                Spring Boot controller not responding to POST request
                            
                                Is there a way to check if a Stream contains all collection elements?
                            
                                openapi springboot generator jackson no String-argument constructor/factory method to deserialize from String value
                            
                                Record cannot get parameter names from constructors?
                            
                                Calling .NET assembly from Java: JVM crashes
                            
                                What is the java signal dispatcher thread?
                            
                                How to output a String on multiple lines using Graphics
                            
                                Benefits of using JSTL vs Velocity for view layer in MVC app?
                            
                                How to make full screen java applets?
                            
                                How can I access spreadsheets in the open document format (.ods) with java?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Performance / stability of a Memory Mapped file - Native or MappedByteBuffer - vs. plain ol' FileOutputStream

Tags:

java

performance

file-io

java-native-interface

production

noahlz

People also ask

2 Answers

Tom Hawtin - tackline

Gary

Recent Activity

Donate For Us