POSIX environments provide at least two ways of accessing files. There's the standard system calls <code>open()</code>, <code>read()</code>, <code>write()</code>, and friends, but there's also the option of using <code>mmap()</code> to map the file into virtual memory. When is it preferable to use one over the other? What're their individual advantages that merit including two interfaces?

<code>mmap</code> is great if you have multiple processes accessing data in a read only fashion from the same file, which is common in the kind of server systems I write. <code>mmap</code> allows all those processes to share the same physical memory pages, saving a lot of memory. <code>mmap</code> also allows the operating system to optimize paging operations. For example, consider two programs; program <code>A</code> which reads in a <code>1MB</code> file into a buffer creating with <code>malloc</code>, and program B which <code>mmaps</code> the 1MB file into memory. If the operating system has to swap part of <code>A</code>'s memory out, it must write the contents of the buffer to swap before it can reuse the memory. In <code>B</code>'s case any unmodified <code>mmap</code>'d pages can be reused immediately because the OS knows how to restore them from the existing file they were <code>mmap</code>'d from. (The OS can detect which pages are unmodified by initially marking writable <code>mmap</code>'d pages as read only and catching seg faults, similar to Copy on Write strategy). <code>mmap</code> is also useful for inter process communication. You can <code>mmap</code> a file as read / write in the processes that need to communicate and then use synchronization primitives in the <code>mmap'd</code> region (this is what the <code>MAP_HASSEMAPHORE</code> flag is for). One place <code>mmap</code> can be awkward is if you need to work with very large files on a 32 bit machine. This is because <code>mmap</code> has to find a contiguous block of addresses in your process's address space that is large enough to fit the entire range of the file being mapped. This can become a problem if your address space becomes fragmented, where you might have 2 GB of address space free, but no individual range of it can fit a 1 GB file mapping. In this case you may have to map the file in smaller chunks than you would like to make it fit. Another potential awkwardness with <code>mmap</code> as a replacement for read / write is that you have to start your mapping on offsets of the page size. If you just want to get some data at offset <code>X</code> you will need to fixup that offset so it's compatible with <code>mmap</code>. And finally, read / write are the only way you can work with some types of files. <code>mmap</code> can't be used on things like pipes and ttys.

When should I use mmap for file access?

Tags:

c

posix

file-io

mmap

POSIX environments provide at least two ways of accessing files. There's the standard system calls open(), read(), write(), and friends, but there's also the option of using mmap() to map the file into virtual memory.

When is it preferable to use one over the other? What're their individual advantages that merit including two interfaces?

495

asked Nov 03 '08 07:11

Peter Burns

2 Answers

mmap is great if you have multiple processes accessing data in a read only fashion from the same file, which is common in the kind of server systems I write. mmap allows all those processes to share the same physical memory pages, saving a lot of memory.

mmap also allows the operating system to optimize paging operations. For example, consider two programs; program A which reads in a 1MB file into a buffer creating with malloc, and program B which mmaps the 1MB file into memory. If the operating system has to swap part of A's memory out, it must write the contents of the buffer to swap before it can reuse the memory. In B's case any unmodified mmap'd pages can be reused immediately because the OS knows how to restore them from the existing file they were mmap'd from. (The OS can detect which pages are unmodified by initially marking writable mmap'd pages as read only and catching seg faults, similar to Copy on Write strategy).

mmap is also useful for inter process communication. You can mmap a file as read / write in the processes that need to communicate and then use synchronization primitives in the mmap'd region (this is what the MAP_HASSEMAPHORE flag is for).

One place mmap can be awkward is if you need to work with very large files on a 32 bit machine. This is because mmap has to find a contiguous block of addresses in your process's address space that is large enough to fit the entire range of the file being mapped. This can become a problem if your address space becomes fragmented, where you might have 2 GB of address space free, but no individual range of it can fit a 1 GB file mapping. In this case you may have to map the file in smaller chunks than you would like to make it fit.

Another potential awkwardness with mmap as a replacement for read / write is that you have to start your mapping on offsets of the page size. If you just want to get some data at offset X you will need to fixup that offset so it's compatible with mmap.

And finally, read / write are the only way you can work with some types of files. mmap can't be used on things like pipes and ttys.

176

answered Nov 08 '22 10:11

Don Neufeld

One area where I found mmap() to not be an advantage was when reading small files (under 16K). The overhead of page faulting to read the whole file was very high compared with just doing a single read() system call. This is because the kernel can sometimes satisify a read entirely in your time slice, meaning your code doesn't switch away. With a page fault, it seemed more likely that another program would be scheduled, making the file operation have a higher latency.

answered Nov 08 '22 10:11

Ben Combee

Related questions
                            
                                Purpose of Unions in C and C++
                            
                                Wrapping a C library in Python: C, Cython or ctypes?
                            
                                What's the rationale for null terminated strings?
                            
                                Arrow operator (->) usage in C
                            
                                Removing trailing newline character from fgets() input
                            
                                Why do you have to link the math library in C?
                            
                                Why does glibc's strlen need to be so complicated to run quickly?
                            
                                "register" keyword in C?
                            
                                How do malloc() and free() work?
                            
                                What is a bus error? Is it different from a segmentation fault?
                            
                                How to convert a string to integer in C?
                            
                                Why does NaN - NaN == 0.0 with the Intel C++ Compiler?
                            
                                How do I use valgrind to find memory leaks?
                            
                                Why does the arrow (->) operator in C exist?
                            
                                How do I create an array of strings in C?
                            
                                Strange definitions of TRUE and FALSE macros
                            
                                Stack smashing detected
                            
                                Why does rand() + rand() produce negative numbers?
                            
                                What does a type followed by _t (underscore-t) represent?
                            
                                How to convert an int to string in C?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With