When and how is mmap'ed memory swapped in and out?

Tags:

mmap

In my understanding, mmap'ing a file that fits into RAM will be like having the file in memory.

Say that we have 16G of RAM, and we first mmap a 10G file that we use for a while. This should be fairly efficient in terms of access. If we then mmap a second 10G file, will that cause the first one be swapped out? Or parts of it? If so, when will this happen? At the mmap call, or on accessing the memory area of the newly loaded file?

And if we want to access the memory of the pointer for the first file again, will that make it load the swap the file in again? So, say we alternate reading between memory corresponding to the first file and the second file, will that lead to disastrous performance?

Lastly, if any of this is true, would it be better to mmap several smaller files?

938

asked Apr 21 '17 11:04

Masseman

2 Answers

As has been discussed, your file will be accessed in pages; on x86_64 (and IA32) architectures, a page is typically 4096 bytes. So, very little if any of the file will be loaded at mmap time. The first time you access some page in either file, then the kernel will generate a page fault and load some of your file. The kernel may prefetch pages, so more than one page may be loaded. Whether it does this depends on your access pattern.

In general, your performance should be good if your working set fits in memory. That is, if you're only regularly accesning 3G of file across the two files, so long as you have 3G of RAM available to your process, things should generally be fine.

On a 64-bit system there's no reason to split the files, and you'll be fine if the parts you need tend to fit in RAM.

Note that if you mmap an existing file, swap space will not be required to read that file. When an object is backed by a file on the filesystem, the kernel can read from that file rather than swap space. However, if you specify MMAP_PRIVATE in your call to mmap, swap space may be required to hold changed pages until you call msync.

131

answered Sep 20 '22 09:09

Sam Hartman

Your question does not have a definitive answer, as swapping in/out is handled by your kernel, and each kernel will have a different implementation (and linux itself offers different profiles depending on your usage, RT, desktop, server…)

Generally speaking, though, whatever you load in memory is done using pages, so your mmap'ed file in memory is loaded (and offloaded) by pages between all the levels of memory (the caches, RAM and swap). Then if you load two 10GB data into memory, you'll have parts of both between the RAM and your Swap, and the kernel will try to keep in RAM the pages you're likely to use now and guess what you'll load next.

What it means is that if you do truly random access to a few bytes of data in both files alternatively, you should expect awful performance, if you access contiguous chunks sequentially from both files alternatively, you should expect decent performance.

You can read some more details about kernel paging into virtual memory theory:

https://0xax.gitbooks.io/linux-insides/content/Theory/Paging.html
https://en.wikipedia.org/wiki/Paging

answered Sep 22 '22 09:09

zmo

Related questions
                            
                                ALSA Configuration How To Combine MMAP Emulation and Ladspa Plugin in asound.conf
                            
                                Starting FOREVER or PM2 as WWW-DATA from a PHP script
                            
                                Java program is getting slower after running for a while
                            
                                Eventloop has high ksoftirqd load; nginx does not but does same system-calls. Why?
                            
                                How can I find the alpha shape (concave hull) of a 2d point cloud?
                            
                                Docker: --ipc=host and security
                            
                                Send keystrokes to non-active GUI application without occupying the keyboard
                            
                                Calling Haskell from Java with C in between
                            
                                Docker container can reach DNS but not resolve hosts
                            
                                Is there a way to improve performance of linux pipes?
                            
                                where is amd64 psABI? [duplicate]
                            
                                How to conveniently sync a file between two git repositories
                            
                                How to extract C source code from .so file?
                            
                                Creating a high-performance network server in C++
                            
                                Why Linux/gnu linker chose address 0x400000?
                            
                                Amazon linux AMI vs Ubuntu
                            
                                GUI debugger for c++ on linux [closed]
                            
                                why is "autoreconf" not used often?
                            
                                npm hangs on postinstall / unlock
                            
                                How linux identify a particular file system to execute system call

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With