In malloc, why use brk at all? Why not just use mmap?

Typical implementations of malloc use brk/sbrk as the primary means of claiming memory from the OS. However, they also use mmap to get chunks for large allocations. Is there a real benefit to using brk instead of mmap, or is it just tradition? Wouldn't it work just as well to do it all with mmap?

(Note: I use sbrk and brk interchangeably here because they are interfaces to the same Linux system call, brk.)

For reference, here are a couple of documents describing the glibc malloc:

GNU C Library Reference Manual: The GNU Allocator
https://www.gnu.org/software/libc/manual/html_node/The-GNU-Allocator.html

glibc wiki: Overview of Malloc
https://sourceware.org/glibc/wiki/MallocInternals

What these documents describe is that sbrk is used to claim a primary arena for small allocations, mmap is used to claim secondary arenas, and mmap is also used to claim space for large objects ("much larger than a page").

The use of both the application heap (claimed with sbrk) and mmap introduces some additional complexity that might be unnecessary:

"Allocated Arena - the main arena uses the application's heap. Other arenas use mmap'd heaps. To map a chunk to a heap, you need to know which case applies. If this bit is 0, the chunk comes from the main arena and the main heap. If this bit is 1, the chunk comes from mmap'd memory and the location of the heap can be computed from the chunk's address."

[Glibc malloc is derived from ptmalloc, which was derived from dlmalloc, which was started in 1987.]

The jemalloc manpage (http://jemalloc.net/jemalloc.3.html) has this to say:

"Traditionally, allocators have used sbrk(2) to obtain memory, which is suboptimal for several reasons, including race conditions, increased fragmentation, and artificial limitations on maximum usable memory. If sbrk(2) is supported by the operating system, this allocator uses both mmap(2) and sbrk(2), in that order of preference; otherwise only mmap(2) is used."

So, they even say here that sbrk is suboptimal but they use it anyway, even though they've already gone to the trouble of writing their code so that it works without it.

[Writing of jemalloc started in 2005.]

UPDATE: Thinking about this more, that bit about "in that order of preference" gives me a line of inquiry. Why the order of preference? Are they just using sbrk as a fallback in case mmap is not supported (or lacks necessary features), or is it possible for the process to get into some state where it can use sbrk but not mmap? I'll look at their code and see if I can figure out what it's doing.

I'm asking because I'm implementing a garbage collection system in C, and so far I see no reason to use anything besides mmap. I'm wondering if there's something I'm missing, though.

(In my case I have an additional reason to avoid brk, which is that I might need to use malloc at some point.)

296

asked Apr 19 '19 22:04

Nate C-K

2 Answers

The system call brk() has the advantage of having only a single data item to track memory use, which happily is also directly related to the total size of the heap.

This has been in the exact same form since 1975's Unix V6. Mind you, V6 supported a user address space of 65,536 bytes. So there wasn't a lot of thought given to managing much more than 64K, certainly not terabytes.

Using mmap seems reasonable until I start wondering how an altered or added-on garbage collector could use mmap without also rewriting the allocation algorithm.

Will that work nicely with realloc(), fork(), etc.?

168

answered Oct 04 '22 11:10

wallyk

mmap() didn't exist in the early versions of Unix. At that time, brk() was the only way to increase the size of a process's data segment. The first version of Unix with mmap() was SunOS in the mid-1980s; the first open-source version was BSD-Reno in 1990.

And to be usable for malloc(), mmap() must not require a real file to back the memory. In 1988 SunOS implemented /dev/zero for this purpose, and in the 1990s HP-UX implemented the MAP_ANONYMOUS flag.

There are now versions of mmap() that offer a variety of methods to allocate the heap.

answered Oct 04 '22 09:10

Barmar
