What data structure is used to implement the dynamic memory allocation heap?

What I'm not asking:

While searching online, I've seen tons of descriptions of how to implement heaps with severe restrictions.
To name a few, I've seen lots of descriptions of how to implement:

Heaps that never release memory back to the OS (!)
Heaps that only give reasonable performance on small, similarly-sized blocks
Heaps that only give reasonable performance for large, contiguous blocks
etc.

And it's funny, they all avoid the harder question:
How are "normal", general-purpose heaps (like the one behind malloc, HeapCreate) implemented?

What data structures (and perhaps algorithms) do they use?

854

asked Dec 09 '12 05:12

user541686

1 Answers

Allocators tend to be quite complex and often differ significantly in how they're implemented.

You can't really describe them in terms of one common data structure or algorithm, but there are some common themes:

Memory is taken from the system in large chunks -- often megabytes at a time.
These chunks are then split up into various smaller chunks as you perform allocations. Not exactly the same size as you allocate, but usually in certain ranges (200-250 bytes, 251-500 bytes, etc.). Sometimes this is multi-tiered, where you'd have an additional layer of "medium chunks" which come before your actual requests.
Controlling which "large chunk" to break a piece off of is a very difficult and important thing to do -- this greatly affects memory fragmentation.
One or more free pools (aka "free list", "memory pool", "lookaside list") are maintained for each of these ranges. Sometimes even thread-local pools. This can greatly speed up a pattern of allocating/deallocating many objects of similar size.
Large allocations are treated a bit differently so as to not waste a lot of RAM and not be pooled quite so much if at all.

If you wanted to check out some source code, jemalloc is a modern high-performance allocator and should be representative in complexity of other common ones. TCMalloc is another common general-purpose allocator, and their website goes into all the gory implementation details. Intel's Thread Building Blocks has an allocator built specifically for high concurrency.

One interesting difference can be seen between Windows and *nix. In *nix, the allocator has very low-level control over the address space an app uses. In Windows, you basically have a course-grained, slow allocator VirtualAlloc to base your own allocator off of.

This results in *nix-compatible allocators typically directly giving you an malloc/free implementation where it's assumed you'll only use one allocator for everything (otherwise they'd trample each-other), while Windows-specific allocators provide additional functions, leaving malloc/free alone, and can be used in harmony (for instance, you can use HeapCreate to make private heaps which can work alongside others).

In practice, this trade in flexibility gives *nix allocators a small leg up performance-wise. It's very rare to see an app intentionally use multiple heaps on Windows -- mostly it's by accident due to different DLLs using different runtimes which each have their own malloc/free, and can cause a lot of headaches if you're not diligent in tracking which heap some memory came from.

answered Sep 20 '22 11:09

Cory Nelson

Related questions
                            
                                How to check if a key,value pair exists in a Dictionary
                            
                                Clean Code: Should Objects have public properties?
                            
                                Are there any implementations of multiset for .Net?
                            
                                Optimizing queries for the next and previous element
                            
                                Is there priority queue data structure implementation in Ruby's standard library?
                            
                                check if a tree is a binary search tree
                            
                                QuickCheck: Arbitrary instances of nested data structures that generate balanced specimens
                            
                                Time Complexity for Java ArrayList
                            
                                How to find the number of different shortest paths between two vertices, in directed graph and with linear-time?
                            
                                How do I get the n-th element in a LinkedList<T>?
                            
                                Structs as keys in Go maps
                            
                                Why is Binary Search a divide and conquer algorithm?
                            
                                Represent directory tree as JSON
                            
                                Is there a way to avoid loops when adding to a list?
                            
                                The simplest algorithm for poker hand evaluation
                            
                                Linked List vs Vector
                            
                                Difference between backtracking and recursion?
                            
                                What's the difference between () vs [] vs {}?
                            
                                creating an array of structs in c++
                            
                                Why is O(n) better than O( nlog(n) )?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What data structure is used to implement the dynamic memory allocation heap?

Tags:

memory-management

heap-memory

data-structures

What I'm not asking:

user541686

People also ask

1 Answers

Cory Nelson

Recent Activity

Donate For Us