How does malloc work in a multithreaded environment?

Tags:

Does the typical malloc (for x86-64 platform and Linux OS) naively lock a mutex at the beginning and release it when done, or does it lock a mutex in a more clever way at a finer level, so that lock contention is reduced? If it indeed does it the second way, how does it do it?

510

asked May 22 '12 16:05

pythonic

2 Answers

glibc 2.15 operates multiple allocation arenas. Each arena has its own lock. When a thread needs to allocate memory, malloc() picks an arena, locks it, and allocates memory from it.

The mechanism for choosing an arena is somewhat elaborate and is aimed at reducing lock contention:

/* arena_get() acquires an arena and locks the corresponding mutex.    First, try the one last locked successfully by this thread.  (This    is the common case and handled with a macro for speed.)  Then, loop    once over the circularly linked list of arenas.  If no arena is    readily available, create a new one.  In this latter case, `size'    is just a hint as to how much memory will be required immediately    in the new arena. */

With this in mind, malloc() basically looks like this (edited for brevity):

  mstate ar_ptr;   void *victim;    arena_lookup(ar_ptr);   arena_lock(ar_ptr, bytes);   if(!ar_ptr)     return 0;   victim = _int_malloc(ar_ptr, bytes);   if(!victim) {     /* Maybe the failure is due to running out of mmapped areas. */     if(ar_ptr != &main_arena) {       (void)mutex_unlock(&ar_ptr->mutex);       ar_ptr = &main_arena;       (void)mutex_lock(&ar_ptr->mutex);       victim = _int_malloc(ar_ptr, bytes);       (void)mutex_unlock(&ar_ptr->mutex);     } else {       /* ... or sbrk() has failed and there is still a chance to mmap() */       ar_ptr = arena_get2(ar_ptr->next ? ar_ptr : 0, bytes);       (void)mutex_unlock(&main_arena.mutex);       if(ar_ptr) {         victim = _int_malloc(ar_ptr, bytes);         (void)mutex_unlock(&ar_ptr->mutex);       }     }   } else     (void)mutex_unlock(&ar_ptr->mutex);    return victim;

This allocator is called ptmalloc. It is based on earlier work by Doug Lea, and is maintained by Wolfram Gloger.

102

answered Sep 21 '22 11:09

NPE

Doug Lea's malloc used coarse locking (or no locking, depending on the configuration settings), where every call to malloc/realloc/free is protected by a global mutex. This is safe but can be inefficient in highly multithreaded environments.

ptmalloc3, which is the default malloc implementation in the GNU C library (libc) used on most Linux systems these days, has a more fine-grained strategy, as described in aix's answer, which allows multiple threads to concurrently allocate memory safely.

nedmalloc is another independent implementation which claims even better multithreaded performance than ptmalloc3 and various other allocators. I don't know how it works, and there doesn't seem to be any obvious documentation, so you'll have to check the source code to see how it works.

answered Sep 19 '22 11:09

Adam Rosenfield

Related questions
                            
                                What is the difference between far pointers and near pointers?
                            
                                C: using clock() to measure time in multi-threaded programs
                            
                                Difference between static in C and static in C++??
                            
                                How clear gdb command screen?
                            
                                Cumulative Normal Distribution Function in C/C++
                            
                                C: How do you declare a recursive mutex with POSIX threads?
                            
                                Printing leading zeroes for hexadecimal in C
                            
                                How define an array of function pointers in C
                            
                                undefined reference to curl_global_init, curl_easy_init and other function(C)
                            
                                How do I convert a Python list into a C array by using ctypes?
                            
                                Determining to which function a pointer is pointing in C?
                            
                                Zero an array in C code [duplicate]
                            
                                How to add two numbers without using ++ or + or another arithmetic operator
                            
                                Can XOR of two integers go out of bounds?
                            
                                Java - C-Like Fork?
                            
                                Writing a "real" interactive terminal program like vim, htop, ... in C/C++ without ncurses
                            
                                Anonymous functions using GCC statement expressions
                            
                                How do I print a #defined constant in GDB?
                            
                                Declaring and initializing arrays in C
                            
                                Why memory functions such as memset, memchr... are in string.h, but not in stdlib.h with another mem functions?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does malloc work in a multithreaded environment?

Tags:

c

linux

malloc

gcc

x86-64

pythonic

People also ask

2 Answers

NPE

Adam Rosenfield

Recent Activity

Donate For Us