
How to control memory allocation strategy in third party library code?

Tags:

c++

new-operator

delete-operator

Previous header: "Must I replace global operators new and delete to change memory allocation strategy in third party code?"

Short story: We need to replace the memory allocation technique in a third-party library without changing its source code.

Long story:

Consider a memory-bound application that makes huge dynamic allocations (perhaps almost all available system memory). We use specialized allocators, and we use them everywhere (shared_ptr's, containers, etc.). We have total control over every single byte of memory allocated in our application.

Also, we need to link against a third-party helper library. That nasty guy makes allocations in some standard way, using the default operators new, new[], delete and delete[], or malloc, or something else non-standard (let's generalize and say that we don't know how this library manages its heap allocation).

If this helper library makes allocations that are big enough, we can get HDD thrashing, memory fragmentation and alignment issues, out-of-memory bad_allocs and all sorts of other problems.

We cannot (or do not want to) change the library source code.

First attempt:

We have never had such unholy "hacks" in release builds before. A first test with an overridden operator new works fine, except that:

- we do not know what gotchas await us in the future (and this is awful)
- our users (and even our allocators) now have to allocate the same way that we do

Questions:

1. Are there ways to hook these allocations without overloading the global operators? (local lib-only hooks?)
2. ...and what if we don't know what exactly it uses: malloc or new?
3. Is this list of signatures complete? (and are there no other things that we must implement):

       void* operator new (std::size_t size) throw (std::bad_alloc);
       void* operator new (std::size_t size, const std::nothrow_t& nothrow_value) throw();
       void* operator new (std::size_t size, void* ptr) throw();
       void* operator new[] (std::size_t size) throw (std::bad_alloc);
       void* operator new[] (std::size_t size, const std::nothrow_t& nothrow_value) throw();
       void* operator new[] (std::size_t size, void* ptr) throw();

       void operator delete (void* ptr) throw();
       void operator delete (void* ptr, const std::nothrow_t& nothrow_constant) throw();
       void operator delete (void* ptr, void* voidptr2) throw();
       void operator delete[] (void* ptr) throw();
       void operator delete[] (void* ptr, const std::nothrow_t& nothrow_constant) throw();
       void operator delete[] (void* ptr, void* voidptr2) throw();

4. Is anything different if that library is dynamic?

Edit #1

A cross-platform solution is preferable if possible (it looks like it is not very possible). If not, our major platforms are:

- Windows x86/x64 (msvc 10)
- Linux x86/x64 (gcc 4.6)

Edit #2

Almost 2 years have passed and a few OS and compiler versions have come out since then, so I am curious whether there is something new and unexplored in this area. Any standard proposals? OS specifics? Hacks? How do you write memory-thirsty applications today? Please share your experience.


asked May 04 '13 19:05

Ivan Aksamentov - Drop

2 Answers

Ugh, my sympathy. This is going to depend a lot on your compiler, your libc, etc. Some rubber-meets-road strategies that have "worked" to varying degrees for us in the past (/me braces for downvotes) are:

- The operator new / operator delete overloads you suggested -- although note that some compilers are picky about not having throw() specs, some really want them, some want them for new but not for delete, etc. (I have a giant platform-specific #if/#elif block for all of the 4+ platforms we're working on now).
- Also worth noting: you can generally ignore the placement versions, they don't allocate.
- Look at __malloc_hook and friends -- note that these are deprecated and have thread race conditions -- but they're nice in that new/delete tend to be implemented in terms of malloc (but not always).
- Providing a replacement malloc, calloc, realloc, and free and getting your linker args in the right order so that the overrides take place (this is what gcc recommends these days, although I've had situations where it was impossible to do, and I had to use the deprecated __malloc_hook) -- again, new and delete tend to be implemented in terms of these, but not always.
- Avoiding all the standard allocation methods (operator new, malloc, etc.) in "our code" and using custom functions instead -- not very easy with an existing codebase.
- Tracking down the library author and delivering a ~~savage beating~~ polite request or patch to change their library to allow you to specify a different allocator (it may be faster than doing this yourself) -- I think this has led to a cardinal rule of "client always specifies the allocator or does the allocation" with any libraries I write.
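As a rough illustration of the first bullet, here is a minimal sketch of replacing the global allocation functions. It assumes a C++11 or later compiler (so `noexcept` instead of the old `throw()` specs, which is not what msvc 10 or gcc 4.6 in C++03 mode would accept). The names `my_alloc`, `my_free` and `live_blocks` are made up for the example and simply forward to `malloc`/`free`; a real build would call its own allocator there. The placement forms are left alone, since they do not allocate.

```cpp
#include <atomic>
#include <cstddef>
#include <cstdlib>
#include <new>

// Hypothetical bookkeeping; a real build would call its own allocator here.
static std::atomic<std::size_t> g_live_blocks{0};

static void* my_alloc(std::size_t size) noexcept {
    // operator new must return a distinct pointer even for a zero-size request.
    void* p = std::malloc(size != 0 ? size : 1);
    if (p != nullptr) g_live_blocks.fetch_add(1, std::memory_order_relaxed);
    return p;
}

static void my_free(void* p) noexcept {
    if (p == nullptr) return;  // deleting a null pointer is a no-op
    g_live_blocks.fetch_sub(1, std::memory_order_relaxed);
    std::free(p);
}

// Number of blocks currently handed out through the replaced operators.
std::size_t live_blocks() noexcept {
    return g_live_blocks.load(std::memory_order_relaxed);
}

// Throwing forms.
void* operator new(std::size_t size) {
    if (void* p = my_alloc(size)) return p;
    throw std::bad_alloc();  // sketch only: the new_handler retry loop is omitted
}
void* operator new[](std::size_t size) { return ::operator new(size); }

// Non-throwing forms.
void* operator new(std::size_t size, const std::nothrow_t&) noexcept { return my_alloc(size); }
void* operator new[](std::size_t size, const std::nothrow_t&) noexcept { return my_alloc(size); }

// Matching deallocation functions, including the C++14 sized forms.
void operator delete(void* p) noexcept { my_free(p); }
void operator delete[](void* p) noexcept { my_free(p); }
void operator delete(void* p, const std::nothrow_t&) noexcept { my_free(p); }
void operator delete[](void* p, const std::nothrow_t&) noexcept { my_free(p); }
void operator delete(void* p, std::size_t) noexcept { my_free(p); }
void operator delete[](void* p, std::size_t) noexcept { my_free(p); }
```

Because these definitions replace the operators for the whole program, any statically linked library code that goes through `new`/`delete` ends up in `my_alloc`/`my_free` as well. Code that calls `malloc` directly, or that defines its own class-level operators, is not affected by this.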

Please note that this is not an answer in terms of what the standards say should happen, just my experience. I've worked with more than a few buggy/broken compilers and libc implementations in the past, so YMMV. I also have the luxury of working on fairly "sealed systems", and not being all that worried about portability for any specific application.

Regarding dynamic libraries: I'm currently in a bit of a pinch in this regard myself; our "app" gets loaded as a dynamic .so and we have to be pretty careful to pass any delete/free requests back to the default allocator if they didn't come from us. The current solution is to just cordon off our allocations to a specific area: if we get a delete/free from within that address range, we dispatch to our handler, otherwise back to the default... I've even toyed with (horrors) the idea of checking the caller address to see if it's in our address space. (The probability of going boom increases with such hacks, though.)
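The address-range idea can be sketched roughly as follows, assuming C++11. `Arena`, `Route` and `routed_free` are hypothetical names invented for this example, and the arena is a trivial bump allocator that never reuses memory; the point is only the `owns()` check that decides whether a pointer goes to our handler or back to the default `free`.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstdlib>

// Hypothetical bump arena: every block we hand out lies inside [base_, base_ + size_).
class Arena {
public:
    explicit Arena(std::size_t bytes)
        : base_(static_cast<unsigned char*>(std::malloc(bytes))),
          size_(base_ != nullptr ? bytes : 0),
          used_(0),
          released_(0) {}
    ~Arena() { std::free(base_); }
    Arena(const Arena&) = delete;
    Arena& operator=(const Arena&) = delete;

    void* allocate(std::size_t n) {
        const std::size_t align = alignof(std::max_align_t);
        if (n == 0) n = 1;
        if (n > size_ - used_) return nullptr;
        // Round up so that every returned block stays maximally aligned.
        const std::size_t rounded = (n + align - 1) / align * align;
        if (rounded > size_ - used_) return nullptr;
        void* p = base_ + used_;
        used_ += rounded;
        return p;
    }

    // True if p points into the cordoned-off region.
    bool owns(const void* p) const {
        const std::uintptr_t addr = reinterpret_cast<std::uintptr_t>(p);
        const std::uintptr_t lo = reinterpret_cast<std::uintptr_t>(base_);
        return addr >= lo && addr < lo + size_;
    }

    // A bump arena cannot free individual blocks; we only count the request.
    void release(void*) { ++released_; }
    std::size_t released() const { return released_; }

private:
    unsigned char* base_;
    std::size_t size_;
    std::size_t used_;
    std::size_t released_;
};

enum class Route { Ours, Default };

// Dispatch a free request: ours if it falls inside the arena, otherwise the default allocator.
Route routed_free(Arena& arena, void* p) {
    if (arena.owns(p)) {
        arena.release(p);
        return Route::Ours;
    }
    std::free(p);
    return Route::Default;
}
```

In a real override the body of `routed_free` would live inside the replaced `free` or `operator delete`, and the fallback would have to reach the original implementation rather than recurse into the override.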

This may be a useful strategy even if you are the process lead and you're using an outside library: tag or restrict or otherwise identify your own allocs somehow (even going so far as to keep a list of allocs you know about), and then pass on any unknowns. All of this has ugly side-effects and limitations, though.
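The "keep a list of allocs you know about" variant might look something like this sketch (C++11 assumed). `AllocRegistry` is a hypothetical name, and the backing allocator is plain `malloc`/`free` for brevity. One caveat: if this were called from inside a replaced global `operator new`, the `std::unordered_set` would itself allocate through that operator, so a real implementation would need a container with its own allocator.

```cpp
#include <cstddef>
#include <cstdlib>
#include <mutex>
#include <new>
#include <unordered_set>

// Hypothetical registry of pointers that we allocated ourselves.
class AllocRegistry {
public:
    void* allocate(std::size_t n) {
        void* p = std::malloc(n != 0 ? n : 1);
        if (p == nullptr) return nullptr;
        try {
            std::lock_guard<std::mutex> lock(mutex_);
            known_.insert(p);
        } catch (const std::bad_alloc&) {
            std::free(p);  // could not record the block, so do not hand it out
            return nullptr;
        }
        return p;
    }

    // Frees p and returns true if it is one of ours; otherwise leaves it
    // untouched and returns false so the caller can pass it on.
    bool release(void* p) {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            if (known_.erase(p) == 0) return false;
        }
        std::free(p);
        return true;
    }

    std::size_t live() const {
        std::lock_guard<std::mutex> lock(mutex_);
        return known_.size();
    }

private:
    mutable std::mutex mutex_;
    std::unordered_set<void*> known_;
};
```

Compared with an address-range check this costs a lock and a hash lookup per free, but it does not require our blocks to sit in one contiguous region.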

(Looking forward to other answers!)

answered Sep 19 '22 06:09

leander

Without being able to modify the library's source code - or, better, being able to influence the author of the library to modify it - I'd say you're out of luck.

There are some things the library can potentially do (even unintentionally) to make it immune to any strategy you might employ or, in the worst case, to make the library or your program unstable once you apply one. Examples include using its own custom allocators, providing its own versions of the global operator new() and operator delete(), and overriding those operators in individual classes.
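To make the last case concrete, here is a small sketch (C++11 assumed) of a class-level override bypassing a replaced global operator. `LibraryWidget`, `global_new_calls` and `global_calls_for_widget` are hypothetical names used only for this illustration.

```cpp
#include <cstddef>
#include <cstdlib>
#include <new>

// Our replacement of the global operators, with a counter so we can see when it runs.
static int g_global_new_calls = 0;

int global_new_calls() { return g_global_new_calls; }

void* operator new(std::size_t n) {
    ++g_global_new_calls;
    void* p = std::malloc(n != 0 ? n : 1);
    if (p == nullptr) throw std::bad_alloc();
    return p;
}
void operator delete(void* p) noexcept { std::free(p); }
void operator delete(void* p, std::size_t) noexcept { std::free(p); }

// Stand-in for a type inside the third-party library that brings its own operators.
struct LibraryWidget {
    static void* operator new(std::size_t n) {
        void* p = std::malloc(n);
        if (p == nullptr) throw std::bad_alloc();
        return p;
    }
    static void operator delete(void* p) noexcept { std::free(p); }
    int payload = 0;
};

// How many times the global operator new ran while a LibraryWidget was created.
int global_calls_for_widget() {
    const int before = global_new_calls();
    LibraryWidget* w = new LibraryWidget;  // resolved to LibraryWidget::operator new
    const int after = global_new_calls();
    delete w;
    return after - before;
}
```

Name lookup for `new LibraryWidget` finds the class's own `operator new` first, so the global replacement never sees that allocation.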

A strategy which would probably work is to work with the library vendor and make some modifications. The modifications (from your end) would amount to being able to initialise the library by specifying the allocators it uses. For the library the effort is potentially significant (having to touch all functions that dynamically allocate memory, that use standard containers, etc.) but not intractable: it has to use the supplied allocators (or sensible defaults) throughout its code.
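What such an initialisation interface could look like is sketched below, assuming C++11. Everything in the `helperlib` namespace is hypothetical and stands in for the vendor's library; `Stats`, `counting_alloc` and `counting_free` play the role of the client's allocator.

```cpp
#include <cstddef>
#include <cstdlib>

namespace helperlib {  // hypothetical third-party library

// Allocation callbacks supplied by the client, plus an opaque user pointer.
struct AllocHooks {
    void* (*allocate)(std::size_t bytes, void* user);
    void (*deallocate)(void* p, void* user);
    void* user;
};

namespace detail {
inline void* default_allocate(std::size_t n, void*) { return std::malloc(n); }
inline void default_deallocate(void* p, void*) { std::free(p); }

// Sensible defaults until the client calls initialise().
inline AllocHooks& hooks() {
    static AllocHooks h = {default_allocate, default_deallocate, nullptr};
    return h;
}
}  // namespace detail

// Must be called before the library allocates anything.
inline void initialise(const AllocHooks& h) { detail::hooks() = h; }

// Example of library code routing its allocations through the hooks.
inline char* make_buffer(std::size_t n) {
    AllocHooks& h = detail::hooks();
    return static_cast<char*>(h.allocate(n, h.user));
}
inline void destroy_buffer(char* p) {
    AllocHooks& h = detail::hooks();
    h.deallocate(p, h.user);
}

}  // namespace helperlib

// Client side: a counting allocator passed to the library.
struct Stats {
    int allocs;
    int frees;
};

void* counting_alloc(std::size_t n, void* user) {
    ++static_cast<Stats*>(user)->allocs;
    return std::malloc(n);
}

void counting_free(void* p, void* user) {
    ++static_cast<Stats*>(user)->frees;
    std::free(p);
}
```

A client would then call `helperlib::initialise({counting_alloc, counting_free, &stats})` once at startup, and every buffer the library creates afterwards goes through the client's functions.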

Unfortunately, that is at odds with your requirement to not modify the library - I am skeptical of the chances of satisfying that, particularly within the constraints you have outlined (memory-thirsty, hosted on Windows/Linux, etc.).

answered Sep 17 '22 06:09

Peter
