I'm using the <code>std::unordered_map</code> from gnu++0x to store a huge amount of data. I want to pre-allocate space for the large number of elements, since I can bound the total space used. What I would like to be able to do is call: <pre class="prettyprint"><code>std::unordered_map m; m.resize(pow(2,x)); </code></pre> where x is known. <code>std::unordered_map</code> doesn't support this. I would rather use <code>std::unordered_map</code> if possible, since it will eventually be part of the standard. Some other constraints: Need reliable O(1) access and mutation of the map. The desired hash and comparison functions are already non-standard and somewhat expensive. O(log n) mutation (as with <code>std::map</code>) is too expensive. -> The expensive hash and comparison also make amortization-based growth way too expensive. Each extra insert requires O(n) operations from those functions, which results in an extra quadratic term in the algorithm's run time, since the exponential storage requirements need O(n) growths.

I would suggest writing your own allocator for the <code>std::unordered_map</code> that allocates memory exactly in the way you want.

Pre-allocating buckets in a C++ std::unordered_map

Tags:

buckets

I'm using the std::unordered_map from gnu++0x to store a huge amount of data. I want to pre-allocate space for the large number of elements, since I can bound the total space used.

What I would like to be able to do is call:

std::unordered_map m;
m.resize(pow(2,x));

where x is known.

std::unordered_map doesn't support this. I would rather use std::unordered_map if possible, since it will eventually be part of the standard.

Some other constraints:

Need reliable O(1) access and mutation of the map. The desired hash and comparison functions are already non-standard and somewhat expensive. O(log n) mutation (as with std::map) is too expensive.

-> The expensive hash and comparison also make amortization-based growth way too expensive. Each extra insert requires O(n) operations from those functions, which results in an extra quadratic term in the algorithm's run time, since the exponential storage requirements need O(n) growths.

851

asked May 05 '11 23:05

JAD

3 Answers

m.rehash(pow(2,x));

if pow(2, x) is the number of buckets you want preallocated. You can also:

m.reserve(pow(2,x));

but now pow(2, x) is the number of elements you are planning on inserting. Both functions do nothing but preallocate buckets. They don't insert any elements. And they are both meant to be used exactly for your use case.

Note: You aren't guaranteed to get exactly pow(2, x) buckets. Some implementations will use only a number of buckets which is a power of 2. Other implementations will use only a prime number of buckets. Still others will use only a subset of primes for the number of buckets. But in any case, the implementation should accept your hint at the number of buckets you desire, and then internally round up to its next acceptable number of buckets.

Here is the precise wording that the latest (N4660) uses to specify the argument to rehash:

a.rehash(n) : Postconditions: a.bucket_count() >= a.size() / a.max_load_factor() and a.bucket_count() >= n.

This postcondition ensures that bucket()_count() >= n, and that load_factor() remains less than or equal to max_load_factor().

Subsequently reserve(n) is defined in terms of rehash(n):

a.reserve(n) : Same as a.rehash(ceil(n / a.max_load_factor())).

183

answered Oct 11 '22 20:10

Howard Hinnant

I don't think it matters for an unordered map to have pre-allocated memory. The STL is expected to be O(n) amortized insertion time. Save yourself the hassle of writing your own allocator until you know this is the bottle neck of your code, in my opinion.

answered Oct 11 '22 18:10

Mike Lyons

I would suggest writing your own allocator for the std::unordered_map that allocates memory exactly in the way you want.

answered Oct 11 '22 18:10

orlp

Related questions
                            
                                Qt Signals and slot thread safety
                            
                                Should I use shared_ptr or unique_ptr? [duplicate]
                            
                                CMake add target for invoking clang analyzer
                            
                                How to efficiently display OpenCV video in Qt?
                            
                                What happens when using make_shared
                            
                                Option to force either 32-bit or 64-bit build with cmake
                            
                                Problems generating solution for VS 2017 with CMake
                            
                                Does std::unordered_map operator[] do zero-initialization for non-exisiting key?
                            
                                C++ : What's the easiest library to open video file
                            
                                Qt: meaning of slot return value?
                            
                                Dependency injection in C++
                            
                                Linux optimistic malloc: will new always throw when out of memory?
                            
                                Is there a way of disabling the old c style casts in c++ [duplicate]
                            
                                Parallel OpenMP loop with break statement
                            
                                Is it possible to avoid repeating the class name in the implementation file?
                            
                                Client to Server Authentication in C++ using sockets
                            
                                How to use googletest Failures into Break-Points
                            
                                Boost Log 2.0 : empty Severity level in logs
                            
                                Obscure C++ operator overloading
                            
                                Weird linker error with static std::map

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With