Consider this code. I reserve 6 spots for an unordered_map and insert 6 elements. Afterwards, there are 7 buckets. Why is this? The max_load_factor is 1 and there are enough buckets for the number of elements I insert. <pre class="prettyprint"><code>#include <iostream> #include <unordered_map> using namespace std; int main () { unordered_map<std::string,std::string> mymap = { {"house","maison"}, {"apple","pomme"}, {"tree","arbre"}, {"book","livre"}, {"door","porte"}, {"grapefruit","pamplemousse"} }; unordered_map<std::string,std::string> mymap2; // THIS ONE!!! mymap2.reserve(6); for (auto i:mymap) { mymap2[i.first] = i.second; } std::cout << "max_load factor " << mymap2.max_load_factor() << " mymap has " << mymap2.bucket_count() << " buckets.\n"; for (unsigned i=0; i<mymap2.bucket_count(); ++i) { cout << "bucket #" <first << ":" << it->second << "] "; cout << endl; } return 0; } </code></pre> Output: <pre class="prettyprint"><code>max_load factor 1 mymap has 7 buckets. bucket #0 contains: bucket #1 contains: [book:livre] bucket #2 contains: [tree:arbre] bucket #3 contains: [house:maison] [grapefruit:pamplemousse] bucket #4 contains: bucket #5 contains: [door:porte] bucket #6 contains: [apple:pomme] </code></pre>

The cplusplus.com website gives this explanation: <pre class="prettyprint lang-cpp prettyprint-override"><code>void reserve (size_type n); </code></pre> <blockquote> Request a capacity change Sets the number of buckets in the container (<code>bucket_count</code>) to the most appropriate to contain at least n elements. If n is greater than the current <code>bucket_count</code> multiplied by the <code>max_load_factor</code>, the container's <code>bucket_count</code> is increased and a rehash is forced. If n is lower than that, the function may have no effect. </blockquote> At the time you declare your <code>unordered_map</code> variable, it has a <code>bucket_count</code> of <code>1</code> and a <code>max_load_factor</code> of <code>1</code>. Then you <code>reserve</code> <code>6</code> buckets which is greater than <code>max_load_factor</code> multiplied by <code>bucket_count</code> According to this definition, the behavior is, in my humble opinion, correct. I added at line <code>17</code> of your code the following line to show the <code>bucket_count</code> before the <code>reserve</code>and indeed, it is <code>1</code> <pre class="prettyprint lang-cpp prettyprint-override"><code> std::cout << "BEFORE RESERVE max_load factor " << mymap2.max_load_factor() << " mymap has " << mymap2.bucket_count() << " buckets.\n"; </code></pre> The display is as follows: <pre class="prettyprint lang-sh prettyprint-override"><code>BEFORE RESERVE max_load factor 1 mymap has 1 buckets. </code></pre> After the reserve: <pre class="prettyprint lang-sh prettyprint-override"><code>AFTER RESERVE max_load factor 1 mymap has 7 buckets. </code></pre> Thus the behavior is normal in my humble opinion.

Hash table implementations tend to pick between two desirable qualities: <ul> <li> maintaining a power-of-two <code>bucket_count()</code> (i.e. rounding any value passed to reserve up to the next power of two if necessary), so <ul> <li> a <code>size_t</code> value returned by the hash function can be mapped onto the range of buckets using a 1-CPU-cycle bitwise AND operation (e.g. 8 buckets -> hash value ANDed with 7); </li> <li> this has the undesirable effect of chopping off the high-order bits from the hash value, so they don't help randomise the bucket placement </li> <li> Visual C++ does this </li> </ul> </li> <li> maintaining a prime <code>bucket_count()</code> <ul> <li> this has the extremely desirable side-effect of having high-order bits in the hash value affect the bucket selection, so lower-quality (i.e. faster) hash functions still often manage a more equal, less-collision/clustering-prone, bucket placement </li> <li> implemented naively, this forces the compiler to do a mod ("%") operation by a runtime-variable <code>bucket_count()</code>, which may take e.g. 40-90 CPU cycles, depending on the CPU. A faster alternative is use the index into the table of prime numbers used when sizing the hash table to switch into a mod operation by that hard-coded constant prime value, so the compiler can try to optimise the mod using bit shifts, subtractions or additions, or multiplications if that's necessary (you can see the kinds of optimisations possible in this snippet on godbolt) </li> <li> GCC does this; I think clang does too. </li> </ul> </li> </ul> So, summarily - when you ask for 6 buckets, GCC or clang will increase that to some prime - not necessarily the next one, but it appears that's happened in this case - to reduce the collision-proneness when you later insert elements.

Why does unordered_map increase in size when it has enough buckets due to "reserve"?

Tags:

c++

hashtable

unordered-map

c++14

Consider this code. I reserve 6 spots for an unordered_map and insert 6 elements. Afterwards, there are 7 buckets. Why is this? The max_load_factor is 1 and there are enough buckets for the number of elements I insert.

#include <iostream>
#include <unordered_map>
using namespace std;

int main () {
  unordered_map<std::string,std::string> mymap = { 
            {"house","maison"},
            {"apple","pomme"},
            {"tree","arbre"},
            {"book","livre"},
            {"door","porte"},
            {"grapefruit","pamplemousse"}
  };

    unordered_map<std::string,std::string> mymap2; // THIS ONE!!!
    mymap2.reserve(6);
    for (auto i:mymap) {
        mymap2[i.first] = i.second;
    }


    std::cout << "max_load factor " << mymap2.max_load_factor() << " mymap has " << mymap2.bucket_count() << " buckets.\n";

      for (unsigned i=0; i<mymap2.bucket_count(); ++i) {
        cout << "bucket #" << i << " contains: ";
        for (auto it = mymap2.begin(i); it!=mymap2.end(i); ++it)
            cout << "[" << it->first << ":" << it->second << "] ";
          cout << endl;
      }

  return 0;
}

Output:

max_load factor 1 mymap has 7 buckets.
bucket #0 contains: 
bucket #1 contains: [book:livre] 
bucket #2 contains: [tree:arbre] 
bucket #3 contains: [house:maison] [grapefruit:pamplemousse] 
bucket #4 contains: 
bucket #5 contains: [door:porte] 
bucket #6 contains: [apple:pomme]

276

asked Jan 29 '21 06:01

JobHunter69

2 Answers

The cplusplus.com website gives this explanation:

void reserve (size_type n);

Request a capacity change

Sets the number of buckets in the container (bucket_count) to the most appropriate to contain at least n elements.

If n is greater than the current bucket_count multiplied by the max_load_factor, the container's bucket_count is increased and a rehash is forced.

If n is lower than that, the function may have no effect.

At the time you declare your unordered_map variable, it has a bucket_count of 1 and a max_load_factor of 1. Then you reserve 6 buckets which is greater than max_load_factor multiplied by bucket_count

According to this definition, the behavior is, in my humble opinion, correct.

I added at line 17 of your code the following line to show the bucket_count before the reserveand indeed, it is 1

 std::cout << "BEFORE RESERVE max_load factor " << mymap2.max_load_factor() << " mymap has " << mymap2.bucket_count() << " buckets.\n";

The display is as follows:

BEFORE RESERVE max_load factor 1 mymap has 1 buckets.

After the reserve:

AFTER RESERVE max_load factor 1 mymap has 7 buckets.

Thus the behavior is normal in my humble opinion.

132

answered Nov 09 '22 04:11

Pat. ANDRIA

Hash table implementations tend to pick between two desirable qualities:

maintaining a power-of-two bucket_count() (i.e. rounding any value passed to reserve up to the next power of two if necessary), so
- a size_t value returned by the hash function can be mapped onto the range of buckets using a 1-CPU-cycle bitwise AND operation (e.g. 8 buckets -> hash value ANDed with 7);
- this has the undesirable effect of chopping off the high-order bits from the hash value, so they don't help randomise the bucket placement
- Visual C++ does this
maintaining a prime bucket_count()
- this has the extremely desirable side-effect of having high-order bits in the hash value affect the bucket selection, so lower-quality (i.e. faster) hash functions still often manage a more equal, less-collision/clustering-prone, bucket placement
- implemented naively, this forces the compiler to do a mod ("%") operation by a runtime-variable bucket_count(), which may take e.g. 40-90 CPU cycles, depending on the CPU. A faster alternative is use the index into the table of prime numbers used when sizing the hash table to switch into a mod operation by that hard-coded constant prime value, so the compiler can try to optimise the mod using bit shifts, subtractions or additions, or multiplications if that's necessary (you can see the kinds of optimisations possible in this snippet on godbolt)
- GCC does this; I think clang does too.

So, summarily - when you ask for 6 buckets, GCC or clang will increase that to some prime - not necessarily the next one, but it appears that's happened in this case - to reduce the collision-proneness when you later insert elements.

answered Nov 09 '22 03:11

Tony Delroy

Related questions
                            
                                return type deduction of recursive function
                            
                                C++20 string literal template argument working example
                            
                                Can lambdas be used as non-type template parameter?
                            
                                Is there a function to load a non-atomic value atomically?
                            
                                code doesn't compile with a specific name function
                            
                                Why does the output_iterator Concept not require the output_iterator_tag?
                            
                                Should conversion operators be considered for function template argument deduction?
                            
                                How to compose generators with STL algorithms
                            
                                How does std::optional construct std::variant in place from initializer list?
                            
                                C++20 coroutines, std return type and state persistancy
                            
                                std::optional: Not participating in overload resolution vs. being defined as deleted
                            
                                Find a pointer T* in std::unordered_set<std::unique_ptr> (C++20)
                            
                                Access checking rules for template argument list in (particularly explicit) specializations
                            
                                How to use ranges::sort for ascending or descending sort controlled by a boolean
                            
                                Transforming an array of int to integer_sequence
                            
                                Rewritten comparison operators and expression templates
                            
                                Template argument deduction for implicit pair
                            
                                SFINAE code to detect whether operator<< is implemented for a type behaves differently on Clang and gcc [duplicate]
                            
                                Is this ray tracing function that runs on the GPU, GPU safe?
                            
                                Why are my C++ binary built with -LTO so very large?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With