How does C++ STL unordered_map resolve collisions?

Tags:

Looking at the http://www.cplusplus.com/reference/unordered_map/unordered_map/, it says "Unique keys No two elements in the container can have equivalent keys."

That should mean that the container is indeed resolving collisions. However, that page does not tell me how it is doing it. I know some ways to resolve collisions like using linked lists and/or probing. What I want to know is how the c++ STL unordered_map is resolving it.

980

asked Feb 03 '14 02:02

whiteSkar

1 Answers

The standard defines a little more about this than most people seem to realize.

Specifically, the standard requires (§23.2.5/9):

The elements of an unordered associative container are organized into buckets. Keys with the same hash code appear in the same bucket.

The interface includes a bucket_count that runs in constant time. (table 103). It also includes a bucket_size that has to run in time linear on the size of the bucket.

That's basically describing an implementation that uses collision chaining. When you do use collision chaining, meeting all the requirements is somewhere between easy and trivial. bucket_count() is the number of elements in your array. bucket_size() is the number of elements in the collision chain. Getting them in constant and linear time respectively is simple and straightforward.

By contrast, if you use something like linear probing or double hashing, those requirements become all but impossible to meet. Specifically, all the items that hashed to a specific value need to land in the same bucket, and you need to be able to count those buckets in constant time.

But, if you use something like linear probing or double hashing, finding all the items that hashed to the same value means you need to hash the value, then walk through the "chain" of non-empty items in your table to find how many of those hashed to the same value. That's not linear on the number of items that hashed to the same value though--it's linear on the number of items that hashed to the same or a colliding value.

With enough extra work and a fair amount of stretching the meaning of some of the requirements almost to the breaking point, it might be barely possible to create a hash table using something other than collision chaining, and still at least sort of meet the requirements--but I'm not really certain it's possible, and it would certain involve quite a lot of extra work.

Summary: all practical implementations of std::unordered_set (or unordered_map) undoubtedly use collision chaining. While it might be (just barely) possible to meet the requirements using linear probing or double hashing, such an implementation seems to lose a great deal and gain nearly nothing in return.

answered Sep 20 '22 16:09

Jerry Coffin

Related questions
                            
                                How to convert a single char into an int [duplicate]
                            
                                Is there an implicit default constructor in C++?
                            
                                How do you initialise a dynamic array in C++?
                            
                                C++ preprocessor: avoid code repetition of member variable list
                            
                                Are C++ Templates just Macros in disguise?
                            
                                OpenCV undistortPoints and triangulatePoint give odd results (stereo)
                            
                                Specifying a concept for a type that has a member function template using Concepts Lite
                            
                                `std::variant` vs. inheritance vs. other ways (performance)
                            
                                Performance difference between Windows and Linux using Intel compiler: looking at the assembly
                            
                                In a lambda, how reference is being captured by value
                            
                                Difference between rdtscp, rdtsc : memory and cpuid / rdtsc?
                            
                                How to fix: /usr/lib/libstdc++.so.6: version `GLIBCXX_3.4.15' not found
                            
                                How can I build a C++ project with multiple interdependent subdirectories?
                            
                                When and how to use GCC's stack protection feature?
                            
                                How do I call C++/CLI from C#?
                            
                                Why use QVector(Qt) instead of std::vector
                            
                                Programming languages that compile into C/C++ source? [closed]
                            
                                C++14 Variable Templates: what is their purpose? Any usage example?
                            
                                default override of virtual destructor
                            
                                What exactly is va_end for? Is it always necessary to call it?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does C++ STL unordered_map resolve collisions?

Tags:

c++

stl

unordered-map

whiteSkar

People also ask

1 Answers

Jerry Coffin

Recent Activity

Donate For Us