c++ - unordered_map complexity

Tags:

I need to create a lookup function where a (X,Y) pair corresponds to a specific Z value. One major requirement for this is that I need to do it in as close to O(1) complexity as I can. My plan is to use an unordered_map.

I generally do not use a hash table for lookup, as the lookup time has never been important to me. Am I correct in thinking that as long as I built the unordered_map with no collisions, my lookup time will be O(1)?

My concern then is what the complexity becomes if there the key is not present in the unordered map. If I use unordered_map::find():, for example, to determine whether a key is present in my hash table, how will it go about giving me an answer? Does it actually iterate over all the keys?

I greatly appreciate the help.

818

asked Mar 18 '13 06:03

user1764386

1 Answers

The standard more or less requires using buckets for collision resolution, which means that the actual look up time will probably be linear with respect to the number of elements in the bucket, regardless of whether the element is present or not. It's possible to make it O(lg N), but it's not usually done, because the number of elements in the bucket should be small, if the hash table is being used correctly.

To ensure that the number of elements in a bucket is small, you must ensure that the hashing function is effective. What effective means depends on the types and values being hashed. (The MS implementation uses FNV, which is one of the best generic hashs around, but if you have special knowledge of the actual data you'll be seeing, you might be able to do better.) Another thing which can help reduce the number of elements per bucket is to force more buckets or use a smaller load factor. For the first, you can pass the minimum initial number of buckets as an argument to the constructor. If you know the total number of elements that will be in the map, you can control the load factor this way. You can also forse a minumum number of buckets once the table has been filled, by calling rehash. Otherwise, there is a function std::unordered_map<>::max_load_factor which you can use. It is not guaranteed to do anything, but in any reasonable implementation, it will. Note that if you use it on an already filled unordered_map, you'll probably have to call unordered_map<>::rehash afterwards.

(There are several things I don't understand about the standard unordered_map: why the load factor is a float, instead of double; why it's not required to have an effect; and why it doesn't automatically call rehash for you.)

120

answered Oct 12 '22 14:10

James Kanze

Related questions
                            
                                What's the default value for a std::atomic?
                            
                                12 dominating knights puzzle (backtracking)
                            
                                large arrays, std::vector and stack overflow
                            
                                Is the front address of std::vector move invariant?
                            
                                Client in C++, use gethostbyname or getaddrinfo
                            
                                Can you `= delete` a templated function on a second declaration?
                            
                                C++ vim IDE. Things you'd need from it
                            
                                Conditional CXX_FLAGS using cmake based on compiler?
                            
                                C++: what are the most common vulnerabilities and how to avoid them?
                            
                                NaN ASCII I/O with Visual C++
                            
                                Recursive call in lambda (C++11) [duplicate]
                            
                                Online C++ compiler with input stream? [closed]
                            
                                Why does `basic_ios::swap` only do a partial swap?
                            
                                What type is in the range for loop?
                            
                                The Cost of thread_local
                            
                                Why is this code trying to call the copy constructor?
                            
                                Can you inline static member functions?
                            
                                std::unique_ptr with custom deleter for win32 LocalFree
                            
                                How to see all elements of a two dimensional array in Visual Studio 2010?
                            
                                Returning std::vector by value

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

c++ - unordered_map complexity

Tags:

c++

complexity-theory

hashtable

unordered-map

user1764386

People also ask

1 Answers

James Kanze

Recent Activity

Donate For Us