Hash table runtime complexity (insert, search and delete)

Why do I keep seeing different runtime complexities for these functions on a hash table?

On Wikipedia, search and delete are listed as O(n). (I thought the point of hash tables was constant-time lookup, so what's the point if search is O(n)?)

In some course notes from a while ago, I see a wide range of complexities depending on certain details including one with all O(1). Why would any other implementation be used if I can get all O(1)?

If I'm using standard hash tables in a language like C++ or Java, what can I expect the time complexity to be?

asked Feb 09 '12 by user1136342


People also ask

What is the time complexity to perform insert, remove, and search in a hash table?

Insertion and deletion: the hash key is computed in O(1) time, as always, and the corresponding location is accessed in O(1). Insertion: in the best case, the key points to a vacant location and the element is inserted directly into the hash table, so the overall complexity is O(1).
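
For illustration, here is a minimal Java sketch (class name hypothetical) exercising all three operations on java.util.HashMap, each O(1) on average:

    import java.util.HashMap;
    import java.util.Map;

    public class HashTableOps {
        public static void main(String[] args) {
            Map<String, Integer> table = new HashMap<>();

            table.put("alice", 30);                    // insert: O(1) average
            table.put("bob", 25);

            Integer age = table.get("alice");          // search: O(1) average
            System.out.println("alice -> " + age);

            table.remove("bob");                       // delete: O(1) average
            System.out.println("bob present? " + table.containsKey("bob"));
        }
    }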

What is the time complexity for search using a hash table?

Like arrays, hash tables provide constant-time O(1) lookup on average, regardless of the number of items in the table. The (hopefully rare) worst-case lookup time in most hash table schemes is O(n).

What is the time complexity of insertion, deletion, and searching in a hash map versus a self-balancing BST?

A hash table supports the following operations in Θ(1) average time: 1) search, 2) insert, 3) delete. The time complexity of the same operations in a self-balancing binary search tree (BST), such as a red-black tree, AVL tree, or splay tree, is O(log n).
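
In Java, these two cost models correspond to HashMap (a hash table) and TreeMap (a red-black tree). A minimal sketch for comparison (class name hypothetical):

    import java.util.HashMap;
    import java.util.Map;
    import java.util.TreeMap;

    public class HashVsTree {
        public static void main(String[] args) {
            Map<Integer, String> hash = new HashMap<>();  // hash table: Θ(1) average per op
            Map<Integer, String> tree = new TreeMap<>();  // red-black tree: O(log n) per op

            for (int i = 0; i < 1000; i++) {
                hash.put(i, "v" + i);
                tree.put(i, "v" + i);
            }

            // Same API, different cost model under the hood.
            System.out.println(hash.get(500));
            System.out.println(tree.get(500));
            hash.remove(500);
            tree.remove(500);
        }
    }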


1 Answer

Hash tables have O(1) average and amortized case complexity, but they suffer from O(n) worst-case time complexity. [And I think this is where your confusion is.]

Hash tables suffer from O(n) worst-case time complexity for two reasons:

  1. If too many elements hash to the same key, looking inside that key's bucket may take O(n) time (see the sketch after this list).
  2. Once a hash table exceeds its load factor, it has to rehash [create a new, bigger table and re-insert each element into it].
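
To make reason 1 concrete, here is a minimal separate-chaining table in Java (an illustrative sketch, not any standard library's implementation; all names are hypothetical). When every key hashes to the same bucket, get degenerates into a linear scan over all n entries:

    import java.util.LinkedList;

    // Minimal separate-chaining hash table -- illustrative only.
    public class ChainedTable<K, V> {
        private static class Entry<K, V> {
            final K key;
            V value;
            Entry(K key, V value) { this.key = key; this.value = value; }
        }

        private final LinkedList<Entry<K, V>>[] buckets;

        @SuppressWarnings("unchecked")
        ChainedTable(int capacity) {
            buckets = new LinkedList[capacity];
            for (int i = 0; i < capacity; i++) buckets[i] = new LinkedList<>();
        }

        private int index(K key) {
            return Math.floorMod(key.hashCode(), buckets.length);
        }

        void put(K key, V value) {
            for (Entry<K, V> e : buckets[index(key)]) {
                if (e.key.equals(key)) { e.value = value; return; }
            }
            buckets[index(key)].add(new Entry<>(key, value));
        }

        // If every key hashes to the same bucket, this loop scans up to n
        // entries -- the O(n) worst case described above.
        V get(K key) {
            for (Entry<K, V> e : buckets[index(key)]) {
                if (e.key.equals(key)) return e.value;
            }
            return null;
        }
    }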

However, it is said to be O(1) average and amortized because:

  1. It is very rare that many items hash to the same key [if you choose a good hash function and do not have too big a load factor].
  2. The rehash operation, which is O(n), can happen at most once per n/2 operations, all of which are assumed O(1). Thus, when you average the time per operation, you get (n * O(1) + O(n)) / n = O(1); the simulation after this list illustrates the arithmetic.
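
As a quick sanity check on that arithmetic, here is a small Java simulation (hypothetical; it counts work rather than storing elements) of the "double the capacity when full" strategy. The averaged work per insert stays a small constant:

    // Simulates "double the table when full" and counts total work:
    // n cheap inserts plus every element re-inserted on each rehash.
    public class AmortizedRehash {
        public static void main(String[] args) {
            long capacity = 8, size = 0, rehashWork = 0;
            final long n = 1_000_000;

            for (long i = 0; i < n; i++) {
                if (size == capacity) {   // table full: rehash all `size` elements
                    rehashWork += size;   // this single operation costs O(n)...
                    capacity *= 2;
                }
                size++;                   // ...but every other insert is O(1)
            }

            // Total moves stay within a small constant factor of n, so the
            // amortized cost per insert is O(1). Prints roughly 2.05 here.
            System.out.printf("average work per insert: %.2f%n",
                    (n + rehashWork) / (double) n);
        }
    }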

Note that because of the rehashing issue, real-time applications and applications that need low latency should not use a hash table as their data structure.

EDIT: Another issue with hash tables: cache
Another place you might see a performance loss with large hash tables is cache performance. Hash tables suffer from poor cache locality, so for large collections access times may grow, since the relevant part of the table has to be reloaded from main memory into the cache.
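
As a rough illustration only (a hypothetical sketch; timings depend on heap size, JIT warmup, and hardware, so don't read it as a rigorous benchmark), compare a sequential array scan with the same number of hash lookups in Java:

    import java.util.HashMap;
    import java.util.Map;

    public class CacheEffect {
        public static void main(String[] args) {
            final int n = 2_000_000;
            long[] array = new long[n];
            Map<Integer, Long> map = new HashMap<>();
            for (int i = 0; i < n; i++) {
                array[i] = i;
                map.put(i, (long) i);
            }

            long t0 = System.nanoTime();
            long sum1 = 0;
            for (int i = 0; i < n; i++) sum1 += array[i];   // sequential: cache-friendly
            long t1 = System.nanoTime();

            long sum2 = 0;
            for (int i = 0; i < n; i++) sum2 += map.get(i); // scattered: more cache misses
            long t2 = System.nanoTime();

            System.out.printf("array scan: %d ms, hash lookups: %d ms (sums: %d / %d)%n",
                    (t1 - t0) / 1_000_000, (t2 - t1) / 1_000_000, sum1, sum2);
        }
    }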

answered Sep 16 '22 by amit