Is a huge array faster than a hash-map for look-up?

Tags:

algorithm

I'm receiving "order updates" from a stock exchange. Each order id is between 1 and 100 000 000, so I can use a 100-million-element array to store the orders, and when an update is received I can look the order up very fast just by indexing into the array with array[orderId]. I will spend several gigabytes of memory, but this is OK.

Alternatively I can use a hashmap, and because at any moment the number of "active" orders is limited (to, very roughly, 100 000), look-up will be pretty fast too, but probably a little bit slower than the array.

The question is: will the hashmap actually be slower? Is it reasonable to create a 100-million-element array?

I need low latency and nothing else; I completely don't care about memory. What should I choose?
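For concreteness, here is a minimal sketch of the two options I'm comparing (the `Order` struct and all names here are made up; the real payload doesn't matter):

```cpp
#include <cstdint>
#include <unordered_map>
#include <vector>

// Hypothetical order record; the actual fields are irrelevant to the comparison.
struct Order {
    std::int64_t price;
    std::int32_t quantity;
};

constexpr std::size_t kMaxOrderId = 100'000'000;

// Option 1: direct indexing. All ~100M slots are allocated up front
// (~1.6 GB here); a lookup is one address computation plus one memory access.
std::vector<Order> orders_by_id(kMaxOrderId + 1);

Order* lookup_array(std::int32_t order_id) {
    return &orders_by_id[order_id];
}

// Option 2: hashmap keyed by order id. Only the ~100K live orders are stored,
// but every lookup pays for hashing plus probing/chaining.
std::unordered_map<std::int32_t, Order> orders_map;

Order* lookup_map(std::int32_t order_id) {
    auto it = orders_map.find(order_id);
    return it == orders_map.end() ? nullptr : &it->second;
}
```

So the array pays in cache/TLB misses over a huge, sparsely touched region, while the map pays in hashing and probing over a small, hot one.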

asked Jun 23 '13 by Oleg Vazhnev




1 Answer

Whenever considering performance issues, one experiment is worth a thousand expert opinions. Test it!
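For example, a harness along these lines (all names are illustrative; ids are drawn at random so the hardware prefetcher doesn't flatter either structure) will give you a number for your actual hardware:

```cpp
#include <chrono>
#include <cstdint>
#include <cstdio>
#include <random>
#include <vector>

// Time a batch of random lookups and report nanoseconds per lookup.
template <typename Lookup>
double ns_per_lookup(Lookup&& lookup, const std::vector<std::int32_t>& ids) {
    volatile std::int64_t sink = 0;  // keeps the compiler from deleting the loop
    auto start = std::chrono::steady_clock::now();
    for (auto id : ids) sink = sink + lookup(id);
    auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::nano>(stop - start).count()
           / ids.size();
}

int main() {
    constexpr std::size_t kMaxId = 100'000'000;
    std::vector<std::int32_t> table(kMaxId + 1);  // stand-in for the order array

    // One million random ids to look up.
    std::mt19937 rng(42);
    std::uniform_int_distribution<std::int32_t>
        dist(1, static_cast<std::int32_t>(kMaxId));
    std::vector<std::int32_t> ids(1'000'000);
    for (auto& id : ids) id = dist(rng);

    double ns = ns_per_lookup([&](std::int32_t id) { return table[id]; }, ids);
    std::printf("array: %.1f ns/lookup\n", ns);
}
```

Run the same harness against a `std::unordered_map` populated with your ~100 000 live orders and compare the two numbers, ideally with a realistic id distribution; skew in the ids can change the outcome considerably.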

That said, I'll take a wild stab in the dark: if you can convince your OS to keep your multi-gigabyte array resident in physical memory (this isn't necessarily easy; consider looking at the mlock and munlock syscalls), the array will likely perform better. Any gain you notice (should one exist) will come mostly from bypassing the cost of the hash function and avoiding the overheads of whichever collision-resolution and memory-allocation strategies your hashmap implementation uses.

It's also worth cautioning that many hash table implementations have non-constant complexity for some operations (e.g., separate chaining could degrade to O(n) in the worst case). Given that you are attempting to optimize for latency, an array with very aggressive signaling to the OS memory manager (e.g., madvise and mlock) is likely to give you the closest thing to constant-latency lookups that you can easily get on a microprocessor.
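A minimal sketch of that signaling, assuming Linux (error handling elided; pinning gigabytes requires RLIMIT_MEMLOCK headroom or CAP_IPC_LOCK):

```cpp
#include <sys/mman.h>
#include <cstddef>
#include <cstdint>

struct Order {
    std::int64_t price;
    std::int32_t quantity;
};

constexpr std::size_t kSlots = 100'000'001;  // ids 0..100,000,000
constexpr std::size_t kBytes = kSlots * sizeof(Order);

Order* allocate_pinned_table() {
    // One anonymous mapping for the whole table.
    void* p = mmap(nullptr, kBytes, PROT_READ | PROT_WRITE,
                   MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (p == MAP_FAILED) return nullptr;

    madvise(p, kBytes, MADV_WILLNEED);  // hint: fault the pages in eagerly
    mlock(p, kBytes);                   // pin them resident (can fail; check in real code)

    return static_cast<Order*>(p);
}
```

Huge pages (MAP_HUGETLB, or madvise with MADV_HUGEPAGE where transparent huge pages are enabled) are also worth investigating for a table this size, since they sharply reduce TLB pressure.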

answered Oct 12 '22 by Gian