Binary Trees vs. Linked Lists vs. Hash Tables

Tags: algorithm, hashtable, linked-list, binary-tree, symbol-tables

I'm building a symbol table for a project I'm working on. I was wondering what people's opinions are on the advantages and disadvantages of the various methods available for storing and creating a symbol table.

I've done a fair bit of searching, and the most commonly recommended are binary trees, linked lists, and hash tables. What are the advantages and/or disadvantages of each of these? (Working in C++.)


asked Dec 16 '08 12:12

benmcredmond

2 Answers

The standard trade-offs between these data structures apply.

Binary trees
- medium complexity to implement (assuming you can't get them from a library)
- inserts are O(log N), provided the tree is kept balanced
- lookups are O(log N), provided the tree is kept balanced
Linked lists (unsorted)
- low complexity to implement
- inserts are O(1) (at the head of the list)
- lookups are O(N)
Hash tables
- high complexity to implement
- inserts are O(1) on average
- lookups are O(1) on average
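
Since the question mentions C++, here is a minimal sketch of how the three options could look using standard library containers rather than hand-written ones. `SymbolInfo`, the type aliases, and the helper functions are hypothetical names chosen for illustration. `std::map` is typically implemented as a balanced (red-black) tree, though the standard only specifies its complexity, not its implementation.

```cpp
#include <cassert>
#include <list>
#include <map>
#include <string>
#include <unordered_map>
#include <utility>

// Hypothetical payload, for illustration only; a real symbol table
// would likely store type, scope, and so on.
struct SymbolInfo {
    int line;  // line on which the symbol was declared
};

// Ordered associative container: O(log N) insert and lookup.
using TreeTable = std::map<std::string, SymbolInfo>;
// Hash table: O(1) insert and lookup on average, no ordering.
using HashTable = std::unordered_map<std::string, SymbolInfo>;
// Unsorted linked list: O(1) insert at the front, O(N) lookup.
using ListTable = std::list<std::pair<std::string, SymbolInfo>>;

// Lookup for the two associative containers; returns nullptr if absent.
template <typename Table>
const SymbolInfo* lookup(const Table& table, const std::string& name) {
    auto it = table.find(name);
    return it == table.end() ? nullptr : &it->second;
}

// Lookup for the list: a linear scan over every entry.
const SymbolInfo* lookupInList(const ListTable& table, const std::string& name) {
    for (const auto& entry : table) {
        if (entry.first == name) return &entry.second;
    }
    return nullptr;
}

// Small usage example: the same symbol stored in all three tables.
void usageExample() {
    TreeTable tree;
    HashTable hash;
    ListTable list;
    tree["count"] = SymbolInfo{3};
    hash["count"] = SymbolInfo{3};
    list.emplace_front("count", SymbolInfo{3});

    assert(lookup(tree, "count")->line == 3);
    assert(lookup(hash, "count")->line == 3);
    assert(lookupInList(list, "count")->line == 3);
    assert(lookup(hash, "missing") == nullptr);
}
```

With library containers the "complexity to implement" column largely disappears, so the choice mostly comes down to lookup cost and whether you need the symbols in sorted order.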

answered Oct 07 '22 09:10

Darron

Your use case is presumably going to be "insert the data once (e.g., application startup) and then perform lots of reads but few if any extra insertions".

Therefore you need to use an algorithm that is fast for looking up the information that you need.

I'd therefore think a hash table is the most suitable data structure to use: it simply generates a hash of your key object and uses that to locate the target data, which is O(1) on average. The others are O(N) (a linked list of size N, where you have to walk the list one node at a time and examine N/2 nodes on average) and O(log N) (a binary tree, where you halve the search space with each step, but only if the tree is balanced; this depends on your implementation, and an unbalanced tree can have significantly worse performance).
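
To illustrate the point about unbalanced trees, here is a rough sketch of a plain binary search tree with no rebalancing. `Node`, `insert`, `height`, and `heightAfterInserting` are hypothetical names written for this example, not part of any library. Inserting keys that are already sorted makes every node hang off the right of the previous one, so the tree degenerates into a linked list.

```cpp
#include <algorithm>
#include <cassert>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Minimal unbalanced BST node; hypothetical, for illustration only.
struct Node {
    std::string key;
    std::unique_ptr<Node> left, right;
    explicit Node(std::string k) : key(std::move(k)) {}
};

// Plain BST insert with no rebalancing; duplicates are ignored.
void insert(std::unique_ptr<Node>& root, const std::string& key) {
    std::unique_ptr<Node>* cur = &root;
    while (*cur) {
        if (key < (*cur)->key) cur = &(*cur)->left;
        else if ((*cur)->key < key) cur = &(*cur)->right;
        else return;
    }
    cur->reset(new Node(key));
}

// Number of nodes on the longest root-to-leaf path (0 for an empty tree).
int height(const std::unique_ptr<Node>& node) {
    if (!node) return 0;
    return 1 + std::max(height(node->left), height(node->right));
}

// Builds a tree from the keys in the given order and reports its height.
int heightAfterInserting(const std::vector<std::string>& keys) {
    std::unique_ptr<Node> root;
    for (const auto& k : keys) insert(root, k);
    return height(root);
}

// Usage example: same seven keys, two insertion orders.
void usageExample() {
    // Sorted input: the tree is a chain, so a lookup may visit all 7 nodes.
    assert(heightAfterInserting({"a", "b", "c", "d", "e", "f", "g"}) == 7);
    // Middle-first input: the tree is perfectly balanced with height 3.
    assert(heightAfterInserting({"d", "b", "f", "a", "c", "e", "g"}) == 3);
}
```

Self-balancing trees (such as the ones usually behind `std::map`) avoid this worst case by restructuring on insert.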

Just make sure that there are enough slots (buckets) in the hash table for your data (re: Soraz's comment on this post). Most framework implementations (Java, .NET, etc.) are of a quality that you won't need to worry about the implementation details.
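
In C++ specifically, one way to make sure there are enough buckets up front is `std::unordered_map::reserve`, assuming you have a rough idea of how many symbols to expect. The sketch below uses hypothetical names (`SymbolTable`, `fillsWithoutRehash`) and an arbitrary `int` payload. It checks that after reserving, inserting that many entries does not trigger a rehash.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>

// Hypothetical table type: symbol name mapped to an arbitrary int payload.
using SymbolTable = std::unordered_map<std::string, int>;

// Reserves room for `count` symbols, inserts that many, and reports
// whether the bucket count stayed the same (i.e. no rehash occurred).
bool fillsWithoutRehash(std::size_t count) {
    SymbolTable table;
    table.reserve(count);  // sizes the bucket array for `count` elements
    const std::size_t bucketsBefore = table.bucket_count();

    for (std::size_t i = 0; i < count; ++i) {
        table.emplace("sym" + std::to_string(i), static_cast<int>(i));
    }

    return table.bucket_count() == bucketsBefore &&
           table.load_factor() <= table.max_load_factor();
}

// Usage example with an arbitrary expected symbol count.
void usageExample() {
    assert(fillsWithoutRehash(1000));
}
```

Without the `reserve` call the container still works; it just grows and rehashes as needed, which costs some time during the initial bulk insert.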

Did you do a course on data structures and algorithms at university?

answered Oct 07 '22 08:10

JeeBee
