I need to know if a ternary tree is better than a hash table. I came across this question in a reply to another question I had where someone said that ternary trees are often faster than hash tables. I found that hard to believe, so I decided to research a little into it. This one website from Princeton appears to be the source of the belief. I took a look at algorithm which is described to be O(log n + k) where n is the number of words stored, and k is the length of the key. It seems to me that the only way this could be faster is if you are often searching for elements that are not already stored. Another thing that bothers me, is that the non-continuous crawling of a trie would tend to cause you to hit pages that have been swapped out, but whether this is a major effect could only be seen through benchmarks. Now I know that there are probably pros and cons to both of them, and if so, I want to know what they are. Benchmarks are also helpful.

Here is what I gather from the Dr. Dobbs Article reachable from the Princeton link you gave: <ol> <li>Ternary Search Trees are up to 10% faster than hash tables on some search problems. They are sometimes slower - depending vastly on the machine used.</li> <li>TRTs are a custom data structure tuned by two of the finest minds of Computer Science - Jon Bentley and Robert Sedgewick both wrote good textbooks, and have done their share of practical programming. Hash tables are considered run-of-the-mill.</li> <li>The constants involved are significant, as Hao Wooi Lin says.</li> <li>Overall, this depends on the problem you are solving. The faster development time and almost ubiquitous support for hash tables in many programming languages are often more important than a ten-percent improvement in run time.</li> </ol>

Ternary Tree Vs Hash Table

Tags:

algorithm

hashtable

ternary-search-tree

I need to know if a ternary tree is better than a hash table.

I came across this question in a reply to another question I had where someone said that ternary trees are often faster than hash tables. I found that hard to believe, so I decided to research a little into it.

This one website from Princeton appears to be the source of the belief. I took a look at algorithm which is described to be O(log n + k) where n is the number of words stored, and k is the length of the key.

It seems to me that the only way this could be faster is if you are often searching for elements that are not already stored. Another thing that bothers me, is that the non-continuous crawling of a trie would tend to cause you to hit pages that have been swapped out, but whether this is a major effect could only be seen through benchmarks.

Now I know that there are probably pros and cons to both of them, and if so, I want to know what they are. Benchmarks are also helpful.

913

asked May 05 '09 07:05

Unknown

2 Answers

Here is what I gather from the Dr. Dobbs Article reachable from the Princeton link you gave:

Ternary Search Trees are up to 10% faster than hash tables on some search problems. They are sometimes slower - depending vastly on the machine used.
TRTs are a custom data structure tuned by two of the finest minds of Computer Science - Jon Bentley and Robert Sedgewick both wrote good textbooks, and have done their share of practical programming. Hash tables are considered run-of-the-mill.
The constants involved are significant, as Hao Wooi Lin says.
Overall, this depends on the problem you are solving. The faster development time and almost ubiquitous support for hash tables in many programming languages are often more important than a ten-percent improvement in run time.

answered Oct 14 '22 09:10

Yuval F

The only way to answer this question is empirically. The answer depends on the details of your implementation, what kinds of searches you do, what hardware you have, and what compiler you're using. You can copy the C code from Princeton. If you want to try a functional language, Standard ML has hash tables (look at SML/NJ), and here is some ML for ternary search trees:

type key = Key.ord_key
type item = Key.ord_key list

datatype set = NODE of { key : key, lt : set, eq : set, gt : set }
             | LEAF

val empty = LEAF

fun member (_, LEAF) = false
  | member (h::t, NODE {key, lt, eq, gt}) =
      (case Key.compare (h, key)
         of EQUAL   => member(t, eq)
          | LESS    => member(h::t, lt)
          | GREATER => member(h::t, gt))
  | member ([], NODE {key, lt, eq, gt}) =
      (case Key.compare (Key.sentinel, key)
         of EQUAL   => true
          | LESS    => member([], lt)
          | GREATER => member([], gt))

exception AlreadyPresent

fun insert(h::t, LEAF) =
      NODE { key = h, eq = insert(t, LEAF), lt = LEAF, gt = LEAF }
  | insert([], LEAF) =
      NODE { key = Key.sentinel, eq = LEAF, lt = LEAF, gt = LEAF }
  | insert(h::t, NODE {key, lt, eq, gt}) =
      (case Key.compare (h, key)
         of EQUAL   => NODE {key = key, lt = lt, gt = gt, eq = insert(t, eq)}
          | LESS    => NODE {key = key, lt = insert(h::t, lt), gt = gt, eq = eq}
          | GREATER => NODE {key = key, lt = lt, gt = insert(h::t, gt), eq = eq})
  | insert([], NODE {key, lt, eq, gt}) =
      (case Key.compare (Key.sentinel, key)
         of EQUAL   => raise AlreadyPresent
          | LESS    => NODE {key = key, lt = insert([], lt), gt = gt, eq = eq}
          | GREATER => NODE {key = key, lt = lt, gt = insert([], gt), eq = eq})

fun add(l, n) = insert(l, n) handle AlreadyPresent => n

answered Oct 14 '22 10:10

Norman Ramsey

Related questions
                            
                                Faster way of searching array of sets
                            
                                How to keep track of players' rankings?
                            
                                Fast idiomatic Floyd-Warshall algorithm in Rust
                            
                                Correct formulation of the A* algorithm
                            
                                Ideas for converting straight quotes to curly quotes
                            
                                Algorithm for copying N bits at arbitrary position from one int to another
                            
                                Is there a C++ MinMax Heap implementation?
                            
                                Where can I find soft-multiply and divide algorithms?
                            
                                Optimizing Jaro-Winkler algorithm
                            
                                Algorithm for fairly assigning tasks to workers based on skills
                            
                                A* Search Algorithm
                            
                                Algorithm for fast Drop shadow in GDI+
                            
                                How to find multiplicative partitions of any integer?
                            
                                How to obtain index of element from predicate passed to some STL algorithm?
                            
                                Find the better intersection of two moving objects
                            
                                Finding a list of adjacent words between two words
                            
                                Memoization algorithm time complexity
                            
                                How is union find quadratic algorithm?
                            
                                Get longest continuous sequence of 1s
                            
                                Big O of an algorithm that relies on convergence

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With