How does one implement hash tables in a functional language?

Tags:

Is there any way to implement hash tables efficiently in a purely functional language? It seems like any change to the hash table would require creating a copy of the original hash table. I must be missing something. Hash tables are pretty darn important data structures, and a programming language would be limited without them.

289

asked Jul 22 '11 16:07

Matt Fichman

1 Answers

Is there any way to implement hash tables efficiently in a purely functional language?

Hash tables are a concrete implementation of the abstract "dictionary" or "associative array" data structure. So I think you really want to ask about the efficiency of purely functional dictionaries compared to imperative hash tables.

It seems like any change to the hash table would require creating a copy of the original hash table.

Yes, hash tables are inherently imperative and there is no direct purely functional equivalent. Perhaps the most similar purely functional dictionary type is the hash trie but they are significantly slower than hash tables due to allocations and indirections.

I must be missing something. Hash tables are pretty darn important data structures, and a programming language would be limited without them.

Dictionaries are a very important data structure (although its worth noting that they were rare in the mainstream until Perl made them popular in the 1990s, so people coded stuff for decades without benefit of dictionaries). I agree that hash tables are also important because they are often by far the most efficient dictionaries.

There are many purely functional dictionaries:

Balanced trees (red-black, AVL, weight-balanced, finger trees etc.), e.g. Map in OCaml and F# and Data.Map in Haskell.
Hash tries, e.g. PersistentHashMap in Clojure.

But these purely functional dictionaries are all much slower than a decent hash table (e.g. the .NET Dictionary).

Beware Haskell benchmarks comparing hash tables to purely functional dictionaries claiming that purely functional dictionaries are competitively performant. The correct conclusion is that Haskell's hash tables are so inefficient that they are almost as slow as purely functional dictionaries. If you compare with .NET, for example, you find that a .NET Dictionary can be 26× faster than Haskell's hash table!

I think to really conclude what you're trying to conclude about Haskell's performance you would need to test more operations, use a non-ridiculous key-type (doubles as keys, what?), not use -N8 for no reason, and compare to a 3rd language that also boxes its parametric types, like Java (as Java has acceptable performance in most cases), to see if its a common problem of boxing or some more serious fault of the GHC runtime. These benchmarks are along these lines (and ~2x as fast as the current hashtable implementation).

This is exactly the kind of misinformation I was referring to. Pay no attention to Haskell's hash tables in this context, just look at the performance of the fastest hash tables (i.e. not Haskell) and the fastest purely functional dictionaries.

163

answered Sep 22 '22 02:09

J D

Related questions
                            
                                Pass function as a parameter in vb.net?
                            
                                CMS in functional programming language [closed]
                            
                                Is it recommended to always have exhaustive pattern matches in Haskell, even for "impossible" cases?
                            
                                Writing a C# version of Haskell infinite Fibonacci series function
                            
                                .filter is not a function [duplicate]
                            
                                How do I get (a, b) => c from a => b => c in Scala?
                            
                                "functions are first class values" what does this exactly mean?
                            
                                Higher order functions in C
                            
                                Map values in Collectors.groupingBy()
                            
                                Which functional programming language should I choose as first functional programming language? [closed]
                            
                                Is OO design's strength in semantics or encapsulation?
                            
                                Implementing functional programming in Perl
                            
                                A grasp of immutable datastructures
                            
                                Distinctive traits of the functional languages
                            
                                Writing a functional and yet functional image processing library in Scala
                            
                                Difference between $ and ()
                            
                                Struggling with using pure functional programming to solve an everyday problem
                            
                                What is Haskell's style of polymorphism?
                            
                                Simple Node/Express app, the functional programming way (How to handle side-effects in JavaScript?)
                            
                                Best Practices for cache locality in Multicore Parallelism in F#

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does one implement hash tables in a functional language?

Tags:

hashtable

functional-programming

Matt Fichman

People also ask

1 Answers

J D

Recent Activity

Donate For Us