Advanced data structures in practice

Tags:

data-structures

In the 10 years I've been programming, I can count the number of data structures I've used on one hand: arrays, linked lists (I'm lumping stacks and queues in with this), and dictionaries. This isn't really surprising given that nearly all of the applications I've written fall into the forms-over-data / CRUD category.

I've never needed to use a red-black tree, skip list, double-ended queue, circularly linked list, priority queue, heaps, graphs, or any of the dozens of exotic data structures that have been researched in the past 50 years. I feel like I'm missing out.

This is an open-ended question, but where are these "exotic" data structures used in practice? Does anyone have any real-world experience using these data structures to solve a particular problem?

579

asked Dec 23 '08 15:12

Juliet

1 Answers

Some examples. They're vague because they were work for employers:

A heap to get the top N results in a Google-style search. (Starting from candidates in an index, go through them all linearly, sifting them through a min-heap of max size N.) This was for an image-search prototype.
Bloom filters cut the size of certain data about what millions of users had seen down to an amount that'd fit in existing servers (it all had to be in RAM for speed); the original design would have needed many new servers just for that database.
A triangular array representation halved the size of a dense symmetrical array for a recommendation engine (RAM again for the same reason).
Users had to be grouped according to certain associations; union-find made this easy, quick, and exact instead of slow, hacky, and approximate.
An app for choosing retail sites according to drive time for people in the neighborhood used Dijkstra shortest-path with priority queues. Other GIS work took advantage of quadtrees and Morton indexes.

Knowing what's out there in data-structures-land comes in handy -- "weeks in the lab can save you hours in the library". The bloom-filter case was only worthwhile because of the scale: if the problem had come up at a startup instead of Yahoo, I'd have used a plain old hashtable. The other examples I think are reasonable anywhere (though nowadays you're less likely to code them yourself).

answered Oct 04 '22 11:10

Darius Bacon

Related questions
                            
                                How to compare two Dictionaries in C#
                            
                                What prevents Van Emde Boas trees from being more popular in real-world applications?
                            
                                What does C++ struct syntax "a : b" mean
                            
                                Generic Key/Value pair collection in that preserves insertion order?
                            
                                Converting Clojure data structures to Java collections
                            
                                Data structure behind T9 type of dictionary
                            
                                Java implementation for Min-Max Heap?
                            
                                How to implement a binary search tree in Python?
                            
                                Difference between a LinkedList and a Binary Search Tree
                            
                                Directly accessible data structure Java
                            
                                What is the fastest way to find the closest point to a given point?
                            
                                in Swift: Difference between Array VS NSArray VS [AnyObject]
                            
                                Algorithm for detecting "clusters" of dots [closed]
                            
                                Is there a bug in java.util.Stack's Iterator?
                            
                                Applications of red-black trees
                            
                                Zipping lists of unequal size
                            
                                Breadth-first traversal
                            
                                What is a Memory-Efficient Doubly Linked List in C?
                            
                                LRU cache in Java with Generics and O(1) operations
                            
                                Scrabble tile checking

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With