I am wondering if anyone knows of a data structure which would efficiently handle the following situation: The data structure should store several, possibly overlapping, variable length ranges on some continuous timescale. <ul> <li>For example, you might add the ranges <code>a:[0,3], b:[4,7], c:[0,9]</code>.</li> <li>Insertion time does not need to be particularly efficient.</li> </ul> Retrievals would take a range as a parameter, and return all the ranges in the set that overlap with the range, for example: <ul> <li><code>Get(1,2)</code> would return a and c. <code>Get(6,7)</code> would return b and c. <code>Get(2,6)</code> would return all three.</li> <li>Retrievals need to be as efficient as possible.</li> </ul>


Data Structure for Storing Ranges

2 Answers

One data structure you could use is a one-dimensional R-tree. These are designed to deal with ranges and to provide efficient retrieval. While reading about them, you will also come across Allen's operators: there are a dozen relationships between time intervals besides 'overlaps'.
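To make the point about Allen's relations concrete, here is a minimal sketch that names the relation between two intervals. The function name `allen_relation` and the tuple representation are assumptions made for illustration, and the sketch assumes proper intervals, meaning `start < end` for both arguments.

```python
def allen_relation(a, b):
    """Name the Allen relation of interval a with respect to interval b.

    Assumes proper intervals given as (start, end) tuples with start < end.
    """
    (a0, a1), (b0, b1) = a, b
    if a1 < b0:
        return "before"
    if b1 < a0:
        return "after"
    if a1 == b0:
        return "meets"
    if b1 == a0:
        return "met-by"
    if a0 == b0 and a1 == b1:
        return "equals"
    if a0 == b0:
        return "starts" if a1 < b1 else "started-by"
    if a1 == b1:
        return "finishes" if a0 > b0 else "finished-by"
    # From here on, all four endpoints are distinct and the interiors intersect.
    if b0 < a0 and a1 < b1:
        return "during"
    if a0 < b0 and b1 < a1:
        return "contains"
    return "overlaps" if a0 < b0 else "overlapped-by"


# The ranges from the question: a:[0,3], b:[4,7], c:[0,9]
print(allen_relation((0, 3), (4, 7)))  # before
print(allen_relation((0, 3), (0, 9)))  # starts
print(allen_relation((4, 7), (0, 9)))  # during
```

Note that a retrieval such as `Get(1,2)` in the question wants every range that shares at least one point with the query, which is broader than the single relation Allen calls 'overlaps'.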

There are other questions on SO that impinge on this area, including:

Determine Whether Two Date Ranges Overlap
Data structure for non-overlapping ranges within a single dimension
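As a baseline for comparing any of these structures, the usual test for whether two closed ranges overlap, plus a linear scan over all stored ranges, can be sketched as follows. The names `overlaps` and `get` are illustrative only, and the endpoints are assumed to be inclusive, as in the question's examples.

```python
def overlaps(a_start, a_end, b_start, b_end):
    """True if the closed ranges [a_start, a_end] and [b_start, b_end] share a point."""
    return a_start <= b_end and b_start <= a_end


def get(ranges, start, end):
    """Linear scan: names of all stored ranges that overlap [start, end]."""
    return [name for name, (s, e) in ranges.items() if overlaps(s, e, start, end)]


# The ranges from the question
ranges = {"a": (0, 3), "b": (4, 7), "c": (0, 9)}

print(get(ranges, 1, 2))  # ['a', 'c']
print(get(ranges, 6, 7))  # ['b', 'c']
print(get(ranges, 2, 6))  # ['a', 'b', 'c']
```

This scan checks every stored range on each retrieval, so it is mainly useful as a reference for checking the results of a faster structure.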

188

answered Oct 16 '22 14:10

Jonathan Leffler

You could go for a binary tree that stores the ranges in a hierarchy. The root node represents an all-encompassing range divided at its middle. To insert a range, you test whether it belongs to the left subrange, the right subrange, or both, and recursively carry on in the matching subnodes until you reach a certain depth, at which you save the actual range.

For lookup, you test your input range against the left and right subranges of the top node and descend into the ones that overlap, repeating until you reach the nodes holding actual ranges, which you collect.

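This way, a retrieval only descends into the subranges that overlap the query, so a narrow query visits a number of nodes roughly logarithmic in the number of subranges. You still need to remove duplicates from the results, as a range that spans several subranges is stored in several nodes.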
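One possible reading of this description is sketched below. The class name `RangeNode`, the fixed `depth` parameter, and the choice to store ranges only at the deepest level are assumptions made for illustration, not details given in the answer. Ranges are treated as closed and are assumed to lie within the root span.

```python
class RangeNode:
    """Node covering the span [lo, hi], split at its midpoint."""

    def __init__(self, lo, hi, depth):
        self.lo, self.hi, self.depth = lo, hi, depth
        self.mid = (lo + hi) / 2
        self.left = self.right = None  # children are created on demand
        self.items = []                # (name, start, end); used only at depth 0

    def insert(self, name, start, end):
        if self.depth == 0:
            self.items.append((name, start, end))
            return
        if start <= self.mid:  # the range reaches into the left half
            if self.left is None:
                self.left = RangeNode(self.lo, self.mid, self.depth - 1)
            self.left.insert(name, start, end)
        if end >= self.mid:    # the range reaches into the right half
            if self.right is None:
                self.right = RangeNode(self.mid, self.hi, self.depth - 1)
            self.right.insert(name, start, end)

    def get(self, start, end, found=None):
        # A set removes the duplicates caused by ranges stored in several leaves.
        found = set() if found is None else found
        if self.depth == 0:
            for name, s, e in self.items:
                if s <= end and start <= e:  # closed-range overlap test
                    found.add(name)
            return found
        if self.left is not None and start <= self.mid:
            self.left.get(start, end, found)
        if self.right is not None and end >= self.mid:
            self.right.get(start, end, found)
        return found


# The ranges from the question, on an assumed timescale of [0, 16]
tree = RangeNode(0, 16, depth=4)
for name, (s, e) in {"a": (0, 3), "b": (4, 7), "c": (0, 9)}.items():
    tree.insert(name, s, e)

print(sorted(tree.get(1, 2)))  # ['a', 'c']
print(sorted(tree.get(6, 7)))  # ['b', 'c']
print(sorted(tree.get(2, 6)))  # ['a', 'b', 'c']
```

In this sketch a long range such as `c` is copied into every leaf it touches, which is why insertion is the expensive side and why the lookup collects names into a set.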

answered Oct 16 '22 12:10

small_duck


Tags: performance, data-structures, range

Kevin
