Best continuously sorting algorithm?

2 Answers

Building a self-balancing binary search tree, such as a red-black tree or an AVL tree, allows Θ(lg n) insertion and removal and Θ(n) retrieval of all elements in sorted order (via an in-order depth-first traversal), with Θ(n) memory usage. These trees are somewhat complex to implement, but they are efficient, and most languages have library implementations, so they're a good first choice in most cases.
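As a rough illustration of the "library implementation" route in C, here is a minimal sketch using POSIX tsearch and twalk from <search.h>. POSIX does not promise that the tree is balanced (glibc happens to use a red-black tree), and the names add_value, collect_sorted, key_pool and MAX_KEYS are made up for this example:

```c
#define _XOPEN_SOURCE 700  /* expose tsearch/twalk under strict -std modes */
#include <assert.h>
#include <search.h>
#include <stddef.h>

#define MAX_KEYS 64        /* hypothetical fixed capacity for the sketch */

static int key_pool[MAX_KEYS];   /* the tree stores pointers into this pool */
static size_t pool_used = 0;
static void *tree_root = NULL;

static int sorted_out[MAX_KEYS]; /* filled by collect_sorted() */
static size_t sorted_len = 0;

static int compare_ints(const void *a, const void *b) {
    int x = *(const int *)a, y = *(const int *)b;
    return (x > y) - (x < y);
}

/* Insert a value as it arrives; duplicates are ignored.
   Returns 0 on success, -1 if tsearch could not allocate a node. */
static int add_value(int value) {
    assert(pool_used < MAX_KEYS);
    key_pool[pool_used] = value;
    void *slot = tsearch(&key_pool[pool_used], &tree_root, compare_ints);
    if (slot == NULL)
        return -1;
    if (*(int **)slot == &key_pool[pool_used])
        pool_used++;             /* key was new, so keep its pool slot */
    return 0;
}

/* twalk visits internal nodes three times; "postorder" is the in-order visit. */
static void visit(const void *nodep, VISIT which, int depth) {
    (void)depth;
    if (which == postorder || which == leaf)
        sorted_out[sorted_len++] = **(int *const *)nodep;
}

/* Copy all keys into sorted_out in ascending order; returns how many. */
static size_t collect_sorted(void) {
    sorted_len = 0;
    twalk(tree_root, visit);
    return sorted_len;
}
```

Calling add_value for each incoming element and collect_sorted whenever the sorted sequence is needed matches the insert-then-traverse pattern described above. Note that tsearch does not expose the i-th element directly.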

Additionally, retrieving the i-th element (counting from 0) can be done by annotating each edge (or, equivalently, node) in the tree with the number of nodes in the subtree below it; the code here keeps the size of each node's left subtree in left_count. Then one can find the i-th element in Θ(lg n) time and Θ(1) space with something like:

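node *find_index(node *root, int i) {
  /* left_count is the number of nodes in root's left subtree */
  while (root) {
    if (i == root->left_count)
      return root;               // exactly left_count nodes precede root
    else if (i < root->left_count)
      root = root->left;
    else {
      i -= root->left_count + 1; // skip the left subtree and root itself
      root = root->right;
    }
  }
  return NULL; // i is out of range (negative or >= number of nodes)
}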

An implementation that supports this can be found in Debian's libavl; unfortunately, the maintainer's site seems to be down, but the source can be retrieved from Debian's servers.
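To show how left_count can be kept up to date, here is a minimal sketch of insertion into a plain, unbalanced binary search tree. This is not libavl's code: the node layout and the insert function are assumptions made for the example, and a real AVL or red-black tree would also have to adjust left_count during rotations.

```c
#include <assert.h>
#include <stdlib.h>

typedef struct node {
    int key;
    int left_count;              /* number of nodes in the left subtree */
    struct node *left, *right;
} node;

/* Insert key into an unbalanced BST (duplicates go to the right).
   Returns 0 on success, -1 if memory allocation fails. */
int insert(node **rootp, int key) {
    node *fresh = calloc(1, sizeof *fresh);
    if (!fresh)
        return -1;
    fresh->key = key;

    node **link = rootp;
    while (*link) {
        node *cur = *link;
        if (key < cur->key) {
            cur->left_count++;   /* the new node ends up in cur's left subtree */
            link = &cur->left;
        } else {
            link = &cur->right;
        }
    }
    *link = fresh;
    return 0;
}

/* Same lookup as in the answer: returns the i-th smallest node (0-based). */
node *find_index(node *root, int i) {
    while (root) {
        if (i == root->left_count)
            return root;
        else if (i < root->left_count)
            root = root->left;
        else {
            i -= root->left_count + 1;
            root = root->right;
        }
    }
    return NULL;                 /* i is out of range */
}
```

After inserting 50, 20, 70, 10, 30, 60 in that order, find_index(root, 0) reaches the node holding 10 and find_index(root, 3) returns the root holding 50. Removal would need the mirror-image bookkeeping, decrementing left_count along the path.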

answered Oct 07 '22 09:10

bdonlan

The structure that is used for indexes of database programs is a B+ Tree. It is a balanced bucketed n-ary tree.

From Wikipedia:

For a b-order B+ tree with h levels of index:

The maximum number of records stored is n = b^h
The minimum number of keys is 2(b/2)^(h−1)
The space required to store the tree is O(n)
Inserting a record requires O(log_b(n)) operations in the worst case
Finding a record requires O(log_b(n)) operations in the worst case
Removing a (previously located) record requires O(log_b(n)) operations in the worst case
Performing a range query with k elements occurring within the range requires O(log_b(n) + k) operations in the worst case.

I use this in my program. You can add your data to the structure as it comes and you can always traverse it in order, front to back or back to front, or search quickly for any value. If you don't find the value, you will have the insertion point where you can add the value.
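The "failed search gives you the insertion point" idea can be sketched on a single sorted bucket. This is only one leaf's worth of keys, not a full B+ tree, and find_slot and insert_sorted are hypothetical helper names:

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Binary search in keys[0..count). Returns the index of the first element
   that is >= key; *found is 1 if that element equals key, else 0.
   When the key is absent, the returned index is where it should be inserted. */
size_t find_slot(const int *keys, size_t count, int key, int *found) {
    size_t lo = 0, hi = count;
    while (lo < hi) {
        size_t mid = lo + (hi - lo) / 2;
        if (keys[mid] < key)
            lo = mid + 1;
        else
            hi = mid;
    }
    *found = (lo < count && keys[lo] == key);
    return lo;
}

/* Insert key into the bucket, keeping it sorted. Returns the new count;
   the count is unchanged if the key is already present or the bucket is full
   (a real B+ tree would split the bucket at that point). */
size_t insert_sorted(int *keys, size_t count, size_t capacity, int key) {
    assert(count <= capacity);
    int found;
    size_t pos = find_slot(keys, count, key, &found);
    if (found || count == capacity)
        return count;
    memmove(&keys[pos + 1], &keys[pos], (count - pos) * sizeof keys[0]);
    keys[pos] = key;
    return count + 1;
}
```

With a bucket holding 10, 20, 40, searching for 30 reports "not found" at index 2, which is exactly where insert_sorted then places it.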

You can optimize the structure for your program by playing around with b, the size of the buckets.
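To get a feel for how b affects the depth of the tree, here is a small sketch that takes the capacity figure n = b^h from the list above at face value and finds the smallest h that fits n records. The function name levels_needed is invented for the example, and real trees are rarely completely full, so treat the result as a lower bound:

```c
#include <assert.h>
#include <limits.h>

/* Smallest h such that b^h >= n, using integer arithmetic only. */
unsigned levels_needed(unsigned long long n, unsigned b) {
    assert(b >= 2);
    unsigned h = 0;
    unsigned long long capacity = 1;   /* b^h for the current h */
    while (capacity < n) {
        if (capacity > ULLONG_MAX / b)
            return h + 1;              /* b^(h+1) would overflow, so it exceeds n */
        capacity *= b;
        h++;
    }
    return h;
}
```

For a million records this gives 20 levels with b = 2 but only 3 with b = 100, which is why larger buckets mean fewer node visits per lookup, at the cost of more work inside each bucket.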

An interesting presentation about B+ trees: Tree-Structured Indexes

You can get the entire code in C++.

Edit: Now I see your comment that your requirement to know the "i-th sorted element in the set" is an important one. All of a sudden, that makes many data structures less than optimal.

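You are probably best off with a .NET SortedList or, if insertion speed matters more, a SortedDictionary. See the article: Squeezing more performance from SortedList. Note that only SortedList gives direct access by position (GetKey(i) on the non-generic class, Keys[i] on the generic one); SortedDictionary has no indexed accessor, so reaching the i-th element there means enumerating the keys in order.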

answered Oct 07 '22 08:10

lkessler

Tags:

algorithm

sorting

Wilhelm
