From time to time I browse the web looking for interesting algorithms and data structures to put into my bag of tricks. A year ago I came across the Soft Heap data structure and learned about near sorting.
The idea behind this is that it's possible to break the O(n log n) barrier of comparison-based sorting if you can live with the fact that the algorithm cheats a bit: you get an almost sorted list, but you have to live with some errors as well.
I played around with the algorithms in a test environment but never found a use for them.
So the question: has anyone ever used near sorting in practice? If so, in what kind of applications? Can you think of a use case where near sorting is the right thing to do?
Quicksort is probably more effective for datasets that fit in memory. For larger datasets it proves inefficient, so algorithms like merge sort are preferred in that case. Quicksort is an in-place sort (i.e. it doesn't require any extra storage), so it is appropriate for arrays.
Many sorting algorithms are available, but the one best suited to an almost sorted array is insertion sort.
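To illustrate (my own sketch, not code from the answer): insertion sort's inner loop only walks past elements that are out of order, so on a nearly sorted input it does close to n comparisons in total.

```python
def insertion_sort(a):
    """Sort the list a in place and return it.

    Runs in O(n + k) time, where k is the number of inversions,
    so it is near-linear on nearly sorted input.
    """
    for i in range(1, len(a)):
        key = a[i]
        j = i - 1
        # Shift larger elements one slot right until key fits.
        while j >= 0 and a[j] > key:
            a[j + 1] = a[j]
            j -= 1
        a[j + 1] = key
    return a

# A nearly sorted list: only two adjacent pairs are swapped.
print(insertion_sort([1, 3, 2, 4, 6, 5, 7]))
```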
Among the sorting algorithms generally studied in data structures and algorithms courses, selection sort makes the fewest writes (it makes O(n) swaps). But cycle sort almost always makes fewer writes than selection sort.
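Here's a sketch of cycle sort in plain Python (my own illustration, not part of the original answer). It counts the writes it performs, so the claim can be checked against selection sort's swap count on a given input; each element is written at most once into its final slot.

```python
def cycle_sort(a):
    """Sort the list a in place, minimizing writes to the array.

    Returns the number of array writes performed.
    """
    writes = 0
    for start in range(len(a) - 1):
        item = a[start]
        # Find item's final position by counting smaller elements after it.
        pos = start
        for i in range(start + 1, len(a)):
            if a[i] < item:
                pos += 1
        if pos == start:
            continue  # already in place, no write needed
        while item == a[pos]:  # skip over equal elements
            pos += 1
        a[pos], item = item, a[pos]
        writes += 1
        # Rotate the rest of the cycle until we return to start.
        while pos != start:
            pos = start
            for i in range(start + 1, len(a)):
                if a[i] < item:
                    pos += 1
            while item == a[pos]:
                pos += 1
            a[pos], item = item, a[pos]
            writes += 1
    return writes
```

On an already sorted input it performs zero writes, whereas selection sort implementations typically still execute their swap each pass.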
Insertion sort runs much more efficiently if the array is already sorted or close to sorted. Selection sort always performs O(n) swaps, while insertion sort performs O(n²) swaps in the average and worst cases. Selection sort is preferable when writing to memory is significantly more expensive than reading.
This is a total flying guess, but given the inherent subjectivity of "relevance" measures when sorting search results, I'd venture that it doesn't really matter whether or not they're perfectly sorted. The same could be said for recommendations. If you can somehow arrange that every other part of your algorithm for those things is O(n) then you might look to avoid a sort.
Be aware also that in the worst case your "nearly sorted" data does not match one intuitive idea of "nearly sorted", namely that it has only a small number of inversions. The reason is that if your data has only O(n) inversions, you can finish sorting it in O(n) time using insertion sort or cocktail sort (i.e. two-way bubble sort). It follows that you cannot possibly have reached this point from completely unsorted data in O(n) time (using comparisons), since that would beat the Ω(n log n) lower bound for a full sort. So you're looking for applications where a large subset of the data is already sorted and the remainder is scattered, not for applications requiring that every element be close to its correct position.
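To make the inversion count concrete, here is a small self-contained check (my own sketch): the number of element shifts insertion sort performs equals the number of inversions in the input, so O(n) inversions means only O(n) finishing work.

```python
def count_inversions(a):
    """Brute-force O(n^2) count of pairs (i, j) with i < j and a[i] > a[j]."""
    n = len(a)
    return sum(1 for i in range(n) for j in range(i + 1, n) if a[i] > a[j])

def insertion_sort_shifts(a):
    """Insertion sort a in place; return the number of element shifts."""
    shifts = 0
    for i in range(1, len(a)):
        key = a[i]
        j = i - 1
        while j >= 0 and a[j] > key:
            a[j + 1] = a[j]  # each shift undoes exactly one inversion
            j -= 1
            shifts += 1
        a[j + 1] = key
    return shifts

data = [1, 3, 2, 5, 4, 6]  # two inversions: (3, 2) and (5, 4)
print(count_inversions(data))         # 2
print(insertion_sort_shifts(data))    # 2 — shifts == inversions
```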