I know how to implement most of these algorithms, but I don't know what data-set sizes (and what kinds of data) each of them is best suited for:
A sorting algorithm rearranges a given array or list of elements according to a comparison operator, which decides the new order of the elements. For example, a list of customer names could be sorted into alphabetical order by surname, or a list of people could be put into numerical order by age.

Some real-world examples: bubble sort has been used in TV software to sort channels by audience viewing time, and databases use external merge sort to sort data sets that are too large to be loaded entirely into memory.
First of all, you take all the sorting algorithms that have O(n²) complexity and throw them away.
Then, you have to study several properties of your sorting algorithms and decide which of them is better suited for the problem you want to solve. The most important are:
Is the algorithm in-place? This means that the sorting algorithm uses no extra memory beyond O(1). This property is very important when you are running memory-critical applications.
Bubble-sort, Insertion-sort and Selection-sort use constant memory. There is an in-place variant for Merge-sort too.
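For illustration, here is a minimal in-place insertion sort in Python (the function name and sample list are just for the example). It rearranges the list within itself, using only O(1) extra memory:

```python
def insertion_sort(a):
    """Sort list `a` in place using O(1) extra memory."""
    for i in range(1, len(a)):
        key = a[i]
        j = i - 1
        # Shift larger elements one slot to the right.
        while j >= 0 and a[j] > key:
            a[j + 1] = a[j]
            j -= 1
        a[j + 1] = key

data = [5, 2, 4, 6, 1, 3]
insertion_sort(data)
print(data)  # [1, 2, 3, 4, 5, 6]
```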
Is the algorithm stable? This means that if two elements x and y are equal given your comparison method, and in the input x is found before y, then in the output x will be found before y.
Merge-sort, Bubble-sort and Insertion-sort are stable.
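You can see stability directly with Python's built-in sort, which is stable (it is Timsort, a merge-sort hybrid); the records below are made up for the example:

```python
# Records as (name, age); two people share age 25.
people = [("Ada", 30), ("Bob", 25), ("Cyd", 25), ("Dee", 20)]

# A stable sort keeps Bob ahead of Cyd, because he came
# first in the input and their keys compare equal.
by_age = sorted(people, key=lambda p: p[1])
print(by_age)
# [('Dee', 20), ('Bob', 25), ('Cyd', 25), ('Ada', 30)]
```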
Can the algorithm be parallelized? If the application you are building can make use of parallel computation, you might want to choose parallelizable sorting algorithms.
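As a rough sketch of the idea, assuming you simply sort chunks in worker processes and then merge the sorted runs (the worker count of 4 is arbitrary):

```python
import heapq
from concurrent.futures import ProcessPoolExecutor

def parallel_sort(data, workers=4):
    """Sort chunks in parallel, then merge the sorted runs."""
    size = max(1, len(data) // workers)
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    with ProcessPoolExecutor(max_workers=workers) as pool:
        runs = list(pool.map(sorted, chunks))
    # heapq.merge lazily merges already-sorted iterables.
    return list(heapq.merge(*runs))

if __name__ == "__main__":
    import random
    data = [random.randint(0, 10**6) for _ in range(100_000)]
    assert parallel_sort(data) == sorted(data)
```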
Use Bubble Sort only when the data to be sorted is stored on rotating drum memory. It's optimal for that purpose, but not for random-access memory. These days, that amounts to "don't use Bubble Sort".
Use Insertion Sort or Selection Sort up to some size that you determine by testing it against the other sorts you have available. This usually works out to be around 20-30 items, but YMMV. In particular, when implementing divide-and-conquer sorts like Merge Sort and Quick Sort, you should "break out" to an Insertion sort or a Selection sort when your current block of data is small enough.
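A minimal sketch of that "break out" pattern, assuming a quicksort with a Lomuto partition; the cutoff of 24 is a guess, to be tuned by benchmarking on your own hardware:

```python
CUTOFF = 24  # assumed threshold; measure to find yours

def insertion_sort(a, lo, hi):
    """Insertion-sort the slice a[lo..hi] in place."""
    for i in range(lo + 1, hi + 1):
        key = a[i]
        j = i - 1
        while j >= lo and a[j] > key:
            a[j + 1] = a[j]
            j -= 1
        a[j + 1] = key

def hybrid_quicksort(a, lo=0, hi=None):
    if hi is None:
        hi = len(a) - 1
    while lo < hi:
        # Small block: break out to insertion sort.
        if hi - lo < CUTOFF:
            insertion_sort(a, lo, hi)
            return
        # Lomuto partition with the middle element as pivot.
        mid = (lo + hi) // 2
        a[mid], a[hi] = a[hi], a[mid]
        pivot, p = a[hi], lo
        for i in range(lo, hi):
            if a[i] < pivot:
                a[i], a[p] = a[p], a[i]
                p += 1
        a[p], a[hi] = a[hi], a[p]
        # Recurse on the smaller side, loop on the larger.
        if p - lo < hi - p:
            hybrid_quicksort(a, lo, p - 1)
            lo = p + 1
        else:
            hybrid_quicksort(a, p + 1, hi)
            hi = p - 1
```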
Also use Insertion Sort on nearly-sorted data, for example if you somehow know that your data used to be sorted, and hasn't changed very much since.
Use Merge Sort when you need a stable sort (it's also good for sorting linked lists), but beware that for arrays it uses significant additional memory.
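Here is a sketch of merge sort on a singly linked list (the Node class is made up for the example); splitting and merging only rewire pointers, so no array-sized buffer is needed, and the `<=` in the merge keeps it stable:

```python
class Node:
    def __init__(self, value, next=None):
        self.value = value
        self.next = next

def merge_sort(head):
    if head is None or head.next is None:
        return head
    # Split the list in half with slow/fast pointers.
    slow, fast = head, head.next
    while fast and fast.next:
        slow, fast = slow.next, fast.next.next
    mid, slow.next = slow.next, None
    left, right = merge_sort(head), merge_sort(mid)
    # Merge the two sorted halves; `<=` preserves stability.
    dummy = tail = Node(None)
    while left and right:
        if left.value <= right.value:
            tail.next, left = left, left.next
        else:
            tail.next, right = right, right.next
        tail = tail.next
    tail.next = left or right
    return dummy.next
```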
Generally you don't use "plain" Quick Sort at all, because even with intelligent choice of pivots it still has an Ω(n²) worst case, and unlike Insertion Sort it doesn't have any useful best cases. The "killer" cases can be constructed systematically, so if you're sorting "untrusted" data then some user could deliberately kill your performance, and anyway there might be some domain-specific reason why your data approximates a killer case. If you choose random pivots then the probability of hitting a killer case is negligible, so that's an option, but the usual approach is "IntroSort": a Quick Sort that detects bad cases and switches to Heap Sort.
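A minimal IntroSort sketch, assuming a depth limit of about 2·log2(n) and using heapq as a stand-in heapsort (simple, though not the fastest in-place form):

```python
import heapq
import math

def introsort(a):
    limit = 2 * max(1, int(math.log2(len(a) or 1)))
    _introsort(a, 0, len(a) - 1, limit)

def _heapsort(a, lo, hi):
    # Heap-sort the slice a[lo..hi] via heapq.
    chunk = a[lo:hi + 1]
    heapq.heapify(chunk)
    for i in range(lo, hi + 1):
        a[i] = heapq.heappop(chunk)

def _introsort(a, lo, hi, depth):
    if lo >= hi:
        return
    # Too deep: we keep hitting bad pivots, so bail out to heapsort.
    if depth == 0:
        _heapsort(a, lo, hi)
        return
    # Lomuto partition with the middle element as pivot.
    mid = (lo + hi) // 2
    a[mid], a[hi] = a[hi], a[mid]
    pivot, p = a[hi], lo
    for i in range(lo, hi):
        if a[i] < pivot:
            a[i], a[p] = a[p], a[i]
            p += 1
    a[p], a[hi] = a[hi], a[p]
    _introsort(a, lo, p - 1, depth - 1)
    _introsort(a, p + 1, hi, depth - 1)
```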
Radix Sort is a bit of an oddball. It's difficult to find common problems for which it is best, but it has a good asymptotic limit for fixed-width data (O(n), where comparison sorts are Ω(n log n)). If your data is fixed-width, and the input is larger than the number of possible values (for example, more than 4 billion 32-bit integers), then there starts to be a chance that some variety of radix sort will perform well.
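For example, here is a sketch of LSD radix sort for non-negative 32-bit integers, done as four byte-wise passes; each pass is a stable bucket distribution on one byte:

```python
def radix_sort_u32(a):
    """Sort non-negative 32-bit ints in four 8-bit passes."""
    for shift in (0, 8, 16, 24):
        buckets = [[] for _ in range(256)]
        for x in a:
            buckets[(x >> shift) & 0xFF].append(x)
        # Concatenating buckets in order keeps each pass stable.
        a = [x for bucket in buckets for x in bucket]
    return a

print(radix_sort_u32([170, 45, 75, 90, 802, 24, 2, 66]))
# [2, 24, 45, 66, 75, 90, 170, 802]
```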