I found that <code>java.util.Arrays.sort(Object[])</code> use 2 kinds of sorting algorithms(in JDK 1.6). pseudocode: <pre class="prettyprint"><code>if(array.length<7) insertionSort(array); else mergeSort(array); </code></pre> Why does it need 2 kinds of sorting here? for efficiency?

It's important to note that an algorithm that is <code>O(N log N)</code> is not always faster in practice than an <code>O(N^2)</code> algorithm. It depends on the constants, and the range of <code>N</code> involved. (Remember that asymptotic notation measures relative growth rate, not absolute speed). For small <code>N</code>, insertion sort in fact does beat merge sort. It's also faster for almost-sorted arrays. Here's a quote: <blockquote> Although it is one of the elementary sorting algorithms with <code>O(N^2)</code> worst-case time, insertion sort is the algorithm of choice either when the data is nearly sorted (because it is adaptive) or when the problem size is small (because it has low overhead). For these reasons, and because it is also stable, insertion sort is often used as the recursive base case (when the problem size is small) for higher overhead divide-and-conquer sorting algorithms, such as merge sort or quick sort. </blockquote> Here's another quote from Best sorting algorithm for nearly sorted lists paper: <blockquote> straight insertion sort is best for small or very nearly sorted lists </blockquote> What this means is that, in practice: <ul> <li>Some algorithm A1 with higher asymptotic upper bound may be preferable than another known algorithm A2 with lower asymptotic upper bound <ul> <li>Perhaps A2 is just too complicated to implement</li> <li>Or perhaps it doesn't matter in the range of <code>N</code> considered <ul> <li>See e.g. Coppersmith–Winograd algorithm </li> </ul> </li> </ul> </li> <li>Some hybrid algorithms may adapt different algorithms depending on the input size</li> </ul> <h3>Related questions</h3> <ul> <li>Which sorting algorithm is best suited to re-sort an almost fully sorted list?</li> <li>Is there ever a good reason to use Insertion Sort?</li> </ul> <hr> <h3>A numerical example</h3> Let's consider these two functions: <ul> <li> <code>f(x) = 2x^2</code>; this function has a quadratic growth rate, i.e. "<code>O(N^2)</code>"</li> <li> <code>g(x) = 10x</code>; this function has a linear growth rate, i.e. "<code>O(N)</code>"</li> </ul> Now let's plot the two functions together: <img src="https://i.stack.imgur.com/2IxL5.gif" alt="alt text"> Source: WolframAlpha: <code>plot 2x^2 and 10x for x from 0 to 10</code> Note that between <code>x=0..5</code>, <code>f(x) <= g(x)</code>, but for any larger <code>x</code>, <code>f(x)</code> quickly outgrows <code>g(x)</code>. Analogously, if A1 is a quadratic algorithm with a low overhead, and A2 is a linear algorithm with a high overhead, for smaller input, A1 may be faster than A2. Thus, you can, should you choose to do so, create a hybrid algorithm A3 which simply selects one of the two algorithms depending on the size of the input. Whether or not this is worth the effort depends on the actual parameters involved. Many tests and comparisons of sorting algorithms have been made, and it was decided that because insertion sort beats merge sort for small arrays, it was worth it to implement both for <code>Arrays.sort</code>.

Why does java.util.Arrays.sort(Object[]) use 2 kinds of sorting algorithms?

Tags:

java

algorithm

collections

sorting

I found that java.util.Arrays.sort(Object[]) use 2 kinds of sorting algorithms(in JDK 1.6).

pseudocode:

if(array.length<7)    insertionSort(array); else    mergeSort(array);

Why does it need 2 kinds of sorting here? for efficiency?

380

asked Aug 25 '10 14:08

卢声远 Shengyuan Lu

2 Answers

It's for speed. The overhead of mergeSort is high enough that for short arrays it would be slower than insertion sort.

answered Sep 19 '22 08:09

DJClayworth

It's important to note that an algorithm that is O(N log N) is not always faster in practice than an O(N^2) algorithm. It depends on the constants, and the range of N involved. (Remember that asymptotic notation measures relative growth rate, not absolute speed).

For small N, insertion sort in fact does beat merge sort. It's also faster for almost-sorted arrays.

Here's a quote:

Although it is one of the elementary sorting algorithms with O(N^2) worst-case time, insertion sort is the algorithm of choice either when the data is nearly sorted (because it is adaptive) or when the problem size is small (because it has low overhead).

For these reasons, and because it is also stable, insertion sort is often used as the recursive base case (when the problem size is small) for higher overhead divide-and-conquer sorting algorithms, such as merge sort or quick sort.

Here's another quote from Best sorting algorithm for nearly sorted lists paper:

straight insertion sort is best for small or very nearly sorted lists

What this means is that, in practice:

Some algorithm A₁ with higher asymptotic upper bound may be preferable than another known algorithm A₂ with lower asymptotic upper bound
- Perhaps A₂ is just too complicated to implement
- Or perhaps it doesn't matter in the range of N considered
  - See e.g. Coppersmith–Winograd algorithm
Some hybrid algorithms may adapt different algorithms depending on the input size

A numerical example

Let's consider these two functions:

f(x) = 2x^2; this function has a quadratic growth rate, i.e. "O(N^2)"
g(x) = 10x; this function has a linear growth rate, i.e. "O(N)"

Now let's plot the two functions together:

alt text
^{Source: WolframAlpha: plot 2x^2 and 10x for x from 0 to 10}

Note that between x=0..5, f(x) <= g(x), but for any larger x, f(x) quickly outgrows g(x).

Analogously, if A₁ is a quadratic algorithm with a low overhead, and A₂ is a linear algorithm with a high overhead, for smaller input, A₁ may be faster than A₂.

Thus, you can, should you choose to do so, create a hybrid algorithm A₃ which simply selects one of the two algorithms depending on the size of the input. Whether or not this is worth the effort depends on the actual parameters involved.

Many tests and comparisons of sorting algorithms have been made, and it was decided that because insertion sort beats merge sort for small arrays, it was worth it to implement both for Arrays.sort.

186

answered Sep 19 '22 08:09

polygenelubricants

Related questions
                            
                                R CMD javareconf not finding jni.h
                            
                                How to send data to COM PORT using JAVA? [duplicate]
                            
                                Find out what variable is throwing a NullPointerException programmatically
                            
                                Benefits of arrays
                            
                                How do I copy-protect my Java application? [closed]
                            
                                JavaFX: Undecorated Window
                            
                                Efficiency: switch statements over if statements
                            
                                Where can I find a list of the Java Standard libraries?
                            
                                Mockito mock objects returns null
                            
                                Handle mouse event anywhere with JavaFX
                            
                                ClassLoader getResourceAsStream returns null
                            
                                How does Double.intValue() work?
                            
                                JPA + Hibernate + Spring + OneToMany delete cascade
                            
                                How to run java program in command prompt,created by intellij
                            
                                NoClassDefFoundError: org/apache/commons/lang3/StringUtils
                            
                                Ignore specific nodes/attributes while comparing two JSONs
                            
                                Difference between doAfterTerminate and doFinally
                            
                                Can I use @Requestparam annotation for a Post request?
                            
                                Wait until tomcat finishes starting up
                            
                                Run exe which is packaged inside jar file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With