
Why is merge sort worst case run time O (n log n)?

Can someone explain this to me in simple English, or in an easy-to-follow way?

asked Oct 18 '11 by adit


People also ask

What is the worst case of merge sort?

In short, the worst case of Merge Sort is when the left and right arrays have alternating elements. This results in the maximum number of comparisons, and the time complexity stays at O(N log N).
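To see this concretely, here is a small counting sketch (assuming Python; merge_count is a hypothetical helper, not from any answer here) comparing alternating halves with non-overlapping halves:

# Sketch: count the comparisons made while merging two sorted halves.
def merge_count(a, b):
    i = j = comparisons = 0
    out = []
    while i < len(a) and j < len(b):
        comparisons += 1
        if a[i] <= b[j]:
            out.append(a[i]); i += 1
        else:
            out.append(b[j]); j += 1
    out.extend(a[i:]); out.extend(b[j:])
    return out, comparisons

# Alternating halves: every element except the last needs a comparison.
print(merge_count([1, 3, 5, 7], [2, 4, 6, 8])[1])   # 7 comparisons (N - 1)
# Non-overlapping halves: one half is exhausted after N/2 comparisons.
print(merge_count([1, 2, 3, 4], [5, 6, 7, 8])[1])   # 4 comparisons (N / 2)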

What is the runtime of merge sort and why?

Merge Sort is a stable sort, which means that equal elements in an array maintain their original positions with respect to each other. The overall time complexity of Merge Sort is O(n log n). It is efficient because even in the worst case the runtime is O(n log n). The space complexity of Merge Sort is O(n).
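A quick sketch of where that stability comes from (assuming Python; illustrative names): taking from the left half on ties keeps equal keys in their original order:

# Stability depends on taking from the left half on ties (a[i][0] <= b[j][0]).
# Records are (key, label) pairs; the labels show the original order of equal keys.
def merge(a, b):
    i = j = 0
    out = []
    while i < len(a) and j < len(b):
        if a[i][0] <= b[j][0]:      # <= keeps the left element first on ties
            out.append(a[i]); i += 1
        else:
            out.append(b[j]); j += 1
    return out + a[i:] + b[j:]

print(merge([(1, 'first'), (2, 'first')], [(1, 'second'), (2, 'second')]))
# [(1, 'first'), (1, 'second'), (2, 'first'), (2, 'second')] -- ties keep order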

Why is the complexity of merge sort O nLogn?

Merge Sort splits the array in half and takes linear time to merge the two halves, so its time complexity is O(n log n) in all three scenarios (worst, average, and best). It divides the input array in half, makes a recursive call for each half, and then merges the two sorted halves.
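As a rough illustration (a minimal sketch in Python; the names are illustrative), that structure — split, two recursive calls, linear-time merge — looks like this:

def merge_sort(xs):
    # Base case: a list of 0 or 1 elements is already sorted.
    if len(xs) <= 1:
        return xs
    mid = len(xs) // 2
    left = merge_sort(xs[:mid])     # first recursive call
    right = merge_sort(xs[mid:])    # second recursive call
    # Merge the two sorted halves in linear time.
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            out.append(left[i]); i += 1
        else:
            out.append(right[j]); j += 1
    return out + left[i:] + right[j:]

print(merge_sort([5, 2, 8, 1, 9, 3]))   # [1, 2, 3, 5, 8, 9]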

What is the running time of the merge operation in merge sort?

Merge Sort is quite fast, with a time complexity of O(n log n). It is also a stable sort, which means that "equal" elements keep their relative order in the sorted list. In this section we will see why the running time of merge sort is O(n log n).


2 Answers

Merge Sort uses the Divide-and-Conquer approach to solve the sorting problem. First, it divides the input in half using recursion. After dividing, it sorts the halves and merges them into one sorted output. See the figure.

[figure: MergeSort recursion tree]

This means it is cheaper to sort each half of the problem first and then run a simple merge subroutine. So it is important to know the complexity of the merge subroutine and how many times it will be called in the recursion.

The pseudo-code for the merge sort is really simple.

# C = output array [length = N]
# A = 1st sorted half [length = N/2]
# B = 2nd sorted half [length = N/2]
# (checks for one half running out of elements are omitted for simplicity)
i = j = 1
for k = 1 to N
    if A[i] < B[j]
        C[k] = A[i]
        i++
    else
        C[k] = B[j]
        j++

It is easy to see that each iteration of the loop performs 4 operations: k++, i++ or j++, the if comparison, and the assignment C[k] = A[i] or C[k] = B[j]. Counting the two initializations, you have at most 4N + 2 operations, giving O(N) complexity. For the sake of the proof, 4N + 2 will be bounded by 6N, which holds for every N >= 1 (4N + 2 <= 6N).
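For reference, here is a runnable version of that merge subroutine (a sketch in Python; unlike the pseudocode, it includes the boundary checks for when one half runs out — this only adds a constant factor, so the O(N) bound is unchanged):

def merge(a, b):
    # a and b are sorted halves; c is the merged output.
    c, i, j = [], 0, 0
    while i < len(a) and j < len(b):
        if a[i] < b[j]:
            c.append(a[i]); i += 1
        else:
            c.append(b[j]); j += 1
    # Boundary handling: copy whatever remains of the unfinished half.
    return c + a[i:] + b[j:]

print(merge([1, 3, 5], [2, 4, 6]))   # [1, 2, 3, 4, 5, 6]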

Now assume you have an input with N elements, and assume N is a power of 2. At every level of the recursion there are twice as many subproblems as at the level above, each with an input half as large. This means that at level j = 0, 1, 2, ..., lg N there are 2^j subproblems, each with an input of length N / 2^j. The number of operations at level j is then at most

2^j * 6(N / 2^j) = 6N

Observe that no matter the level, you always have at most 6N operations.

Since there are lg N + 1 levels, the total complexity is

O(6N * (lg N + 1)) = O(6N lg N + 6N) = O(N lg N)
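As a sanity check (a hypothetical sketch in Python; the per-iteration operation counting is approximate and mirrors the 4-operations argument above), you can count the merge work empirically and compare it against the 6N(lg N + 1) bound:

# Count the basic merge operations and check them against 6N(lg N + 1).
import math

ops = 0

def merge_sort(xs):
    global ops
    if len(xs) <= 1:
        return xs
    mid = len(xs) // 2
    left, right = merge_sort(xs[:mid]), merge_sort(xs[mid:])
    out, i, j = [], 0, 0
    while i < len(left) and j < len(right):
        ops += 4                     # compare, copy, i++ or j++, k++
        if left[i] <= right[j]:
            out.append(left[i]); i += 1
        else:
            out.append(right[j]); j += 1
    ops += len(left) - i + len(right) - j   # copying the leftover tail
    return out + left[i:] + right[j:]

N = 1024                             # a power of 2, as in the proof
merge_sort(list(range(N, 0, -1)))
print(ops, "<=", 6 * N * (math.log2(N) + 1))   # the bound holds comfortably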

References:

  • Coursera course Algorithms: Design and Analysis, Part 1
answered Oct 14 '22 by Davi Sampaio


On a "traditional" merge sort, each pass through the data doubles the size of the sorted subsections. After the first pass, the file will be sorted into sections of length two. After the second pass, length four. Then eight, sixteen, etc. up to the size of the file.

It's necessary to keep doubling the size of the sorted sections until there's one section comprising the whole file. It will take lg(N) doublings of the section size to reach the file size, and each pass of the data will take time proportional to the number of records. With lg(N) passes of O(N) work each, the total time is proportional to N lg(N). A sketch of this pass-based variant appears below.
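Here is a sketch of that pass-based (bottom-up) variant (assuming Python; the names are illustrative): each pass merges runs of width 1, 2, 4, ... until one run covers the whole array:

def merge_sort_bottom_up(xs):
    xs = list(xs)
    n = len(xs)
    width = 1
    # lg(N) passes: the sorted run width doubles each time (1, 2, 4, ...).
    while width < n:
        for lo in range(0, n, 2 * width):
            mid = min(lo + width, n)
            hi = min(lo + 2 * width, n)
            xs[lo:hi] = merge(xs[lo:mid], xs[mid:hi])
        width *= 2
    return xs

def merge(a, b):
    # Standard linear-time merge of two sorted lists.
    out, i, j = [], 0, 0
    while i < len(a) and j < len(b):
        if a[i] <= b[j]:
            out.append(a[i]); i += 1
        else:
            out.append(b[j]); j += 1
    return out + a[i:] + b[j:]

print(merge_sort_bottom_up([5, 2, 8, 1, 9, 3, 7]))   # [1, 2, 3, 5, 7, 8, 9]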

answered Oct 14 '22 by supercat