
Count number of swaps to sort first k-smallest element using a bubble sort like algorithm

Tags:

algorithm

sorting

data-structures

Given an array a and an integer k, someone uses the following algorithm to get the first k smallest elements:

cnt = 0
for i in [1, k]:
    for j in [i + 1, n]:
        if a[i] > a[j]:
            swap(a[i], a[j])
            cnt = cnt + 1

The problem is: how can the value of cnt (once we have the final k-sorted array), i.e. the number of swaps, be calculated in O(n log n) or better?

Or simply put: calculate the number of swaps needed to get the first k smallest numbers sorted using the above algorithm, in O(n log n) or better.

I am thinking about a binary search tree, but I get confused (how will the array change as i increases? How do I calculate the number of swaps for a fixed i? ...).

867

asked Aug 17 '14 07:08

make_lover

1 Answer

This is a very good question: it involves inverse pairs, a stack and some proof techniques.

Note 1: All indices used below are 1-based, instead of the traditional 0-based.

Note 2: If you want to see the algorithm directly, please start reading from the bottom.

First we define Inverse Pairs as:

For a[i] and a[j], in which i < j holds, if we have a[i] > a[j], then a[i] and a[j] are called an Inverse Pair.

For example, in the following array:

3 2 1 5 4

a[1] and a[2] form an inverse pair, and a[2] and a[3] form another.

Before we start the analysis, let's define a common language: in the rest of the post, "inverse pair starting from i" means the total number of inverse pairs (a[i], a[j]) with j > i, i.e. the number of elements to the right of a[i] that are smaller than a[i].

For example, for a = {3, 1, 2}, inverse pair starting from 1 is 2, and inverse pair starting from 2 is 0.
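To make the definition concrete, here is a minimal sketch that counts "inverse pairs starting from i" directly from the definition. The function name inverse_pairs_from is only an illustrative choice, and the code uses 0-based Python lists, so position 1 in the text is index 0 here. It is quadratic and only meant for checking small examples by hand.

```python
def inverse_pairs_from(a):
    """Return ip, where ip[i] is the number of j > i with a[j] < a[i].

    Straight from the definition, O(n^2); indices are 0-based here.
    """
    n = len(a)
    return [sum(1 for j in range(i + 1, n) if a[j] < a[i]) for i in range(n)]


print(inverse_pairs_from([3, 1, 2]))        # [2, 0, 0]
print(inverse_pairs_from([3, 2, 1, 5, 4]))  # [2, 1, 0, 1, 0]
```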

Now let's look at some facts:

1. If we have i < j < k with a[i] > a[k] and a[j] > a[k], swapping a[i] and a[j] (if they are an inverse pair) won't affect the total number of inverse pairs starting from j;
2. The total number of inverse pairs starting from i may change after a swap (e.g. suppose we have a = {5, 3, 4}: before a[1] is swapped with a[2], the number of inverse pairs starting from 1 is 2, but after the swap the array becomes a = {3, 5, 4} and the number of inverse pairs starting from 1 becomes 0);
3. Given an array A and two numbers a and b, each taken in turn as the head element placed in front of A: if a forms more inverse pairs with A than b does, then a > b;
4. Denote the total number of inverse pairs starting from i as ip[i]. If k is the minimum number satisfying ip[i] > ip[i + k], then a[i] > a[i + k] while a[i] < a[i + 1 .. i + k - 1] must be true. In words, if ip[i + k] is the first number smaller than ip[i], then a[i + k] is also the first number smaller than a[i].

Proof of point 1:

By the definition of an inverse pair, for every a[k] with k > j that forms an inverse pair with a[j], a[k] < a[j] must hold. Since a[i] and a[j] are an inverse pair and i < j, we have a[i] > a[j]. Therefore a[i] > a[j] > a[k], which shows that the inverse-pair relationships are not broken.

Proof of point 3:

Omitted, since it is fairly obvious.

Proof of point 4:

First, it's easy to see that when i < j and a[i] > a[j], we have ip[i] >= ip[j] + 1 > ip[j]. Its contrapositive is therefore also true: when i < j and ip[i] <= ip[j], we have a[i] <= a[j].

Now back to the point. Since k is the minimum number satisfying ip[i] > ip[i + k], we have ip[i] <= ip[i + 1 .. i + k - 1], which gives a[i] <= a[i + 1 .. i + k - 1] by the lemma we just proved. This means a[i] forms no inverse pair with any element in the region [i + 1, i + k - 1], so ip[i] equals the number of inverse pairs that a[i] forms with elements in [i + k, n]. Given ip[i + k] < ip[i], we know a[i + k] has fewer inverse pairs than a[i] in the region [i + k + 1, n], which implies a[i + k] < a[i] (by point 3).

You can write down some sequences and try out the 4 facts mentioned above and convince yourself or disprove them :P
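One way to try out point 4 mechanically is sketched below: for every permutation of up to 6 distinct values, it compares the index of the first smaller entry to the right in ip with the index of the first smaller entry to the right in a. The helper names (inverse_pairs_from, first_smaller_to_right, fact4_holds, check_all) and the size limit are assumptions made for this illustration, and indices are 0-based.

```python
from itertools import permutations


def inverse_pairs_from(a):
    """ip[i] = number of j > i with a[j] < a[i] (definition, O(n^2))."""
    n = len(a)
    return [sum(1 for j in range(i + 1, n) if a[j] < a[i]) for i in range(n)]


def first_smaller_to_right(v):
    """For each i, the smallest j > i with v[j] < v[i], or None if there is none."""
    n = len(v)
    return [next((j for j in range(i + 1, n) if v[j] < v[i]), None) for i in range(n)]


def fact4_holds(a):
    """Point 4: the first smaller entry sits at the same index in ip and in a."""
    return first_smaller_to_right(inverse_pairs_from(a)) == first_smaller_to_right(a)


def check_all(max_n=6):
    """Exhaustively test point 4 on all permutations of 1..max_n distinct values."""
    return all(
        fact4_holds(list(p))
        for n in range(1, max_n + 1)
        for p in permutations(range(n))
    )


print(check_all())
```

This only exercises point 4 on small inputs; it says nothing about how earlier swaps change the array, which points 1 and 2 are about.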

Now for the algorithm.

A naive implementation will take O(nk) to compute the result, and the worst case will be O(n^2) when k = n.
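For reference, the naive approach is just the loop from the question run as written. The sketch below is a direct Python transcription (0-based indices, working on a copy so the caller's list is untouched); the name count_swaps_naive is an illustrative choice. It is useful as a ground truth when testing any faster method.

```python
def count_swaps_naive(a, k):
    """Simulate the question's loop and return cnt. O(n * k) time."""
    a = list(a)  # work on a copy
    n = len(a)
    cnt = 0
    for i in range(k):
        for j in range(i + 1, n):
            if a[i] > a[j]:
                a[i], a[j] = a[j], a[i]
                cnt += 1
    return cnt


print(count_swaps_naive([2, 5, 3, 4, 1, 6], 2))  # 3
```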

But we can do better by making use of the facts above:

First we compute ip[i] using a Fenwick tree (see Note 1 below), which takes O(n log n) to construct and O(n log n) to get all ip[i] calculated.
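A minimal sketch of this step, assuming the values are first compressed to ranks 1..m so they can index the tree: scan from right to left, query how many already-seen values have a smaller rank, then insert the current value. The function and helper names are illustrative, and indices are 0-based.

```python
def inverse_pairs_from(a):
    """ip[i] = number of j > i with a[j] < a[i], in O(n log n) via a Fenwick tree."""
    # Coordinate compression: map each distinct value to a rank in 1..m.
    ranks = {v: r for r, v in enumerate(sorted(set(a)), start=1)}
    m = len(ranks)
    tree = [0] * (m + 1)  # 1-based Fenwick tree of counts per rank

    def add(pos):
        # Record one occurrence of rank `pos`.
        while pos <= m:
            tree[pos] += 1
            pos += pos & -pos

    def prefix_sum(pos):
        # Number of recorded values with rank <= pos.
        total = 0
        while pos > 0:
            total += tree[pos]
            pos -= pos & -pos
        return total

    ip = [0] * len(a)
    for i in range(len(a) - 1, -1, -1):
        r = ranks[a[i]]
        ip[i] = prefix_sum(r - 1)  # values to the right that are strictly smaller
        add(r)
    return ip


print(inverse_pairs_from([2, 5, 3, 4, 1, 6]))  # [1, 3, 1, 1, 0, 0]
```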

Next, we put the facts to use. Since swapping two numbers only affects the inverse-pair count of the current position and not the values after it (points 1 and 2), we don't need to worry about the values changing. Also, since the nearest smaller number to the right sits at the same index in ip and in a (point 4), we only need to find the first ip[j] that is smaller than ip[i] in [i + 1, n]. If we denote by f[i] the number of swaps performed while processing position i, we have f[i] = f[j] + 1, and f[i] = 0 when no such j exists. The answer is then f[1] + f[2] + ... + f[k].

But how do we find this "first smaller number" fast? Use a stack! Here is a post which asks a highly similar problem: Given an array A, compute B s.t. B[i] stores the nearest element to the left of A[i] which is smaller than A[i]

In short, we are able to do this in O(n).

But wait, the post says "to the left" but in our case it's "to the right". The solution is simple: we scan backward in our case, and then everything is the same :D
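The backward scan with a stack can be sketched as follows. The stack holds indices whose values strictly increase from bottom to top; anything not smaller than the current value is popped, and whatever remains on top is the nearest strictly smaller entry to the right. The name next_smaller_index is an illustrative choice and indices are 0-based.

```python
def next_smaller_index(v):
    """For each i, the nearest j > i with v[j] < v[i], or None. O(n) overall."""
    n = len(v)
    result = [None] * n
    stack = []  # indices to the right of i; values strictly increase bottom to top
    for i in range(n - 1, -1, -1):
        # Entries that are not strictly smaller can never be the answer for i
        # or for anything further left, because v[i] is closer and no larger.
        while stack and v[stack[-1]] >= v[i]:
            stack.pop()
        result[i] = stack[-1] if stack else None
        stack.append(i)
    return result


print(next_smaller_index([1, 3, 1, 1, 0, 0]))  # [4, 2, 4, 4, None, None]
```

Each index is pushed once and popped at most once, which is where the O(n) bound comes from.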

Therefore, in summary, the total time complexity of the algorithm is O(n log n) + O(n) = O(n log n).

Finally, let's walk through an example (a simplified version of @make_lover's example in the comments):

a = {2, 5, 3, 4, 1, 6}, k = 2

First, let's get the inverse pairs:

ip = {1, 3, 1, 1, 0, 0}

To calculate f[i], we go backward (since we need to use the stack technique):

f[6] = 0, since it's the last one
f[5] = 0, since we could not find any number that is smaller than 0
f[4] = f[5] + 1 = 1, since ip[5] is the first smaller number to the right
f[3] = f[5] + 1 = 1, since ip[5] is the first smaller number to the right
f[2] = f[3] + 1 = 2, since ip[3] is the first smaller number to the right
f[1] = f[5] + 1 = 1, since ip[5] is the first smaller number to the right

Therefore, ans = f[1] + f[2] = 3
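The walkthrough above can be reproduced with the short sketch below, which follows the recurrence as described: compute ip, find the first smaller ip[j] to the right with a backward stack scan, set f[i] = f[j] + 1, and sum the first k values. The function names are assumptions for illustration, indices are 0-based, and ip is computed quadratically here only to keep the block short (a Fenwick tree would replace that line). Treat it as a sketch of the recurrence as stated rather than a verified solution: it is worth cross-checking against a direct simulation of the question's loop, since for a = {3, 1, 2}, k = 2 the simulation performs 2 swaps while this recurrence gives 1.

```python
def swaps_per_position(a):
    """f[i] per the recurrence f[i] = f[j] + 1, j = first index right of i with ip[j] < ip[i]."""
    n = len(a)
    # ip[i]: number of elements to the right of i that are smaller than a[i].
    ip = [sum(1 for j in range(i + 1, n) if a[j] < a[i]) for i in range(n)]
    f = [0] * n
    stack = []  # indices to the right; ip values strictly increase bottom to top
    for i in range(n - 1, -1, -1):
        while stack and ip[stack[-1]] >= ip[i]:
            stack.pop()
        if stack:  # stack[-1] is the first smaller ip to the right
            f[i] = f[stack[-1]] + 1
        stack.append(i)
    return f


def count_swaps_by_recurrence(a, k):
    """Sum of f over the first k positions."""
    return sum(swaps_per_position(a)[:k])


f = swaps_per_position([2, 5, 3, 4, 1, 6])
print(f)                                                 # [1, 2, 1, 1, 0, 0]
print(count_swaps_by_recurrence([2, 5, 3, 4, 1, 6], 2))  # 3
```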

Note 1: Using a Fenwick tree (Binary Indexed Tree), the inverse pairs can be computed in O(n log n); here is a post on this topic, please have a look :)

Update

Aug/20/2014: There was a critical error in my previous post (thanks to @make_lover), here is the latest update.

158

answered Oct 14 '22 06:10

nevets
