Given an array that might contain duplicates, how can we determine whether it is a sequence? E.g. {7, 3, 5, 4, 6, 2} is the sequence 2, 3, 4, 5, 6, 7. Sorting is an obvious solution. How can we do this in O(n) time and O(1) space?


Find if an array is a sequence in O(n) time and O(1) space [duplicate]

2 Answers

Assuming 1,2,3,3,4,1 is a valid unsorted sequence and 2,4,6,8 is a valid sequence (of step two) as well, but 1,3,5,9 isn't (7 is missing), and assuming the input array can be overwritten:

1. Determine the maximum and minimum: O(n) time, O(1) space. You can use the first and the last position of the array for this.
2. Determine the step. The step is the greatest common divisor of all a_n - min.
3. If they are too far apart (max - min > (count - 1) * step), this cannot be a sequence. Otherwise,
4. Do an in-place integer sort. Until start > end:
- look at the element at position start. Let the value there be v_0
- let its target position when we assume no duplicates ((v_0 - min) / step + start) be i
  - if the target position is less than start, it's a duplicate. Move it to the back and decrement the end pointer
  - if the target position is more than end, some element is missing in the sequence. Claim the array is not a sequence.
- if the element is at the target position, increment the start pointer and advance the min reference by one step
- else if the element at the target position is less than the reference minimum or equal to v_0, swap it to the end of the array and decrement the end pointer. It's a duplicate.
- else swap the element at the target position with v_0.
5. Claim the array is a sequence.
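
As a rough illustration of steps 1 and 2, here is a minimal sketch that reads the step as the greatest common divisor of the differences a_n - min (the reading under which 2,4,6,8 has step two). The function name `min_max_step` is made up for this example, and the input is assumed to be a non-empty list of integers.

```python
from math import gcd

def min_max_step(arr):
    """Return (min, max, step) for a non-empty list of integers.

    step is the greatest common divisor of all differences a_n - min.
    It comes out as 0 when every element is equal.
    """
    lo = min(arr)
    hi = max(arr)
    step = 0
    for value in arr:
        # gcd(0, x) == x, so 0 is a neutral starting value.
        step = gcd(step, value - lo)
    return lo, hi, step

print(min_max_step([2, 4, 6, 8]))        # (2, 8, 2)
print(min_max_step([1, 3, 5, 9]))        # (1, 9, 2)
print(min_max_step([1, 2, 3, 3, 4, 1]))  # (1, 4, 1)
```

Only three integers are kept besides the input, so this part stays within O(1) extra space (treating a gcd of machine-sized integers as a constant-time operation).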

The in-place integer sort is O(n). In each step it either:

- shortens the input array and keeps all sorted elements at their target positions or
- sorts one or two previously unsorted elements to their target position.

At the end of sorting, each element is either a duplicate in the duplicate block, or at its correct position in the sorted block.
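
The sorted-block / duplicate-block idea can be sketched roughly as follows. This is not the answer's exact procedure: it keeps the minimum fixed and computes each target index as (v - min) / step instead of moving the min reference, it takes the step as the gcd of the differences, and it treats an empty or all-equal array as a sequence (both are assumptions). The name `is_sequence_with_duplicates` is made up for the example.

```python
from math import gcd

def is_sequence_with_duplicates(arr):
    """Check, in place, whether arr is an arithmetic sequence once
    duplicates are ignored. Reorders arr as a side effect."""
    n = len(arr)
    if n == 0:
        return True  # assumption: an empty array counts as a sequence
    lo = min(arr)
    step = 0
    for v in arr:
        step = gcd(step, v - lo)
    if step == 0:
        return True  # assumption: all elements equal counts as a sequence

    start, end = 0, n - 1  # arr[:start] is sorted, arr[end+1:] holds duplicates
    while start <= end:
        v = arr[start]
        i = (v - lo) // step  # target index of v if there were no duplicates
        if i == start:
            start += 1  # already in place
        elif i > end:
            return False  # not enough room left: some value is missing
        elif i < start or arr[i] == v:
            # v is already present at its target, so this copy is a duplicate.
            arr[start], arr[end] = arr[end], arr[start]
            end -= 1
        else:
            # Put v at its target and examine the displaced element next.
            arr[start], arr[i] = arr[i], arr[start]
    return True

print(is_sequence_with_duplicates([1, 2, 3, 3, 4, 1]))  # True
print(is_sequence_with_duplicates([2, 4, 6, 8]))        # True
print(is_sequence_with_duplicates([1, 3, 5, 9]))        # False
```

Every pass through the loop either advances start, retreats end, or puts one more element at its target, which is where the linear bound comes from.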

Note that step #3 can be left out. #4 will correctly determine this is not a sequence, albeit slower.

If the step has to be 1, then this algorithm can be simplified somewhat (see revision #1)

answered Nov 02 '22 23:11

John Dvorak

This algorithm (Python) destroys the original array, but otherwise satisfies O(n) time and O(1) extra space.

# INPUT: An array 'arr' of N integers.
# OUTPUT: If the array consists exactly of the integers
#         S, S+1, ..., S+N-1, for some S, in any order,
#         then modifies 'arr' into a sorted array and returns it.
#         Otherwise, returns False, and 'arr' may have been modified.
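def sort_sequence(arr):
    the_min = min(arr)
    the_max = max(arr)
    # N distinct consecutive integers span exactly N - 1.
    if the_max - the_min != len(arr) - 1:
        return False
    # Shift the values into 0..N-1 so that each value is its own target index.
    for i in range(len(arr)):
        arr[i] -= the_min
    for i in range(len(arr)):
        # Keep sending the value at position i to its target until i holds i.
        while arr[i] != i:
            j = arr[i]
            t = arr[j]
            if t == j:
                # The target already holds j, so j occurs twice.
                return False
            arr[j] = j
            arr[i] = t
    # Undo the shift.
    for i in range(len(arr)):
        arr[i] += the_min
    return arr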

I have not formally proven that it works yet.

Why is this O(n)? In the nested loop, after an element is first put into its correct spot, it can only be visited one more time: either at the beginning of another inner loop, where it is seen to be in the right spot, or when it is found to be in the way of a duplicate element (the if t == j part).
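
To make the counting argument concrete, here is an instrumented copy of the routine, offered only as a sketch. The name `sort_sequence_counted` and the `moves` counter are additions for illustration; the placement logic is unchanged. It returns a pair (result, moves) for a non-empty input.

```python
def sort_sequence_counted(arr):
    """Same logic as sort_sequence, plus a count of placement moves."""
    the_min = min(arr)
    if max(arr) - the_min != len(arr) - 1:
        return False, 0
    for i in range(len(arr)):
        arr[i] -= the_min
    moves = 0
    for i in range(len(arr)):
        while arr[i] != i:
            j = arr[i]
            t = arr[j]
            if t == j:
                return False, moves  # duplicate found
            arr[j] = j   # j is now permanently in its final position
            arr[i] = t
            moves += 1   # one more element placed for good
    for i in range(len(arr)):
        arr[i] += the_min
    return arr, moves

result, moves = sort_sequence_counted([7, 3, 5, 4, 6, 2])
print(result, moves)                          # [2, 3, 4, 5, 6, 7] 2
print(sort_sequence_counted([1, 2, 2, 4]))    # (False, 0)
```

Each move fixes one position that is never written again, so `moves` cannot exceed the array length.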

answered Nov 03 '22 01:11

Ambroz Bizjak

Tags: language-agnostic, algorithm

Jayram
