find the start position of the longest sequence of 1's

Tags:

I want to find the start position of the longest sequence of 1's in my array:

a1=[0,0,1,1,1,1,0,0,1,1]
#2

I am following this answer to find the length of the longest sequence. However, I was not able to determine the position.

297

asked Jul 02 '16 15:07

5 Answers

Inspired by this solution, here's a vectorized approach to solve it -

# Get start, stop index pairs for islands/seq. of 1s
idx_pairs = np.where(np.diff(np.hstack(([False],a1==1,[False]))))[0].reshape(-1,2)

# Get the island lengths, whose argmax would give us the ID of longest island.
# Start index of that island would be the desired output
start_longest_seq = idx_pairs[np.diff(idx_pairs,axis=1).argmax(),0]

Sample run -

In [89]: a1 # Input array
Out[89]: array([0, 0, 1, 1, 1, 1, 0, 0, 1, 1])

In [90]: idx_pairs # Start, stop+1 index pairs
Out[90]: 
array([[ 2,  6],
       [ 8, 10]])

In [91]: np.diff(idx_pairs,axis=1) # Island lengths
Out[91]: 
array([[4],
       [2]])

In [92]: np.diff(idx_pairs,axis=1).argmax() # Longest island ID
Out[92]: 0

In [93]: idx_pairs[np.diff(idx_pairs,axis=1).argmax(),0] # Longest island start
Out[93]: 2

134

answered Oct 19 '22 19:10

A more compact one-liner using groupby(). Uses enumerate() on the raw data to keep the starting positions through the analysis pipeline, evenutally ending up with the list of tuples [(2, 4), (8, 2)] each tuple containing the starting position and length of non-zero runs:

from itertools import groupby

L = [0,0,1,1,1,1,0,0,1,1]

print max(((lambda y: (y[0][0], len(y)))(list(g)) for k, g in groupby(enumerate(L), lambda x: x[1]) if k), key=lambda z: z[1])[0]

lambda: x is the key function for groupby() since we enumerated L

lambda: y packages up results we need since we can only evaluate g once, without saving

lambda: z is the key function for max() to pull out the lengths

Prints '2' as expected.

answered Oct 19 '22 19:10

cdlane

You could use a for loop and check if the next few items (of length m where m is the max length) are the same as the maximum length:

# Using your list and the answer from the post you referred
from itertools import groupby
L = [0,0,1,1,1,1,0,0,1,1]
m = max(sum(1 for i in g) for k, g in groupby(L))
# Here is the for loop
for i, s in enumerate(L):
    if len(L) - i + 2 < len(L) - m:
        break
    if s == 1 and 0 not in L[i:i+m]:
        print i
        break

This will give:

answered Oct 19 '22 18:10

Moon Cheesez

Another way of doing in a single loop, but without resorting to itertool's groupby.

max_start = 0
max_reps = 0
start = 0
reps = 0
for (pos, val) in enumerate(a1):
    start = pos if reps == 0 else start
    reps = reps + 1 if val == 1 else 0
    max_reps = max(reps, max_reps)
    max_start = start if reps == max_reps else max_start

This could also be done in a one-liner fashion using reduce:

max_start = reduce(lambda (max_start, max_reps, start, reps), (pos, val): (start if reps == max(reps, max_reps) else max_start, max(reps, max_reps), pos if reps == 0 else start, reps + 1 if val == 1 else 0), enumerate(a1), (0, 0, 0, 0))[0]

In Python 3, you cannot unpack tuples inside the lambda arguments definition, so it's preferable to define the function using def first:

def func(acc, x):
    max_start, max_reps, start, reps = acc
    pos, val = x
    return (start if reps == max(reps, max_reps) else max_start,
            max(reps, max_reps),
            pos if reps == 0 else start,
            reps + 1 if val == 1 else 0)

max_start = reduce(func, enumerate(a1), (0, 0, 0, 0))[0]

In any of the three cases, max_start gives your answer (i.e. 2).

answered Oct 19 '22 19:10

Douglas Vieira

Related questions
                            
                                Py2app: Operation not permitted
                            
                                Why does the 'in' keyword claim it needs an iterable object?
                            
                                Run django application without django.contrib.admin
                            
                                Adding column(s) from one dataframe to another python pandas
                            
                                how to speed up NE recognition with stanford NER with python nltk
                            
                                How to test tensorflow cifar10 cnn tutorial model
                            
                                how to use matplotlib quiver scale
                            
                                Add multiple columns with zero values from a list to a Pandas data frame
                            
                                seaborn boxplots at desired distances along the x axis
                            
                                Reading in csv file as dataframe from hdfs
                            
                                Python mock object instantiation
                            
                                parallel processing in pandas python
                            
                                Is there a difference between setting a variable to None or deleting it? [duplicate]
                            
                                how to understand empty dimension in python numpy array?
                            
                                Use pdist() in python with a custom distance function defined by you
                            
                                PUT and DELETE Django
                            
                                Why are multiprocessing.sharedctypes assignments so slow?
                            
                                using decorators to persist python objects
                            
                                Python import CSV short code (pandas?) delimited with ';' and ',' in entires
                            
                                Numpy - check if elements of a array belong to another array

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

find the start position of the longest sequence of 1's

Tags:

python

numpy

scipy

MAS

People also ask

5 Answers

Divakar

Psidom

cdlane

Moon Cheesez

Douglas Vieira

Recent Activity

Donate For Us