I use Python with <code>numpy</code>. I have a numpy array <code>may_a</code>: <pre class="prettyprint"><code>may_a = numpy.array([False, True, False, True, True, False, True, False, True, True, False]) </code></pre> I have a numpy array <code>may_b</code>: <pre class="prettyprint"><code>may_b = numpy.array([False,True,True,False]) </code></pre> I need to find array <code>may_b</code> in array <code>may_a</code>. In the output I need to get indexes of occurrences. <pre class="prettyprint"><code>out_index=[2,7] </code></pre> Can someone please suggest, how do I get <code>out_index</code>?

EDIT The following code does allow to perform a convolution based check of equality. It maps <code>True</code> to <code>1</code> and <code>False</code> to <code>-1</code>. It also reverses <code>b</code>, which is needed for it to work properly: <pre class="prettyprint"><code>def search(a, b) : return np.where(np.round(fftconvolve(a * 2 - 1, (b * 2 - 1)[::-1], mode='valid') - len(b)) == 0)[0] </code></pre> I have checked that it gives the same output as the <code>as_strided</code> method for a variety of random inputs, which it does. I have also timed both approached, and convolution only starts paying off with largish search tokens of around 256 items. <hr> It seems like a little overkill, but with boolean data you can use (abuse?) convolution: <pre class="prettyprint"><code>In [8]: np.where(np.convolve(may_a, may_b.astype(int), ...: mode='valid') == may_b.sum())[0] Out[8]: array([2, 7]) </code></pre> For larger datasets it may be faster to go with <code>scipy.signal.fftconvolve</code>: <pre class="prettyprint"><code>In [13]: np.where(scipy.signal.fftconvolve(may_a, may_b, ....: mode='valid') == may_b.sum())[0] Out[13]: array([2, 7]) </code></pre> You have to be careful though, because the output now is floating point, and rounding may spoil the equality check: <pre class="prettyprint"><code>In [14]: scipy.signal.fftconvolve(may_a, may_b, mode='valid') Out[14]: array([ 1., 1., 2., 1., 1., 1., 1., 2.]) </code></pre> So you may be better off with something along the lines of: <pre class="prettyprint"><code>In [15]: np.where(np.round(scipy.signal.fftconvolve(may_a, may_b, mode='valid') - ....: may_b.sum()) == 0)[0] Out[15]: array([2, 7]) </code></pre>

Return the indexes of a sub-array in an array

Tags:

python

arrays

numpy

I use Python with numpy.

I have a numpy array may_a:

Click to copy

may_a = numpy.array([False, True, False, True, True, False, True, False, True, True, False])

I have a numpy array may_b:

Click to copy

may_b = numpy.array([False,True,True,False])

I need to find array may_b in array may_a.

In the output I need to get indexes of occurrences.

Click to copy

out_index=[2,7]

Can someone please suggest, how do I get out_index?

434

asked Feb 15 '13 07:02

Olga

1 Answers

EDIT The following code does allow to perform a convolution based check of equality. It maps True to 1 and False to -1. It also reverses b, which is needed for it to work properly:

Click to copy

def search(a, b) :
    return np.where(np.round(fftconvolve(a * 2 - 1, (b * 2 - 1)[::-1],
                                         mode='valid') - len(b)) == 0)[0]

I have checked that it gives the same output as the as_strided method for a variety of random inputs, which it does. I have also timed both approached, and convolution only starts paying off with largish search tokens of around 256 items.

It seems like a little overkill, but with boolean data you can use (abuse?) convolution:

Click to copy

In [8]: np.where(np.convolve(may_a, may_b.astype(int),
   ...:                      mode='valid') == may_b.sum())[0]
Out[8]: array([2, 7])

For larger datasets it may be faster to go with scipy.signal.fftconvolve:

Click to copy

In [13]: np.where(scipy.signal.fftconvolve(may_a, may_b,
   ....:                                   mode='valid') == may_b.sum())[0]
Out[13]: array([2, 7])

You have to be careful though, because the output now is floating point, and rounding may spoil the equality check:

Click to copy

In [14]: scipy.signal.fftconvolve(may_a, may_b, mode='valid')
Out[14]: array([ 1.,  1.,  2.,  1.,  1.,  1.,  1.,  2.])

So you may be better off with something along the lines of:

Click to copy

In [15]: np.where(np.round(scipy.signal.fftconvolve(may_a, may_b, mode='valid') -
   ....:                   may_b.sum()) == 0)[0]
Out[15]: array([2, 7])

149

answered Sep 20 '22 02:09

Jaime

Related questions
                            
                                How to remove ^M from a text file and replace it with the next line
                            
                                How to query for distinct results in mongodb with python?
                            
                                Which Regular Expression flavour is used in Python?
                            
                                Which to use: OneToOne vs ForeignKey?
                            
                                Nicer way to iterate to dictionary in python to avoid many nested for loops
                            
                                unresolved import in python opencv samples
                            
                                highest palindrome with 3 digit numbers in python
                            
                                Running scipy's oneway anova in a script
                            
                                File upload with Tornado
                            
                                Returning the highest 6 names in a List of tuple in Python
                            
                                How to store itertools.chain and use it more than once?
                            
                                Python Cubes OLAP Framework - how to work with joins?
                            
                                Why are Python builds suddenly not Framework builds when using virtualenv?
                            
                                Overlay polygon on top of image in Python
                            
                                Cannot open ".mp4" video files using OpenCV 2.4.3, Python 2.7 in Windows 7 machine
                            
                                Perform simple math on regular expression output? (Python)
                            
                                Python: process image and save to file stream
                            
                                Django DateTimeField() and timezone.now()
                            
                                Delete letters from string
                            
                                Django1.4: How to use order_by in template?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Return the indexes of a sub-array in an array

Tags:

python

arrays

numpy

Olga

People also ask

1 Answers

Jaime

Recent Activity

Donate For Us