Construct single numpy array from smaller arrays of different sizes

Tags: python, arrays, optimization, numpy

I have an array of values, x. Given 'start' and 'stop' indices, I need to construct an array y using sub-arrays of x.

import numpy as np
x = np.arange(20)
start = np.array([2, 8, 15])
stop = np.array([5, 10, 20])
nsubarray = len(start)

Where I would like y to be:

y = array([ 2,  3,  4,  8,  9, 15, 16, 17, 18, 19])

(In practice the arrays I am using are much larger).

One way to construct y is using a list comprehension, but the list needs to be flattened afterwards:

import itertools as it
y = [x[start[i]:stop[i]] for i in range(nsubarray)]
y = np.fromiter(it.chain.from_iterable(y), dtype=int)

I found that it is actually faster to use a for-loop:

y = np.empty(sum(stop - start), dtype = int)
a = 0
for i in range(nsubarray):
    b = a + stop[i] - start[i]
    y[a:b] = x[start[i]:stop[i]]
    a = b

I was wondering if anyone knows of a way that I can optimize this? Thank you very much!

EDIT

The following setup was used to time each of the approaches:

import numpy as np
import numpy.random as rd
import itertools as it


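def get_chunks(arr, start, stop):
    rng = stop - start
    keep = rng != 0        # drop zero-sized ranges from all three arrays
    start, stop, rng = start[keep], stop[keep], rng[keep]
    np.cumsum(rng, out=rng)                   # output offset where each chunk ends
    inds = np.ones(rng[-1], dtype=int)        # step of +1 inside a chunk
    inds[rng[:-1]] = start[1:]-stop[:-1]+1    # jump to the next chunk's start
    inds[0] = start[0]
    np.cumsum(inds, out=inds)                 # running sum turns steps into indices
    return np.take(arr, inds)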


def for_loop(arr, start, stop):
    y = np.empty(sum(stop - start), dtype = int)
    a = 0
    for i in range(len(start)):    # use the argument's length, not a global
        b = a + stop[i] - start[i]
        y[a:b] = arr[start[i]:stop[i]]
        a = b
    return y

xmax = 10**6    # integer, so x and the random index bounds are integer-typed
nsubarray = 100000
x = np.arange(xmax)
start = rd.randint(0, xmax - 10, nsubarray)
stop = start + 10

Which results in:

In [379]: %timeit np.hstack([x[i:j] for i,j in it.izip(start, stop)])
1 loops, best of 3: 410 ms per loop

In [380]: %timeit for_loop(x, start, stop)
1 loops, best of 3: 281 ms per loop

In [381]: %timeit np.concatenate([x[i:j] for i,j in it.izip(start, stop)])
10 loops, best of 3: 97.8 ms per loop

In [382]: %timeit get_chunks(x, start, stop)
100 loops, best of 3: 16.6 ms per loop

asked Mar 11 '14 13:03

turnerm

1 Answer

This is a bit complicated, but quite fast. The idea is to build the full index array with cumulative sums and then use np.take, avoiding Python-level loops entirely:

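def get_chunks(arr, start, stop):
    rng = stop - start
    keep = rng != 0        # drop zero-sized ranges from all three arrays
    start, stop, rng = start[keep], stop[keep], rng[keep]
    np.cumsum(rng, out=rng)                   # output offset where each chunk ends
    inds = np.ones(rng[-1], dtype=int)        # step of +1 inside a chunk
    inds[rng[:-1]] = start[1:]-stop[:-1]+1    # jump to the next chunk's start
    inds[0] = start[0]
    np.cumsum(inds, out=inds)                 # running sum turns steps into indices
    return np.take(arr, inds)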
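
To see how the index array comes together, here is a step-by-step trace on the small example from the question. It is only a sketch of the same idea; the names ends and steps are mine and do not appear in the function above.

```python
import numpy as np

x = np.arange(20)
start = np.array([2, 8, 15])
stop = np.array([5, 10, 20])

# Cumulative chunk lengths: the output position where each chunk ends
# (and therefore where the next one begins).
ends = np.cumsum(stop - start)                  # [3, 5, 10]

# Inside a chunk, consecutive source indices differ by +1.
steps = np.ones(ends[-1], dtype=int)

# At a chunk boundary, jump from the last index of the previous chunk
# (stop[i-1] - 1) to the first index of the next chunk (start[i]).
steps[ends[:-1]] = start[1:] - stop[:-1] + 1    # jumps of 4 and 6

# The very first source index is start[0].
steps[0] = start[0]

# A running sum of the steps gives the absolute source indices.
inds = np.cumsum(steps)
y = np.take(x, inds)

print(steps)   # [2 1 1 4 1 6 1 1 1 1]
print(y)       # [ 2  3  4  8  9 15 16 17 18 19]
```

The whole gather then costs two cumulative sums and one np.take, regardless of how many chunks there are.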

Check that it is returning the correct result:

xmax = 10**6    # integer, so x and the random index bounds are integer-typed
nsubarray = 100000
x = np.arange(xmax)
start = np.random.randint(0, xmax - 10, nsubarray)
stop = start + np.random.randint(1, 10, nsubarray)

old = np.concatenate([x[b:e] for b, e in zip(start, stop)])
new = get_chunks(x, start, stop)
np.allclose(old,new)
True

Some timings:

%timeit np.hstack([x[i:j] for i,j in zip(start, stop)])
1 loops, best of 3: 354 ms per loop

%timeit np.concatenate([x[b:e] for b, e in izip(start, stop)])
10 loops, best of 3: 119 ms per loop

%timeit get_chunks(x, start, stop)
100 loops, best of 3: 7.59 ms per loop
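
For comparison, the same index array can probably also be built with np.repeat instead of a second cumulative sum. The function below is a hypothetical variant, not part of the answer and not timed here; get_chunks_repeat is a name I made up, and it assumes stop >= start element-wise. Zero-length ranges need no special handling because they are repeated zero times.

```python
import numpy as np

def get_chunks_repeat(arr, start, stop):
    """Hypothetical variant: concatenate arr[start[i]:stop[i]] for all i."""
    lengths = stop - start
    total = int(lengths.sum())
    if total == 0:
        return arr[:0]                      # empty result, same dtype as arr
    # Output position at which each chunk begins.
    offsets = np.cumsum(lengths) - lengths
    # For every output position: (start of its chunk) - (offset of its chunk).
    shift = np.repeat(start - offsets, lengths)
    # Output position + shift = source index.
    return arr[np.arange(total) + shift]

x = np.arange(20)
start = np.array([2, 8, 8, 15])             # second range is empty
stop = np.array([5, 8, 10, 20])
result = get_chunks_repeat(x, start, stop)
expected = np.concatenate([x[b:e] for b, e in zip(start, stop)])
print(np.array_equal(result, expected))
```

Whether this is faster than the cumulative-sum version would need to be measured on the real data.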

answered Sep 20 '22 22:09

Daniel
