Fast numpy fancy indexing

Tags: python, slice, indexing, numpy

My code for slicing a numpy array (via fancy indexing) is very slow. It is currently a bottleneck in my program.

```
a.shape
(3218, 6)

ts = time.time(); a[rows][:, cols]; te = time.time(); print('%.8f' % (te-ts));
0.00200009
```

What is the correct numpy call to get an array consisting of the subset of rows 'rows' and columns 'cols' of the matrix a? (In fact, I need the transpose of this result.)


asked Jan 17 '13 19:01

Oren

2 Answers

Let me try to summarize the excellent answers by Jaime and TheodrosZelleke and mix in some comments.

1. Advanced (fancy) indexing always returns a copy, never a view.
2. `a[rows][:,cols]` performs two fancy indexing operations, so an intermediate copy `a[rows]` is created and discarded. Handy and readable, but not very efficient. Also beware that `[:,cols]` usually generates a Fortran-contiguous copy from a C-contiguous source.
3. `a[rows.reshape(-1,1),cols]` is a single advanced indexing expression; it relies on the fact that `rows.reshape(-1,1)` and `cols` are broadcast to the shape of the intended result.
4. A common experience is that indexing into a flattened array can be more efficient than fancy indexing, so another approach is
   ```
   indx = rows.reshape(-1,1)*a.shape[1] + cols
   a.take(indx)
   ```
   or
   ```
   a.take(indx.flat).reshape(rows.size,cols.size)
   ```
5. Efficiency will depend on memory access patterns and on whether the starting array is C-contiguous or Fortran-contiguous, so experimentation is needed.
6. Use fancy indexing only if really needed: basic slicing `a[rstart:rstop:rstep, cstart:cstop:cstep]` returns a view (although not a contiguous one) and should be faster!
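
As a minimal sketch of the points above, the following compares the two-step indexing from the question with the single-step alternatives and checks that they agree. The array contents and index values are made up for illustration. `np.ix_` is not mentioned in the answers; it is a NumPy helper that builds the same broadcastable index pair as `rows.reshape(-1,1), cols`.

```python
import numpy as np

# Illustrative data; the shapes loosely follow the question.
rng = np.random.default_rng(0)
a = rng.standard_normal((3218, 6))
rows = rng.integers(0, a.shape[0], size=2000)
cols = np.array([1, 3, 4, 5])

# Two fancy-indexing steps: a[rows] is copied first, then indexed again.
two_step = a[rows][:, cols]

# One fancy-indexing step: the (2000, 1) and (4,) index arrays
# broadcast to the (2000, 4) result shape.
one_step = a[rows.reshape(-1, 1), cols]

# np.ix_ builds the same broadcastable pair of index arrays.
via_ix = a[np.ix_(rows, cols)]

# Flat indices into the row-major flattened array, used with take().
indx = rows.reshape(-1, 1) * a.shape[1] + cols
via_take = a.take(indx)

assert np.array_equal(two_step, one_step)
assert np.array_equal(two_step, via_ix)
assert np.array_equal(two_step, via_take)

# The question asks for the transpose: .T of the result is a view,
# so it does not add another copy.
result_t = one_step.T
assert result_t.shape == (cols.size, rows.size)
assert np.shares_memory(result_t, one_step)

# Basic slicing returns a view of `a`; fancy indexing returns a copy.
assert np.shares_memory(a[10:20:2, 1:5], a)
assert not np.shares_memory(one_step, a)
```

Which variant is fastest depends on the array layout and sizes, so it is worth timing each one on the real data.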


answered Sep 17 '22 17:09


To my surprise, this rather lengthy expression, which first computes linear 1D indices, is more than 50% faster than the consecutive array indexing presented in the question:

```
(a.ravel()[(
   cols + (rows * a.shape[1]).reshape((-1,1))
   ).ravel()]).reshape(rows.size, cols.size)
```
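
To see why the expression works: in a row-major (C-order) array with `ncols` columns, element `(r, c)` sits at position `r * ncols + c` of the flattened array. Here is a small sketch with made-up values; the 3x4 array and the index arrays are illustrative only.

```python
import numpy as np

a = np.arange(12).reshape(3, 4)   # a[r, c] == r * 4 + c by construction
rows = np.array([2, 0])
cols = np.array([1, 3])

# A column vector of row offsets plus a row vector of column indices
# broadcasts to a (rows.size, cols.size) grid of flat indices.
flat_idx = (rows * a.shape[1]).reshape(-1, 1) + cols
print(flat_idx)
# [[ 9 11]
#  [ 1  3]]

# Index the flattened array once, then restore the 2D shape.
result = a.ravel()[flat_idx.ravel()].reshape(rows.size, cols.size)
print(result)
# [[ 9 11]
#  [ 1  3]]
```

Note that `ravel()` avoids a copy only when the array is already C-contiguous; otherwise it copies the whole array first, which may cancel the gain.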

UPDATE: The OP updated the description of the shape of the initial array. With the updated size, the run time drops by more than 99% (roughly 120 times faster):

```
In [93]: a = np.random.randn(3218, 1415)

In [94]: rows = np.random.randint(a.shape[0], size=2000)

In [95]: cols = np.random.randint(a.shape[1], size=6)

In [96]: timeit a[rows][:, cols]
10 loops, best of 3: 186 ms per loop

In [97]: timeit (a.ravel()[(cols + (rows * a.shape[1]).reshape((-1,1))).ravel()]).reshape(rows.size, cols.size)
1000 loops, best of 3: 1.56 ms per loop
```

INITIAL ANSWER: Here is the transcript:

```
In [79]: a = np.random.randn(3218, 6)
In [80]: a.shape
Out[80]: (3218, 6)

In [81]: rows = np.random.randint(a.shape[0], size=2000)
In [82]: cols = np.array([1,3,4,5])
```

Time method 1:

```
In [83]: timeit a[rows][:, cols]
1000 loops, best of 3: 1.26 ms per loop
```

Time method 2:

```
In [84]: timeit (a.ravel()[(cols + (rows * a.shape[1]).reshape((-1,1))).ravel()]).reshape(rows.size, cols.size)
1000 loops, best of 3: 568 us per loop
```

Check that results are actually the same:

```
In [85]: result1 = a[rows][:, cols]
In [86]: result2 = (a.ravel()[(cols + (rows * a.shape[1]).reshape((-1,1))).ravel()]).reshape(rows.size, cols.size)

In [87]: np.sum(result1 - result2)
Out[87]: 0.0
```
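
A sum of differences can be zero even when individual elements differ, because positive and negative errors may cancel, so an element-wise comparison is a safer check. Below is a hedged sketch using `np.array_equal` and the standard-library `timeit` module. The helper names `method1` and `method2` are made up here, and no timings are shown because they depend on the machine and the NumPy version.

```python
import timeit
import numpy as np

# Illustrative data with the shapes from the transcript.
rng = np.random.default_rng(0)
a = rng.standard_normal((3218, 6))
rows = rng.integers(0, a.shape[0], size=2000)
cols = np.array([1, 3, 4, 5])

def method1():
    # Consecutive fancy indexing, as in the question.
    return a[rows][:, cols]

def method2():
    # Linear 1D indices into the flattened array.
    flat_idx = (cols + (rows * a.shape[1]).reshape((-1, 1))).ravel()
    return a.ravel()[flat_idx].reshape(rows.size, cols.size)

# Element-wise check: every entry must match, not just the sum.
assert np.array_equal(method1(), method2())

# Best of 3 runs, 100 calls each, reported per call.
for f in (method1, method2):
    best = min(timeit.repeat(f, number=100, repeat=3)) / 100
    print(f"{f.__name__}: {best * 1e6:.1f} us per call")
```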

answered Sep 21 '22 17:09

tzelleke
