Compound assignment operators in Python's Numpy library

Tags:

numpy

The "vectorizing" of fancy indexing by Python's numpy library sometimes gives unexpected results. For example:

import numpy
a = numpy.zeros((1000,4), dtype='uint32')
b = numpy.zeros((1000,4), dtype='uint32')
i = numpy.random.random_integers(0,999,1000)
j = numpy.random.random_integers(0,3,1000)

a[i,j] += 1
for k in xrange(1000):
    b[i[k],j[k]] += 1

Gives different results in the arrays 'a' and 'b' (i.e. the appearance of tuple (i,j) appears as 1 in 'a' regardless of repeats, whereas repeats are counted in 'b'). This is easily verified as follows:

numpy.sum(a)
883
numpy.sum(b)
1000

It is also notable that the fancy indexing version is almost two orders of magnitude faster than the for loop. My question is: "Is there an efficient way for numpy to compute the repeat counts as implemented using the for loop in the provided example?"

560

asked Jun 12 '12 16:06

user1451766

1 Answers

This should do what you want:

np.bincount(np.ravel_multi_index((i, j), (1000, 4)), minlength=4000).reshape(1000, 4)

As a breakdown, ravel_multi_index converts the index pairs specified by i and j to integer indices into a C-flattened array; bincount counts the number of times each value 0..4000 appears in that list of indices; and reshape converts the C-flattened array back to a 2d array.

In terms of performance, I measure it at 200 times faster than "b", and 5 times faster than "a"; your mileage may vary.

Since you need to write the counts to an existing array a, try this:

u, inv = np.unique(np.ravel_multi_index((i, j), (1000, 4)), return_inverse=True)
a.flat[u] += np.bincount(inv)

I make this second method a little slower (2x) than "a", which isn't too surprising as the unique stage is going to be slow.

188

answered Oct 21 '22 03:10

ecatmur

Related questions
                            
                                Setting up Django settings for sphinx (documentation)
                            
                                Using itertools.product and want to seed a value
                            
                                Interactive Brokers automated trading
                            
                                Prevent MySQL-Python from inserting quotes around database name parameter
                            
                                Is there a way to get code-hints for gtk3 and python working on aptana?
                            
                                Beautifulsoup, maximum recursion depth reached
                            
                                Two-dimensional vs. One-dimensional dictionary efficiency in Python
                            
                                How can I prefetch_related across a reverse one-to-one relationship where the one-to-one relationship may be different?
                            
                                Why sys.getsizeof(numpy.int8(1)) returns 12?
                            
                                Re evaluate django query after changes done to database
                            
                                Mac OSX: Switch to Python 2.7.3
                            
                                How can I make Django-Tastypie override a resource if it already exists?
                            
                                Pass a JSON object to an url with requests
                            
                                Reading data blocks from a file in Python
                            
                                Will Distribute be outdated when new packaging comes with Python 3.3?
                            
                                tkinter default button in a widget
                            
                                Moving from multiprocessing to threading
                            
                                No luck pip-installing pylint for Python 3
                            
                                Best way to modify and generalize spaced repetition software
                            
                                Protocol buffers python - unicode decode error

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With