How to apply the output of numpy.argpartition for 2-D Arrays?

Tags:

I have a largish 2d numpy array, and I want to extract the lowest 10 elements of each row as well as their indexes. Since my array is largish, I would prefer not to sort the whole array.

I heard about the argpartition() function, with which I can get the indexes of the lowest 10 elements:

top10indexes = np.argpartition(myBigArray,10)[:,:10]

Note that argpartition() partitions axis -1 by default, which is what I want. The result here has the same shape as myBigArray containing indexes into the respective rows such that the first 10 indexes point to the 10 lowest values.

How can I now extract the elements of myBigArray corresponding to those indexes?

Obvious fancy indexing like myBigArray[top10indexes] or myBigArray[:,top10indexes] do something quite different. I could also use list comprehensions, something like:

array([row[idxs] for row,idxs in zip(myBigArray,top10indexes)])

but that would incur a performance hit iterating numpy rows and converting the result back to an array.

nb: I could just use np.partition() to get the values, and they may even correspond to the indexes (or may not..), but I don't want to do the partition twice if I can avoid it.

694

asked Oct 12 '14 05:10

drevicko

1 Answers

You can avoid using the flattened copies and the need to extract all the values by doing:

num = 10
top = np.argpartition(myBigArray, num, axis=1)[:, :num]
myBigArray[np.arange(myBigArray.shape[0])[:, None], top]

For NumPy >= 1.9.0 this will be very efficient and comparable to np.take().

136

answered Oct 08 '22 00:10

Saullo G. P. Castro

Related questions
                            
                                AttributeError: 'bytes' object has no attribute 'timeout'
                            
                                How to read columns of varying length from a text file in NumPy using genfromtxt()?
                            
                                tweepy how to get a username from id
                            
                                Latex font style issues using amsmath and sfmath for plot labeling
                            
                                Using sqlalchemy to create an index on a json key (expression index)
                            
                                Python 2 newline tokens in tokenize module
                            
                                how to make wxpython grid automatic fit-to-window
                            
                                escape whitespaces in linux path and file names
                            
                                Python: Detect number separator symbols and parse into a float without locale
                            
                                Reading Single Line CSV using numpy.genfromtxt
                            
                                Microsecond accurate timestamp in python?
                            
                                2D Interpolation with periodic boundary conditions
                            
                                Python: calling stop on mock patch class decorator
                            
                                TypeError: list indices must be integers, not str,while parsing json
                            
                                Get a permutation as a function of a unique given index in O(n)
                            
                                Tkinter looks different on different computers
                            
                                Security of regular expressions [duplicate]
                            
                                How do you pass a numpy array to openCV without saving the file as a png or jpeg first?
                            
                                Creating a column from another column in SQLAlchemy
                            
                                How to remove a module using Anaconda in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to apply the output of numpy.argpartition for 2-D Arrays?

Tags:

performance

python

arrays

indexing

numpy

drevicko

People also ask

1 Answers

Saullo G. P. Castro

Recent Activity

Donate For Us