I know that I can get min or max values with: <pre class="prettyprint"><code>max(matrix) min(matrix) </code></pre> out of a numpy matrix/vector. The indices for those vales are returned by: <pre class="prettyprint"><code>argmax(matrix) argmin(matrix) </code></pre> So e.g. when I have a 5x5 matrix: <pre class="prettyprint"><code>a = np.arange(5*5).reshape(5, 5) + 10 # array([[10, 11, 12, 13, 14], # [15, 16, 17, 18, 19], # [20, 21, 22, 23, 24], # [25, 26, 27, 28, 29], # [30, 31, 32, 33, 34]]) </code></pre> I could get the max value via: <pre class="prettyprint"><code>In [86]: np.max(a) # getting the max-value out of a Out[86]: 34 In [87]: np.argmax(a) # index of max-value 34 is 24 if array a were flattened Out[87]: 24 </code></pre> ...but what is the most efficient way to get the max or min n-elements? So let's say out of a I want to have the 5 highest and 5 lowest elements. This should return me <code>[30, 31, 32, 33, 34]</code> for the 5 highest values respectively <code>[20, 21, 22, 23, 24]</code> for their indices. Likewise <code>[10, 11, 12, 13, 14]</code> for the 5 lowest values and <code>[0, 1, 2, 3, 4]</code> for the indices of the 5 lowest elements. What would be an efficient, reasonable solution for this? My first idea was flattening and sorting the array and taking the last and first 5 values. Afterwards I search through the original 2D matrix for the indices of those values. Although this procedure works flattening + sorting isn't very efficient...does anyone know a faster solution? Additionally I would like to have the indices of the original 2D array and not the flattening one. So instead of <code>24</code> returned by <code>np.argmax(a)</code> I would like to have <code>(4, 4)</code>.

The standard way to get the indices of the largest or smallest values in an array is to use <code>np.argpartition</code>. This function uses an introselect algorithm and runs with linear complexity - this performs better than fully sorting for larger arrays (which is typically O(n log n)). By default this function works along the last axis of the array. To consider an entire array, you need to use <code>ravel()</code>. For example, here's a random array <code>a</code>: <pre class="prettyprint"><code>>>> a = np.random.randint(0, 100, size=(5, 5)) >>> a array([[60, 68, 86, 66, 9], [66, 26, 83, 87, 50], [41, 26, 0, 55, 9], [57, 80, 71, 50, 22], [94, 30, 95, 99, 76]]) </code></pre> Then to get the indices of the five largest values in the (flattened) 2D array, use: <pre class="prettyprint"><code>>>> i = np.argpartition(a.ravel(), -5)[-5:] # argpartition(a.ravel(), 5)[:5] for smallest >>> i array([ 2, 8, 22, 23, 20]) </code></pre> To get back the corresponding 2D indices of these positions in <code>a</code>, use <code>unravel_index</code>: <pre class="prettyprint"><code>>>> i2d = np.unravel_index(i, a.shape) >>> i2d (array([0, 1, 4, 4, 4]), array([2, 3, 2, 3, 0])) </code></pre> Then indexing <code>a</code> with <code>i2d</code> gives back the five largest values: <pre class="prettyprint"><code>>>> a[i2d] array([86, 87, 95, 99, 94]) </code></pre>

Get max or min n-elements out of numpy array? (preferably not flattened)

Tags:

python

arrays

slice

max

numpy

I know that I can get min or max values with:

max(matrix)
min(matrix)

out of a numpy matrix/vector. The indices for those vales are returned by:

argmax(matrix)
argmin(matrix)

So e.g. when I have a 5x5 matrix:

a = np.arange(5*5).reshape(5, 5) + 10

# array([[10, 11, 12, 13, 14],
#        [15, 16, 17, 18, 19],
#        [20, 21, 22, 23, 24],
#        [25, 26, 27, 28, 29],
#        [30, 31, 32, 33, 34]])

I could get the max value via:

In [86]: np.max(a) # getting the max-value out of a
Out[86]: 34

In [87]: np.argmax(a) # index of max-value 34 is 24 if array a were flattened
Out[87]: 24

...but what is the most efficient way to get the max or min n-elements?

So let's say out of a I want to have the 5 highest and 5 lowest elements. This should return me [30, 31, 32, 33, 34] for the 5 highest values respectively [20, 21, 22, 23, 24] for their indices. Likewise [10, 11, 12, 13, 14] for the 5 lowest values and [0, 1, 2, 3, 4] for the indices of the 5 lowest elements.

What would be an efficient, reasonable solution for this?

My first idea was flattening and sorting the array and taking the last and first 5 values. Afterwards I search through the original 2D matrix for the indices of those values. Although this procedure works flattening + sorting isn't very efficient...does anyone know a faster solution?

Additionally I would like to have the indices of the original 2D array and not the flattening one. So instead of 24 returned by np.argmax(a) I would like to have (4, 4).

935

asked Jan 19 '16 14:01

daniel451

1 Answers

The standard way to get the indices of the largest or smallest values in an array is to use np.argpartition. This function uses an introselect algorithm and runs with linear complexity - this performs better than fully sorting for larger arrays (which is typically O(n log n)).

By default this function works along the last axis of the array. To consider an entire array, you need to use ravel(). For example, here's a random array a:

>>> a = np.random.randint(0, 100, size=(5, 5))
>>> a
array([[60, 68, 86, 66,  9],
       [66, 26, 83, 87, 50],
       [41, 26,  0, 55,  9],
       [57, 80, 71, 50, 22],
       [94, 30, 95, 99, 76]])

Then to get the indices of the five largest values in the (flattened) 2D array, use:

>>> i = np.argpartition(a.ravel(), -5)[-5:] # argpartition(a.ravel(), 5)[:5] for smallest
>>> i
array([ 2,  8, 22, 23, 20])

To get back the corresponding 2D indices of these positions in a, use unravel_index:

>>> i2d = np.unravel_index(i, a.shape)
>>> i2d
(array([0, 1, 4, 4, 4]), array([2, 3, 2, 3, 0]))

Then indexing a with i2d gives back the five largest values:

>>> a[i2d]
array([86, 87, 95, 99, 94])

174

answered Nov 15 '22 00:11

Alex Riley

Related questions
                            
                                How can I mock/patch a decorator in python?
                            
                                Detect star shape in opencv-python
                            
                                skimage resize giving weird output
                            
                                Removing duplicates from a list of lists based on a comparison of an element of the inner lists
                            
                                Sublime Text 3: Write text to output panel
                            
                                Django 1.9 can't modify unique_together (ValueError) wrong number of constrains
                            
                                Etags used in RESTful APIs are still susceptible to race conditions
                            
                                Browsers close socket before the response is fully downloaded
                            
                                Completing Spotify Authorization Code Flow via desktop application without using browser
                            
                                combine values of several objects into a single dictionary
                            
                                Checking divisibility for (sort of) big numbers in python
                            
                                Upgrading a Python 3 virtual environment [duplicate]
                            
                                how to pass context data with django redirect function?
                            
                                How to calculate the click-through rate
                            
                                JavaScript/Ajax to Dynamically Update WTForms Select Field
                            
                                Python building cython extension with setup creates subfolder when __init__.py exists
                            
                                How to resample a Pandas dataframe of mixed type?
                            
                                Shared x axes in Pandas Python
                            
                                pivot irregular dictionary of lists into pandas dataframe
                            
                                Sympy second order ode

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With