numpy.argmin for elements greater than a threshold

Tags: performance, python, arrays, numpy

I'm interested in getting the location of the minimum value in a 1-D NumPy array among the elements that meet a certain condition (in my case, a minimum threshold). For example:

import numpy as np

limit = 3
a = np.array([1, 2, 4, 5, 2, 5, 3, 6, 7, 9, 10])

I'd like to effectively mask all numbers in a that are under the limit, such that the result of np.argmin would be 6. Is there a computationally cheap way to mask values that don't meet a condition and then apply np.argmin?


asked Jun 22 '16 16:06

triphook

2 Answers

This can be done with NumPy's MaskedArray:

import numpy as np

limit = 3
a = np.array([1, 2, 4, 5, 2, 5, 3, 6, 7, 9, 10])
b = np.ma.MaskedArray(a, mask=a < limit)  # hide every value below the limit
np.ma.argmin(b)    # == 6
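To see why the result is an index into the original array, it may help to inspect the mask directly. The following is only a small sketch built on the same `a` and `limit` as above; the variable name `idx` is introduced here for illustration.

```python
import numpy as np

limit = 3
a = np.array([1, 2, 4, 5, 2, 5, 3, 6, 7, 9, 10])

# True marks a hidden (masked) element, so the values below the limit are True.
b = np.ma.MaskedArray(a, mask=a < limit)
print(b.mask)

# Number of elements that are still visible after masking.
print(b.count())

# argmin skips the masked entries but reports a position in the full array,
# so the result can be used to index `a` directly.
idx = int(np.ma.argmin(b))
print(idx, a[idx])
```

Because the masked array keeps the original shape, no index translation is needed afterwards.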


answered Sep 26 '22 19:09

MaxPowers

You could store the indices of the valid elements and use them twice: first to select the valid elements from a, and then to map the argmin() computed among those selected elements back to an index into the original array. The implementation would look something like this -

valid_idx = np.where(a >= limit)[0]
out = valid_idx[a[valid_idx].argmin()]

Sample run -

In [32]: limit = 3
    ...: a = np.array([1, 2, 4, 5, 2, 5, 3, 6, 7, 9, 10])
    ...: 

In [33]: valid_idx = np.where(a >= limit)[0]

In [34]: valid_idx[a[valid_idx].argmin()]
Out[34]: 6
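One case the snippet above does not cover is an array in which no element reaches the limit: `a[valid_idx]` is then empty and `argmin()` raises a `ValueError`. A possible guard is sketched below. The helper name `argmin_above` and the choice to return `None` are assumptions made for illustration, not part of the original answer.

```python
import numpy as np


def argmin_above(a, limit):
    """Index of the smallest element of `a` that is >= limit, or None if there is none.

    Hypothetical helper: the name and the None return value are illustrative choices.
    """
    a = np.asarray(a)
    valid_idx = np.where(a >= limit)[0]   # positions that meet the threshold
    if valid_idx.size == 0:               # nothing qualifies, so avoid argmin on an empty array
        return None
    # argmin among the selected values, mapped back to a position in `a`
    return int(valid_idx[a[valid_idx].argmin()])


a = np.array([1, 2, 4, 5, 2, 5, 3, 6, 7, 9, 10])
print(argmin_above(a, 3))     # threshold met by several elements
print(argmin_above(a, 100))   # threshold met by none
```

Raising a custom exception instead of returning `None` would work just as well; which is preferable depends on how the caller wants to handle the empty case.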

Runtime test -

This section benchmarks the masked-array solution from the other answer against the regular-array solution proposed above, for two input sizes.

def masked_argmin(a, limit):
    # Regular-array solution: find the valid positions, then take argmin among them
    valid_idx = np.where(a >= limit)[0]
    return valid_idx[a[valid_idx].argmin()]

In [52]: # Inputs
    ...: a = np.random.randint(0,1000,(10000))
    ...: limit = 500
    ...: 

In [53]: %timeit np.argmin(np.ma.MaskedArray(a, a<limit))
1000 loops, best of 3: 233 µs per loop

In [54]: %timeit masked_argmin(a,limit)
10000 loops, best of 3: 101 µs per loop

In [55]: # Inputs
    ...: a = np.random.randint(0,1000,(100000))
    ...: limit = 500
    ...: 

In [56]: %timeit np.argmin(np.ma.MaskedArray(a, a<limit))
1000 loops, best of 3: 1.73 ms per loop

In [57]: %timeit masked_argmin(a,limit)
1000 loops, best of 3: 1.03 ms per loop
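The timings above come from an IPython session and will differ across machines and NumPy versions. To repeat the comparison as a plain script, something along the following lines could be used. This is a rough sketch: the function names, the seeded generator, and the repeat counts are assumptions chosen for illustration, and it calls `np.ma.argmin` rather than the `np.argmin` used in the session above. It also checks that both approaches return the same index before timing them.

```python
import timeit

import numpy as np


def masked_array_argmin(a, limit):
    # Masked-array approach from the other answer
    return int(np.ma.argmin(np.ma.MaskedArray(a, mask=a < limit)))


def where_argmin(a, limit):
    # Regular-array approach from this answer
    valid_idx = np.where(a >= limit)[0]
    return int(valid_idx[a[valid_idx].argmin()])


rng = np.random.default_rng(0)   # seeded so that runs are repeatable
limit = 500
results = {}

for size in (10_000, 100_000):
    a = rng.integers(0, 1000, size=size)
    # Record both answers so they can be compared for agreement
    results[size] = (masked_array_argmin(a, limit), where_argmin(a, limit))
    for func in (masked_array_argmin, where_argmin):
        # Best of 3 repeats, 100 calls each, reported per call
        seconds = min(timeit.repeat(lambda: func(a, limit), number=100, repeat=3))
        print(f"{func.__name__:>20} n={size:>7}: {seconds / 100 * 1e6:.1f} µs per call")
```

Taking the minimum over several repeats mirrors the "best of 3" figure that `%timeit` reports in the session above.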

answered Sep 24 '22 19:09

Divakar
