Input example: I have a numpy array, e.g. <code>a=np.array([[0,1], [2, 1], [4, 8]])</code> Desired output: I would like to produce a mask array with the max value along a given axis, in my case axis 1, being True and all others being False. e.g. in this case <code>mask = np.array([[False, True], [True, False], [False, True]])</code> Attempt: I have tried approaches using <code>np.amax</code> but this returns the max values in a flattened list: <pre class="prettyprint"><code>>>> np.amax(a, axis=1) array([1, 2, 8]) </code></pre> and <code>np.argmax</code> similarly returns the indices of the max values along that axis. <pre class="prettyprint"><code>>>> np.argmax(a, axis=1) array([1, 0, 1]) </code></pre> I could iterate over this in some way but once these arrays become bigger I want the solution to remain something native in numpy.

Method #1 Using <code>broadcasting</code>, we can use comparison against the max values, while keeping dims to facilitate <code>broadcasting</code> - <pre class="prettyprint"><code>a.max(axis=1,keepdims=1) == a </code></pre> Sample run - <pre class="prettyprint"><code>In [83]: a Out[83]: array([[0, 1], [2, 1], [4, 8]]) In [84]: a.max(axis=1,keepdims=1) == a Out[84]: array([[False, True], [ True, False], [False, True]], dtype=bool) </code></pre> Method #2 Alternatively with <code>argmax</code> indices for one more case of <code>broadcasted-comparison</code> against the range of indices along the columns - <pre class="prettyprint"><code>In [92]: a.argmax(axis=1)[:,None] == range(a.shape[1]) Out[92]: array([[False, True], [ True, False], [False, True]], dtype=bool) </code></pre> Method #3 To finish off the set, and if we are looking for performance, use intialization and then <code>advanced-indexing</code> - <pre class="prettyprint"><code>out = np.zeros(a.shape, dtype=bool) out[np.arange(len(a)), a.argmax(axis=1)] = 1 </code></pre>

You're already halfway in the answer. Once you compute the max along an axis, you can compare it with the input array and you'll have the required binary mask! <pre class="prettyprint"><code>In [7]: maxx = np.amax(a, axis=1) In [8]: maxx Out[8]: array([1, 2, 8]) In [12]: a >= maxx[:, None] Out[12]: array([[False, True], [ True, False], [False, True]], dtype=bool) </code></pre> Note: This uses NumPy broadcasting when doing the comparison between <code>a</code> and <code>maxx</code>

Mask from max values in numpy array, specific axis

Tags:

python

numpy

Input example:

I have a numpy array, e.g.

a=np.array([[0,1], [2, 1], [4, 8]])

Desired output:

I would like to produce a mask array with the max value along a given axis, in my case axis 1, being True and all others being False. e.g. in this case

mask = np.array([[False, True], [True, False], [False, True]])

Attempt:

I have tried approaches using np.amax but this returns the max values in a flattened list:

>>> np.amax(a, axis=1)
array([1, 2, 8])

and np.argmax similarly returns the indices of the max values along that axis.

>>> np.argmax(a, axis=1)
array([1, 0, 1])

I could iterate over this in some way but once these arrays become bigger I want the solution to remain something native in numpy.

615

asked Dec 06 '17 15:12

feedMe

3 Answers

Method #1

Using broadcasting, we can use comparison against the max values, while keeping dims to facilitate broadcasting -

a.max(axis=1,keepdims=1) == a

Sample run -

In [83]: a
Out[83]: 
array([[0, 1],
       [2, 1],
       [4, 8]])

In [84]: a.max(axis=1,keepdims=1) == a
Out[84]: 
array([[False,  True],
       [ True, False],
       [False,  True]], dtype=bool)

Method #2

Alternatively with argmax indices for one more case of broadcasted-comparison against the range of indices along the columns -

In [92]: a.argmax(axis=1)[:,None] == range(a.shape[1])
Out[92]: 
array([[False,  True],
       [ True, False],
       [False,  True]], dtype=bool)

Method #3

To finish off the set, and if we are looking for performance, use intialization and then advanced-indexing -

out = np.zeros(a.shape, dtype=bool)
out[np.arange(len(a)), a.argmax(axis=1)] = 1

132

answered Sep 28 '22 05:09

Divakar

Create an identity matrix and select from its rows using argmax on your array:

np.identity(a.shape[1], bool)[a.argmax(axis=1)]
# array([[False,  True],
#        [ True, False],
#        [False,  True]], dtype=bool)

Please note that this ignores ties, it just goes with the value returned by argmax.

answered Sep 28 '22 05:09

Paul Panzer

You're already halfway in the answer. Once you compute the max along an axis, you can compare it with the input array and you'll have the required binary mask!

In [7]: maxx = np.amax(a, axis=1)

In [8]: maxx
Out[8]: array([1, 2, 8])

In [12]: a >= maxx[:, None]
Out[12]: 
array([[False,  True],
       [ True, False],
       [False,  True]], dtype=bool)

Note: This uses NumPy broadcasting when doing the comparison between a and maxx

answered Sep 28 '22 05:09

kmario23

Related questions
                            
                                Element-wise minimum of multiple vectors in numpy
                            
                                Python and Node.js on Heroku
                            
                                Django rest framework pagination with custom API view
                            
                                Google Cloud Storage HttpAccessTokenRefreshError: invalid_grant: Bad Request
                            
                                converting two digit integer into single digit inside a python list?
                            
                                Why does ast.literal_eval('5 * 7') fail?
                            
                                Outlook using python win32com to iterate subfolders
                            
                                Find count of characters within the string in Python
                            
                                ImportError: No module named geopandas
                            
                                closing session in tensorflow doesn't reset graph
                            
                                Python (Pandas) Add subtotal on each lvl of multiindex dataframe
                            
                                pip install pickle not working - no such file or directory
                            
                                expanding a dataframe based on start and end columns (speed)
                            
                                How to remove the quotes from a string for SQL query in Python?
                            
                                Convert column values to lower case only if they are string
                            
                                How to remove all the values in a string except for the chosen ones [duplicate]
                            
                                json.loads() doesn't keep order [duplicate]
                            
                                Check if module is running in Jupyter or not
                            
                                Is there a way to delete all cells at once in jupyter?
                            
                                Python download youtube with specific filename

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Mask from max values in numpy array, specific axis

Tags:

python

numpy

feedMe

People also ask

3 Answers

Divakar

Paul Panzer

kmario23

Recent Activity

Donate For Us