I have a 2D array containing integers (both positive or negative). Each row represents the values over time for a particular spatial site, whereas each column represents values for various spatial sites for a given time. So if the array is like: <pre class="prettyprint"><code>1 3 4 2 2 7 5 2 2 1 4 1 3 3 2 2 1 1 </code></pre> The result should be <pre class="prettyprint"><code>1 3 2 2 2 1 </code></pre> Note that when there are multiple values for mode, any one (selected randomly) may be set as mode. I can iterate over the columns finding mode one at a time but I was hoping numpy might have some in-built function to do that. Or if there is a trick to find that efficiently without looping.

Check <code>scipy.stats.mode()</code> (inspired by @tom10's comment): <pre class="prettyprint"><code>import numpy as np from scipy import stats a = np.array([[1, 3, 4, 2, 2, 7], [5, 2, 2, 1, 4, 1], [3, 3, 2, 2, 1, 1]]) m = stats.mode(a) print(m) </code></pre> Output: <pre class="prettyprint"><code>ModeResult(mode=array([[1, 3, 2, 2, 1, 1]]), count=array([[1, 2, 2, 2, 1, 2]])) </code></pre> As you can see, it returns both the mode as well as the counts. You can select the modes directly via <code>m[0]</code>: <pre class="prettyprint"><code>print(m[0]) </code></pre> Output: <pre class="prettyprint"><code>[[1 3 2 2 1 1]] </code></pre>

Most efficient way to find mode in numpy array

Tags:

python

numpy

mode

2d

I have a 2D array containing integers (both positive or negative). Each row represents the values over time for a particular spatial site, whereas each column represents values for various spatial sites for a given time.

So if the array is like:

1 3 4 2 2 7 5 2 2 1 4 1 3 3 2 2 1 1

The result should be

1 3 2 2 2 1

Note that when there are multiple values for mode, any one (selected randomly) may be set as mode.

I can iterate over the columns finding mode one at a time but I was hoping numpy might have some in-built function to do that. Or if there is a trick to find that efficiently without looping.

718

asked May 02 '13 05:05

Nik

1 Answers

Check scipy.stats.mode() (inspired by @tom10's comment):

import numpy as np from scipy import stats  a = np.array([[1, 3, 4, 2, 2, 7],               [5, 2, 2, 1, 4, 1],               [3, 3, 2, 2, 1, 1]])  m = stats.mode(a) print(m)

Output:

ModeResult(mode=array([[1, 3, 2, 2, 1, 1]]), count=array([[1, 2, 2, 2, 1, 2]]))

As you can see, it returns both the mode as well as the counts. You can select the modes directly via m[0]:

print(m[0])

Output:

[[1 3 2 2 1 1]]

answered Oct 05 '22 12:10

fgb

Related questions
                            
                                Validating with an XML schema in Python
                            
                                How can I set the 'backend' in matplotlib in Python?
                            
                                Is python's sorted() function guaranteed to be stable?
                            
                                Python Pandas : group by in group by and average?
                            
                                Adding meta-information/metadata to pandas DataFrame
                            
                                sort eigenvalues and associated eigenvectors after using numpy.linalg.eig in python
                            
                                Python: reload component Y imported with 'from X import Y'?
                            
                                Django rest framework serializing many to many field
                            
                                Foreign key from one app into another in Django
                            
                                ipython reads wrong python version
                            
                                How to parse/read a YAML file into a Python object? [duplicate]
                            
                                Python: Tuples/dictionaries as keys, select, sort
                            
                                In Django - Model Inheritance - Does it allow you to override a parent model's attribute?
                            
                                Argmax of numpy array returning non-flat indices
                            
                                Matplotlib Legends not working
                            
                                Split models.py into several files
                            
                                How to do math in a Django template?
                            
                                Default value for field in Django model
                            
                                In python, how do I cast a class object to a dict
                            
                                How do I access the command history from IDLE?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With