In numpy, is there a nice idiomatic way of testing if all rows are equal in a 2d array? I can do something like <pre class="prettyprint"><code>np.all([np.array_equal(M[0], M[i]) for i in xrange(1,len(M))]) </code></pre> This seems to mix python lists with numpy arrays which is ugly and presumably also slow. Is there a nicer/neater way?

One way is to check that every row of the array <code>arr</code> is equal to its first row <code>arr[0]</code>: <pre class="prettyprint"><code>(arr == arr[0]).all() </code></pre> Using equality <code>==</code> is fine for integer values, but if <code>arr</code> contains floating point values you could use <code>np.isclose</code> instead to check for equality within a given tolerance: <pre class="prettyprint"><code>np.isclose(a, a[0]).all() </code></pre> If your array contains <code>NaN</code> and you want to avoid the tricky <code>NaN != NaN</code> issue, you could combine this approach with <code>np.isnan</code>: <pre class="prettyprint"><code>(np.isclose(a, a[0]) | np.isnan(a)).all() </code></pre>

<strike>Simply check if the number if unique items in the array are 1:</strike> <pre class="prettyprint"><code>>>> arr = np.array([[1]*10 for _ in xrange(5)]) >>> len(np.unique(arr)) == 1 True </code></pre> A solution inspired from unutbu's answer: <pre class="prettyprint"><code>>>> arr = np.array([[1]*10 for _ in xrange(5)]) >>> np.all(np.all(arr == arr[0,:], axis = 1)) True </code></pre> One problem with your code is that you're creating an entire list first before applying <code>np.all()</code> on it. Due to that there's no short-circuiting happening in your version, instead of that it would be better if you use Python's <code>all()</code> with a generator expression: Timing comparisons: <pre class="prettyprint"><code>>>> M = arr = np.array([[3]*100] + [[2]*100 for _ in xrange(1000)]) >>> %timeit np.all(np.all(arr == arr[0,:], axis = 1)) 1000 loops, best of 3: 272 µs per loop >>> %timeit (np.diff(M, axis=0) == 0).all() 1000 loops, best of 3: 596 µs per loop >>> %timeit np.all([np.array_equal(M[0], M[i]) for i in xrange(1,len(M))]) 100 loops, best of 3: 10.6 ms per loop >>> %timeit all(np.array_equal(M[0], M[i]) for i in xrange(1,len(M))) 100000 loops, best of 3: 11.3 µs per loop >>> M = arr = np.array([[2]*100 for _ in xrange(1000)]) >>> %timeit np.all(np.all(arr == arr[0,:], axis = 1)) 1000 loops, best of 3: 330 µs per loop >>> %timeit (np.diff(M, axis=0) == 0).all() 1000 loops, best of 3: 594 µs per loop >>> %timeit np.all([np.array_equal(M[0], M[i]) for i in xrange(1,len(M))]) 100 loops, best of 3: 9.51 ms per loop >>> %timeit all(np.array_equal(M[0], M[i]) for i in xrange(1,len(M))) 100 loops, best of 3: 9.44 ms per loop </code></pre>

How to test if all rows are equal in a numpy

Tags:

In numpy, is there a nice idiomatic way of testing if all rows are equal in a 2d array?

I can do something like

np.all([np.array_equal(M[0], M[i]) for i in xrange(1,len(M))])

This seems to mix python lists with numpy arrays which is ugly and presumably also slow.

Is there a nicer/neater way?

944

asked Oct 02 '14 15:10

graffe

2 Answers

One way is to check that every row of the array arr is equal to its first row arr[0]:

(arr == arr[0]).all()

Using equality == is fine for integer values, but if arr contains floating point values you could use np.isclose instead to check for equality within a given tolerance:

np.isclose(a, a[0]).all()

If your array contains NaN and you want to avoid the tricky NaN != NaN issue, you could combine this approach with np.isnan:

(np.isclose(a, a[0]) | np.isnan(a)).all()

200

answered Sep 25 '22 06:09

Alex Riley

~~Simply check if the number if unique items in the array are 1:~~

>>> arr = np.array([[1]*10 for _ in xrange(5)]) >>> len(np.unique(arr)) == 1 True

A solution inspired from unutbu's answer:

>>> arr = np.array([[1]*10 for _ in xrange(5)]) >>> np.all(np.all(arr == arr[0,:], axis = 1)) True

One problem with your code is that you're creating an entire list first before applying np.all() on it. Due to that there's no short-circuiting happening in your version, instead of that it would be better if you use Python's all() with a generator expression:

Timing comparisons:

>>> M = arr = np.array([[3]*100] + [[2]*100 for _ in xrange(1000)]) >>> %timeit np.all(np.all(arr == arr[0,:], axis = 1)) 1000 loops, best of 3: 272 µs per loop >>> %timeit (np.diff(M, axis=0) == 0).all() 1000 loops, best of 3: 596 µs per loop >>> %timeit np.all([np.array_equal(M[0], M[i]) for i in xrange(1,len(M))]) 100 loops, best of 3: 10.6 ms per loop >>> %timeit all(np.array_equal(M[0], M[i]) for i in xrange(1,len(M))) 100000 loops, best of 3: 11.3 µs per loop  >>> M = arr = np.array([[2]*100 for _ in xrange(1000)]) >>> %timeit np.all(np.all(arr == arr[0,:], axis = 1)) 1000 loops, best of 3: 330 µs per loop >>> %timeit (np.diff(M, axis=0) == 0).all() 1000 loops, best of 3: 594 µs per loop >>> %timeit np.all([np.array_equal(M[0], M[i]) for i in xrange(1,len(M))]) 100 loops, best of 3: 9.51 ms per loop >>> %timeit all(np.array_equal(M[0], M[i]) for i in xrange(1,len(M))) 100 loops, best of 3: 9.44 ms per loop

answered Sep 26 '22 06:09

Ashwini Chaudhary

Related questions
                            
                                Accidentally overwrote iPhone device location via Xcode
                            
                                ubuntu 14.04 /etc/init.d/ vs /etc/init/ start service at startup
                            
                                ANTLR4 mutually left-recursive error when parsing
                            
                                How to remove all cookies in Angularjs?
                            
                                angular ui.router state.go('statename') not working
                            
                                Catch all Exceptions and also return custom Errors in Jersey
                            
                                Constructor pattern by Douglas Crockford
                            
                                How to consume npm package with es6 module via Webpack and 6to5?
                            
                                Exponential backoff: time.sleep with random.randint(0, 1000) / 1000
                            
                                OSX boot2docker hangs on "Waiting for VM and Docker daemon to start …"
                            
                                How to instantiate Spring managed beans at runtime?
                            
                                Can I import 3rd party package into golang playground

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With