NumPy ndarray.all() vs np.all(ndarray) vs all(ndarray)

Tags:

What is the the difference between the three "all" methods in Python/NumPy? What is the reason for the performance difference? Is it true that ndarray.all() is always the fastest of the three?

Here is a timing test that I ran:

In [59]: a = np.full(100000, True, dtype=bool)

In [60]: timeit a.all()
The slowest run took 5.40 times longer than the fastest. This could mean that an intermediate result is being cached.
100000 loops, best of 3: 5.24 µs per loop

In [61]: timeit all(a)
1000 loops, best of 3: 1.34 ms per loop

In [62]: timeit np.all(a)
The slowest run took 5.54 times longer than the fastest. This could mean that an intermediate result is being cached.
100000 loops, best of 3: 6.41 µs per loop

918

asked Apr 13 '17 01:04

dkv

2 Answers

The difference between np.all(a) and a.all() is simple:

If a is a numpy.array then np.all() will simply call a.all().
If a is not a numpy.array the np.all() call will convert it to an numpy.array and then call a.all(). a.all() on the other hand will fail because a wasn't a numpy.array and therefore probably has no all method.

The difference between np.all and all is more complicated.

The all function works on any iterable (including list, sets, generators, ...). np.all works only for numpy.arrays (including everything that can be converted to a numpy array, i.e. lists and tuples).
np.all processes an array with specified data type, that makes it pretty efficient when comparing for != 0. all however needs to evaluate bool for each item, that's much slower.
processing arrays with python functions is pretty slow because each item in the array needs to be converted to a python object. np.all doesn't need to do that conversion.

Note that the timings depend also on the type of your a. If you process a python list all can be faster for relativly short lists. If you process an array, np.all and a.all() will be faster in almost all cases (except maybe for object arrays, but I won't go down that path, that way lies madness).

answered Oct 13 '22 01:10

MSeifert

I'll take a swing at this

np.all is a generic function which will work with different data types, under the hood this probably looks for ndarray.all which is why it's slightly slower.
all is a python bulit-in function see https://docs.python.org/2/library/functions.html#all.
ndarray.all is method of the 'numpy.ndarray' object, calling this directly may be faster.

answered Oct 12 '22 23:10

pyCthon

Related questions
                            
                                Plotting markers on a map using Pandas & Folium
                            
                                How to fill numpy array of zeros with ones given indices/coordinates
                            
                                Is an import in python considered to be dynamic linking?
                            
                                Difference between scipy.leastsq and scipy.least_squares
                            
                                How to convert a timedelta to a string and back again
                            
                                Renaming columns on DataFrame output of pandas.concat
                            
                                Using Scipy curve_fit with piecewise function
                            
                                Cloning Conda root environment does not clone conda and condo-build
                            
                                Why does shuffling my validation set in Keras change my model's performance?
                            
                                Symbol not found: _sqlite3_enable_load_extension - sqlite installed via homebrew
                            
                                Preserving quotes in ruamel.yaml
                            
                                python numpy: how to construct a big diagonal array(matrix) from two small array
                            
                                Json parsing Python subprocess
                            
                                How to dynamically import modules?
                            
                                Making a list and appending to it in TensorFlow
                            
                                ANSI color lost when using python subprocess [closed]
                            
                                Pandas: How to use LocIndexer?
                            
                                How to remove an data/models from nltk dowloader?
                            
                                What is the meaning of angle brackets in Python?
                            
                                Can I handle multiple asserts within a single Python pytest method?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

NumPy ndarray.all() vs np.all(ndarray) vs all(ndarray)

Tags:

performance

python

numpy

dkv

People also ask

2 Answers

MSeifert

pyCthon

Recent Activity

Donate For Us