Performance of zeros function in Numpy

Tags:

numpy

I just noticed that the zeros function of numpy has a strange behavior :

%timeit np.zeros((1000, 1000))
1.06 ms ± 29.8 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

%timeit np.zeros((5000, 5000))
4 µs ± 66 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

On the other hand, ones seems to have a normal behavior. Is anybody know why initializing a small numpy array with the zeros function takes more time than for a large array ?

(Python 3.5, numpy 1.11)

410

asked Jun 11 '17 19:06

1 Answers

This looks like calloc hitting a threshold where it makes an OS request for zeroed memory and doesn't need to initialize it manually. Looking through the source code, numpy.zeros eventually delegates to calloc to acquire a zeroed memory block, and if you compare to numpy.empty, which doesn't perform initialization:

In [15]: %timeit np.zeros((5000, 5000))
The slowest run took 12.65 times longer than the fastest. This could mean that a
n intermediate result is being cached.
100000 loops, best of 3: 10 µs per loop

In [16]: %timeit np.empty((5000, 5000))
The slowest run took 5.05 times longer than the fastest. This could mean that an
 intermediate result is being cached.
100000 loops, best of 3: 10.3 µs per loop

you can see that np.zeros has no initialization overhead for the 5000x5000 array.

In fact, the OS isn't even "really" allocating that memory until you try to access it. A request for terabytes of array succeeds on a machine without terabytes to spare:

In [23]: x = np.zeros(2**40)  # No MemoryError!

answered Sep 27 '22 19:09

user2357112 supports Monica

Related questions
                            
                                Pandas: Dataframe.Drop - ValueError: labels ['id'] not contained in axis
                            
                                Anaconda "failed to create process"
                            
                                Yes/No prompt in Python3 using strtobool
                            
                                How to optimize MAPE code in Python?
                            
                                Non-blocking requests in Sanic framework
                            
                                Don't understand cause of "IndexError: tuple index out of range" when formatting string
                            
                                How to create groups and assign permission during project setup in django?
                            
                                NumPy: calculate cumulative median
                            
                                Prevent deletion of parent row if it's child will be orphaned in SQLAlchemy
                            
                                How should I pass my s3 credentials to Python lambda function on AWS?
                            
                                Tensorflow dynamic RNN (LSTM): how to format input?
                            
                                python arabic encoding issue
                            
                                Pandas df to database using flask-sqlalchemy
                            
                                How can I use a text file as database in Python?
                            
                                scheduled sampling in Tensorflow
                            
                                Is there a way to obtain the instance id within an ec2 instance [duplicate]
                            
                                Check if two numpy arrays are identical
                            
                                "pattern" package for python 3.6 Anaconda
                            
                                How is the categorical_crossentropy implemented in keras?
                            
                                How can I eliminate the gray border around Jupyter/ipython notebooks in my browser?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Performance of zeros function in Numpy

Tags:

python

numpy

Ipse Lium

People also ask

1 Answers

user2357112 supports Monica

Recent Activity

Donate For Us