built-in range or numpy.arange: which is more efficient?

Tags:

When iterating over a large array with a range expression, should I use Python's built-in range function, or numpy's arange to get the best performance?

My reasoning so far:

range probably resorts to a native implementation and might be faster therefore. On the other hand, arange returns a full array, which occupies memory, so there might be an overhead. Python 3's range expression is a generator, which does not hold all the values in memory.

429

asked May 22 '12 09:05

clstaudt

1 Answers

For large arrays, a vectorised numpy operation is the fastest. If you must loop, prefer xrange/range and avoid using np.arange.

In numpy you should use combinations of vectorized calculations, ufuncs and indexing to solve your problems as it runs at C speed. Looping over numpy arrays is inefficient compared to this.

(Something like the worst thing you could do would be to iterate over the array with an index created with range or np.arange as the first sentence in your question suggests, but I'm not sure if you really mean that.)

import numpy as np import sys  sys.version # out: '2.7.3rc2 (default, Mar 22 2012, 04:35:15) \n[GCC 4.6.3]' np.version.version # out: '1.6.2'  size = int(1E6)  %timeit for x in range(size): x ** 2 # out: 10 loops, best of 3: 136 ms per loop  %timeit for x in xrange(size): x ** 2 # out: 10 loops, best of 3: 88.9 ms per loop  # avoid this %timeit for x in np.arange(size): x ** 2 #out: 1 loops, best of 3: 1.16 s per loop  # use this %timeit np.arange(size) ** 2 #out: 100 loops, best of 3: 19.5 ms per loop

So for this case numpy is 4 times faster than using xrange if you do it right. Depending on your problem numpy can be much faster than a 4 or 5 times speed up.

The answers to this question explain some more advantages of using numpy arrays instead of python lists for large data sets.

168

answered Sep 23 '22 08:09

bmu

Related questions
                            
                                npm - "Can't find Python executable "python", you can set the PYTHON env variable."
                            
                                Flask: get current route
                            
                                `final` keyword equivalent for variables in Python?
                            
                                Django get list of models in application
                            
                                when installing pyaudio, pip cannot find portaudio.h in /usr/local/include
                            
                                Best way to generate xml? [duplicate]
                            
                                Using a dictionary to select function to execute
                            
                                Flask and uWSGI - unable to load app 0 (mountpoint='') (callable not found or import error)
                            
                                How to Maximize window in chrome using webDriver (python)
                            
                                How to Add Incremental Numbers to a New Column Using Pandas
                            
                                How to overplot a line on a scatter plot in python?
                            
                                Python: Find a substring in a string and returning the index of the substring
                            
                                Is there a better way to compare dictionary values
                            
                                Make new column in Panda dataframe by adding values from other columns
                            
                                How do you get the process ID of a program in Unix or Linux using Python?
                            
                                How to dynamically create a derived type in the Python C-API
                            
                                Is there a Generic python library to consume REST based services? [closed]
                            
                                How do I create a login API using Django Rest Framework?
                            
                                Why is Python 3 not backwards compatible? [closed]
                            
                                What is the difference between variable_scope and name_scope? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

built-in range or numpy.arange: which is more efficient?

Tags:

python

python-3.x

range

numpy

clstaudt

People also ask

1 Answers

bmu

Recent Activity

Donate For Us