I was investigating Python generators and decided to run a little experiment.
TOTAL = 100000000

def my_sequence():
    i = 0
    while i < TOTAL:
        yield i
        i += 1

def my_list():
    return range(TOTAL)

def my_xrange():
    return xrange(TOTAL)
Memory usage (using psutil to get the process's RSS) and time taken (using time.time()), averaged over several runs of each method, are shown below:
sequence_of_values = my_sequence() # Memory usage: 6782976B Time taken: 9.53674e-07 s
sequence_of_values2 = my_xrange() # Memory usage: 6774784B Time taken: 2.14576e-06 s
list_of_values = my_list() # Memory usage: 3266207744B Time taken: 1.80253s
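The measurement harness itself is omitted; a simplified sketch of a single measurement (assuming psutil 2.0+ for memory_info) looks roughly like this, with the real numbers averaged over several repetitions:

import time
import psutil

proc = psutil.Process()          # the current process

start = time.time()
result = my_list()               # or my_sequence() / my_xrange()
elapsed = time.time() - start

rss = proc.memory_info().rss     # resident set size in bytes
print("Memory usage: %dB Time taken: %g s" % (rss, elapsed))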
I noticed that producing the lazy sequence with xrange is consistently (slightly) slower than producing the generator with yield. Why is that so?
xrange() is faster than range() because it evaluates lazily: it returns a small object that produces values only as they are requested, instead of building the entire list up front.
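For example (Python 2, where xrange exists; the numbers are only illustrative):

r = xrange(100000000)   # returns immediately; nothing is materialized
print(r[12345])         # 12345 -- individual values are computed on demand
print(len(r))           # 100000000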
The yield keyword controls the flow of a generator function, much like a return statement is used to return values from an ordinary function. A function that contains yield becomes a generator function: calling it hands the caller a generator object, and to get the values stored inside that generator object you need to iterate over it.
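A minimal sketch to illustrate (countdown is just an example function, not from the question):

def countdown(n):
    # a function containing yield is a generator function;
    # calling it builds a generator object but does not run the body yet
    while n > 0:
        yield n
        n -= 1

gen = countdown(3)     # no values have been produced at this point
print(list(gen))       # iterating the generator yields [3, 2, 1]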
I'm going to preface this answer by saying that timings on this scale are likely going to be hard to measure accurately (probably best to use timeit) and that these sorts of optimizations will almost never make any difference in your actual program's runtime ...
Ok, now the disclaimer is done ...
The first thing that you need to notice is that you're only timing the construction of the generator/xrange object -- you are NOT timing how long it takes to actually iterate over the values¹. There are a couple of reasons why creating the generator might be faster in some cases than creating the xrange object.

In the xrange case, you're calling the function and then you have to look up the global name xrange, the global TOTAL, and then you need to call that builtin -- so there are more things being executed in this case.

As for memory -- in both of the lazy approaches, the memory used will be dominated by the Python runtime, not by the size of your generator objects. The only case where the memory use is impacted appreciably by your script is the one where you construct a list of 100 million items.
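One way to see that extra call-time work is to disassemble the two functions with the dis module (a quick sketch, assuming Python 2 since xrange is involved):

import dis

TOTAL = 100000000

def my_sequence():
    i = 0
    while i < TOTAL:
        yield i
        i += 1

def my_xrange():
    return xrange(TOTAL)

# Calling my_xrange() runs its body: LOAD_GLOBAL xrange, LOAD_GLOBAL TOTAL,
# CALL_FUNCTION and RETURN_VALUE all execute on every call.
dis.dis(my_xrange)

# Calling my_sequence() does not run its body at all -- it only builds and
# returns a generator object, so none of this bytecode executes at call time.
dis.dis(my_sequence)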
Also note that I can't actually confirm your results consistently on my system... Using timeit, I actually get that my_xrange is sometimes² faster to construct (by ~30%).
Adding the following to the bottom of your script:
from timeit import timeit
print timeit('my_xrange()', setup='from __main__ import my_xrange')
print timeit('my_sequence()', setup='from __main__ import my_sequence')
And my results are (for CPython on OS X El Capitan):
0.227491140366
0.356791973114
However, pypy seems to favor the generator construction (I tried it with both my_xrange first and my_sequence first and got fairly consistent results, though the first one to run does seem to be at a bit of a disadvantage -- maybe due to JIT warm-up time or something):
0.00285911560059
0.00137305259705
¹Here, I would expect xrange to have the edge -- but again, nothing is true until you timeit, and then it's only true if the timing differences are significant, and it's only true on the computer where you did the timings.

²See opening disclaimer :-P
As I mentioned in my comment above, with your generator function and with xrange, you're not actually creating the sequence, merely creating the object. @mgilson's answer covers the calls related to creating them.
As for actually doing something with them:
>>> TOTAL = 100000
>>> # your functions here
...
>>> import timeit
>>> timeit.timeit("list(my_seq())", setup="from __main__ import my_seq", number=1000)
9.783777457339898
>>> timeit.timeit("list(my_xrange())", setup="from __main__ import my_xrange", number=1000)
1.2652621698083024
>>> timeit.timeit("list(my_list())", setup="from __main__ import my_list", number=1000)
2.666709824464867
>>> timeit.timeit("my_list()", setup="from __main__ import my_list", number=1000)
1.2324339537661615
You'll see that I'm creating a list out of each, so I'm actually processing the sequences. The generator function takes nearly 10x the time of xrange.

list(my_list()) is redundant since my_list already returns the list produced by range, so I timed it one more time without the call to list().

range is nearly the same as xrange here, but only because I reduced TOTAL. The biggest difference is that range consumes more memory, since it creates the entire list up front, and that is the only part where it takes longer. Creating a list from an xrange is effectively the same as calling range, so the final memory used is the same, and since I'm merely creating a list out of the xrange, it's hard to see the difference in this trivial case.
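To make the memory point concrete, here's a small sketch (Python 2; note that sys.getsizeof only counts the list object itself, not the int objects it references):

import sys

TOTAL = 100000                 # kept small so building the list is quick

lazy = xrange(TOTAL)           # constant-size object, values made on demand
eager = range(TOTAL)           # the full list exists immediately

print(sys.getsizeof(lazy))     # a few dozen bytes, independent of TOTAL
print(sys.getsizeof(eager))    # grows with TOTAL (one pointer per element)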