I was trying to find the quickest way to count the number of items in a list matching a specific filter. In this case, finding how many odd numbers there are in a list.
While doing this, I was surprised by the results of comparing a list comprehension vs the equivalent generator expression:
python -m timeit -s "L = xrange(1000000)" "sum([1 for i in L if i & 1])"
10 loops, best of 3: 109 msec per loop
python -m timeit -s "L = xrange(1000000)" "sum(1 for i in L if i & 1)"
10 loops, best of 3: 125 msec per loop
I have also tried with L being a regular list, and different sizes, but in all cases the list comprehension wins.
What is the genexp doing that causes it to be slower compared to the listcomp that creates a new list with 1 million items...?
(Btw, the fastest way I found was: x = 1; len(filter(x.__and__, L)). And yes, I know writing code like that kills kittens, I'm doing it for the fun of it.)
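(For what it's worth, that trick depends on Python 2's filter returning a list, so len() can be applied directly; under Python 3, filter returns a lazy iterator. A rough Python 3 equivalent, if you wanted one, would be:

x = 1
len(list(filter(x.__and__, L)))   # materialize the iterator first
# or, avoiding the intermediate list entirely:
sum(1 for _ in filter(x.__and__, L))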
List comprehensions are usually faster than for loops for building lists. The explicit loop builds the list by appending one element per iteration, paying for an .append attribute lookup and method call each time, while in CPython the comprehension appends through a specialized bytecode with no per-item lookup. Below, the same operation is performed by a list comprehension and by a for loop.
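A minimal sketch of that comparison, counting the odd numbers both ways:

# Explicit for loop: one .append attribute lookup + method call per item.
result = []
for i in range(1000000):
    if i & 1:
        result.append(1)
total_loop = sum(result)

# List comprehension: same logic, but the appends are handled by the
# comprehension's own bytecode, with no per-item .append lookup.
total_comp = sum([1 for i in range(1000000) if i & 1])

assert total_loop == total_comp == 500000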
Conclusions: list comprehensions are often not only more readable but also faster than for loops. They can simplify your code, but if you put too much logic inside one, it becomes harder to read and understand, as in the example below.
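For instance, this (made-up) one-liner crams nesting and two filter clauses into a single expression:

matrix = [[3, 12, 15], [21, 4, 30]]
flat = [x * x for row in matrix for x in row if x % 3 == 0 if x > 10]

The explicit-loop spelling of the same thing is longer but arguably easier to follow:

flat = []
for row in matrix:
    for x in row:
        if x % 3 == 0 and x > 10:
            flat.append(x * x)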
A list comprehension returns the entire list at once, while a generator expression returns only a generator object. The values it yields are the same as those in the list, but they are produced one at a time via next(). When memory is plentiful, that per-item resumption overhead is what lets the list comprehension come out slightly ahead here.
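A tiny illustration of the difference:

# A generator expression evaluates to a generator object; values are
# produced lazily, one per next() call, by resuming the generator's frame.
gen = (1 for i in range(10) if i & 1)
print(next(gen))   # 1

# A list comprehension materializes every value up front.
lst = [1 for i in range(10) if i & 1]
print(len(lst))    # 5 -- all values already exist in memory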
When essentially unlimited memory is available (which is invariably the case in tiny benchmarks, though often not in real-world problems!), lists tend to outperform generators because they can be allocated just once, in one "big bunch" (no memory fragmentation, etc.), while generators must do extra work internally to avoid that big-bunch approach: they preserve stack-frame state between items so that execution can be resumed.
Whether a list approach or a generator approach will be faster in a real program depends on the exact memory situation, including fragmentation, which is all but impossible to reproduce accurately in a micro-benchmark. In other words, if you truly care about performance, you must carefully benchmark (and, separately, profile) your actual program(s), not just "toy" micro-benchmarks.
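As a minimal sketch of that advice (the function names here are made up for illustration), time both variants on your actual workload and profile separately:

import cProfile
import timeit

def count_odds_list(L):
    return sum([1 for i in L if i & 1])

def count_odds_gen(L):
    return sum(1 for i in L if i & 1)

data = range(1000000)  # stand-in for your program's real data

# Benchmark: how fast is each variant on this workload?
print(timeit.timeit(lambda: count_odds_list(data), number=10))
print(timeit.timeit(lambda: count_odds_gen(data), number=10))

# Profile (separately): where does the time actually go?
cProfile.run("count_odds_gen(data)")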
From what I remember, a generator frame has to be activated for each result, whereas the list comprehension uses a single activation frame. The incremental cost on the list-comprehension side is the added memory: references to int in your case. The relation may well reverse if each item is a new instance and uses more memory.
Update: after testing, it did indeed reverse:
~% python -m timeit -s "L = xrange(1000000);oint=type('intEx', (int,),{})" "sum([oint(1) for i in L if i & 1])"
10 loops, best of 3: 414 msec per loop
~% python -m timeit -s "L = xrange(1000000);oint=type('intEx', (int,),{})" "sum(oint(1) for i in L if i & 1)"
10 loops, best of 3: 392 msec per loop
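For reference, the type('intEx', (int,), {}) in the setup just builds a trivial int subclass inline; written out longhand it is equivalent to:

class intEx(int):
    pass

oint = intEx  # each oint(1) is a fresh heap object, not a cached small int

Since every summed item now costs a real allocation, the list comprehension's one-big-allocation advantage is swamped, and the generator's lazy, one-at-a-time approach pulls slightly ahead.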