Just say I have a list <pre class="prettyprint"><code>a = (3, 2, 9, 4) </code></pre> And I want to add one to each number and store the result, (I won't need to manipulate the result after), my first thought would be to go: <pre class="prettyprint"><code>[x + 1 for x in a] </code></pre> But what about: <pre class="prettyprint"><code>tuple(x + 1 for x in a) </code></pre> Tuples are meant to be faster right? And if I don't need to change the result after is this code more efficient? Also how does it really work, does the <code>tuple</code> constructor have to create a list out of the generator expression to know the size in advance? Thanks in advance for any explanation.

just <code>timeit()</code>: <pre class="prettyprint"><code>In : a = (3, 2, 9, 4) In : f1 = lambda: [x + 1 for x in a] In : f2 = lambda: tuple(x + 1 for x in a) In : timeit.timeit(f1) Out: 0.595026969909668 In : timeit.timeit(f2) Out: 2.360887050628662 </code></pre> so it seems that the tuple constructor variant takes about four times as long, I guess because list comprehensions are fairly optimized (in cpython). But let's take a closer look: <pre class="prettyprint"><code>In : f3 = lambda: list(x + 1 for x in a) In : timeit.timeit(f3) Out: 2.5421998500823975 </code></pre> so this takes about the same time as the tuple construction, which indicates that the performance penalty lies in the generator expression overhead. (we can rule out list / tuple construction, see edit below) It is even about twice as slow as to <code>map()</code> the list: <pre class="prettyprint"><code>In : inc = partial(operator.add,1) In : f4 = lambda:map(inc, a) In : timeit.timeit(f4) Out: 1.2346529960632324 </code></pre> I think this really boils down to (cpython) implementation details, so don't rely on this. Anyway - don't worry about performance, it's just a factor of 2-4, use the method which is best to read. If you really hit performance bottlenecks, investigate and optimize them after you noticed them. And I bet a factor 4 in a list manipulation will be the least of your problems, then. Edit: Someone mentioned that the lookup cost of "tuple" could cause the slowdown, but this is not the case: <pre class="prettyprint"><code>In : f5 = lambda: tuple([x + 1 for x in a]) In : timeit.timeit(f5) Out: 0.7900090217590332 </code></pre> So I guess it really is the generator expressions overhead which slows things down.

The <code>dis</code> module can give you some idea of how the code executes internally... <code>dis.dis(lambda a: [x + 1 for x in a])</code> yields... <pre class="prettyprint"><code> 1 0 BUILD_LIST 0 3 LOAD_FAST 0 (a) 6 GET_ITER >> 7 FOR_ITER 16 (to 26) 10 STORE_FAST 1 (x) 13 LOAD_FAST 1 (x) 16 LOAD_CONST 1 (1) 19 BINARY_ADD 20 LIST_APPEND 2 23 JUMP_ABSOLUTE 7 >> 26 RETURN_VALUE </code></pre> ...and <code>dis.dis(lambda a: tuple(x + 1 for x in a))</code> yields... <pre class="prettyprint"><code> 1 0 LOAD_GLOBAL 0 (tuple) 3 LOAD_CONST 1 (<code object <genexpr> at 0x7f62e9eda930, file "<stdin>", line 1>) 6 MAKE_FUNCTION 0 9 LOAD_FAST 0 (a) 12 GET_ITER 13 CALL_FUNCTION 1 16 CALL_FUNCTION 1 19 RETURN_VALUE </code></pre> ...but you may not be able to infer much from that. If you want to know which is faster, check out the <code>timeit</code> module.

Tuple constructor vs list comp

Tags:

python

list

tuples

Just say I have a list

Click to copy

a = (3, 2, 9, 4)

And I want to add one to each number and store the result, (I won't need to manipulate the result after), my first thought would be to go:

Click to copy

[x + 1 for x in a]

But what about:

Click to copy

tuple(x + 1 for x in a)

Tuples are meant to be faster right? And if I don't need to change the result after is this code more efficient? Also how does it really work, does the tuple constructor have to create a list out of the generator expression to know the size in advance? Thanks in advance for any explanation.

908

asked Apr 12 '13 14:04

197

2 Answers

just timeit():

Click to copy

In : a = (3, 2, 9, 4)

In : f1 = lambda: [x + 1 for x in a]

In : f2 = lambda: tuple(x + 1 for x in a)

In : timeit.timeit(f1)
Out: 0.595026969909668

In : timeit.timeit(f2)
Out: 2.360887050628662

so it seems that the tuple constructor variant takes about four times as long, I guess because list comprehensions are fairly optimized (in cpython).

But let's take a closer look:

Click to copy

In : f3 = lambda: list(x + 1 for x in a)

In : timeit.timeit(f3)
Out: 2.5421998500823975

so this takes about the same time as the tuple construction, which indicates that the performance penalty lies in the generator expression overhead. (we can rule out list / tuple construction, see edit below)

It is even about twice as slow as to map() the list:

Click to copy

In : inc = partial(operator.add,1)

In : f4 = lambda:map(inc, a)

In : timeit.timeit(f4)
Out: 1.2346529960632324

I think this really boils down to (cpython) implementation details, so don't rely on this. Anyway - don't worry about performance, it's just a factor of 2-4, use the method which is best to read.

If you really hit performance bottlenecks, investigate and optimize them after you noticed them. And I bet a factor 4 in a list manipulation will be the least of your problems, then.

Edit: Someone mentioned that the lookup cost of "tuple" could cause the slowdown, but this is not the case:

Click to copy

In : f5 = lambda: tuple([x + 1 for x in a])

In : timeit.timeit(f5)
Out: 0.7900090217590332

So I guess it really is the generator expressions overhead which slows things down.

157

answered Sep 29 '22 19:09

ch3ka

The dis module can give you some idea of how the code executes internally...

dis.dis(lambda a: [x + 1 for x in a]) yields...

Click to copy

  1           0 BUILD_LIST               0
              3 LOAD_FAST                0 (a)
              6 GET_ITER
        >>    7 FOR_ITER                16 (to 26)
             10 STORE_FAST               1 (x)
             13 LOAD_FAST                1 (x)
             16 LOAD_CONST               1 (1)
             19 BINARY_ADD
             20 LIST_APPEND              2
             23 JUMP_ABSOLUTE            7
        >>   26 RETURN_VALUE

...and dis.dis(lambda a: tuple(x + 1 for x in a)) yields...

Click to copy

  1           0 LOAD_GLOBAL              0 (tuple)
              3 LOAD_CONST               1 (<code object <genexpr> at 0x7f62e9eda930, file "<stdin>", line 1>)
              6 MAKE_FUNCTION            0
              9 LOAD_FAST                0 (a)
             12 GET_ITER
             13 CALL_FUNCTION            1
             16 CALL_FUNCTION            1
             19 RETURN_VALUE

...but you may not be able to infer much from that. If you want to know which is faster, check out the timeit module.

answered Sep 29 '22 18:09

Aya

Related questions
                            
                                Mask specific columns of a numpy array
                            
                                python how to compute a simple checksum as quickly as zlib.adler32
                            
                                Scipy dendrogram leaf label colours
                            
                                Why is this else: pass needed for processing to continue? [closed]
                            
                                Date 6 months into the future
                            
                                Deleting non existing record should raise an error in sqlalchemy
                            
                                Python: Sum values in a dictionary based on condition
                            
                                Python - List comprehension within File I/O code
                            
                                "Protected" access in Python - how?
                            
                                Flask-SQLAlchemy not creating tables using create_all()
                            
                                Flask Jinja Template '<br>'.join
                            
                                How to set up multiple PATHs in the user bash_profile in OSX 10.8?
                            
                                Sublime Text 2 :: Python code completion [duplicate]
                            
                                How to use the dir/s command in Python?
                            
                                Python properties and string formatting
                            
                                Update json file
                            
                                Creating Lexicon and Scanner in Python
                            
                                Performance of library itertools compared to python code
                            
                                Python - How to determine hierarchy level of parsed XML elements?
                            
                                python one line save values of lists in dict to list

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Tuple constructor vs list comp

Tags:

python

list

tuples

197

People also ask

2 Answers

ch3ka

Aya

Recent Activity

Donate For Us