If <code>foo</code> is a builtin <code>set</code> that I know contains <code>"bar"</code>, which of these is faster? Which is more Pythonic? <pre class="prettyprint"><code>foo.add("bar") </code></pre> or <pre class="prettyprint"><code>if "bar" not in foo: foo.add("bar") </code></pre>

The pythonic way is to do first, ask later. Just add it to the set. Asking first is more common in languages such as C. Performance is usually not key in python code. Readability is usually much more important, so writing ideomatic code is good practice.

Should I check if an item is already in a set before adding it?

Tags:

python

set

If foo is a builtin set that I know contains "bar", which of these is faster? Which is more Pythonic?

foo.add("bar")

if "bar" not in foo:
    foo.add("bar")

590

asked Apr 28 '15 19:04

javanix

2 Answers

Actually, the second may be faster (output from IPython):

In [2]: %timeit s.add("a")
The slowest run took 68.27 times longer than the fastest. This could mean that an intermediate result is being cached 
10000000 loops, best of 3: 73.3 ns per loop

In [3]: %timeit if not "a" in s: s.add("a")
10000000 loops, best of 3: 37.1 ns per loop

But anyway, the first one is more Pythonic, I agree.

100

answered Oct 24 '22 23:10

honza_p

The pythonic way is to do first, ask later. Just add it to the set.

Asking first is more common in languages such as C.

Performance is usually not key in python code. Readability is usually much more important, so writing ideomatic code is good practice.

answered Oct 24 '22 23:10

Filip Haglund

Related questions
                            
                                Concatenate (join) a NumPy array with a pandas DataFrame
                            
                                Multiple columns with the same name in Pandas
                            
                                Pandas DataFrame with tuple of strings as index
                            
                                python sqlite3 OperationalError: attempt to write a readonly database
                            
                                Python: Stacktrace vs Traceback
                            
                                Django admin add custom filter
                            
                                Stop a python script without losing data
                            
                                Python heapify() time complexity
                            
                                Extract line from txt file using python
                            
                                How do I turn a python program into an .egg file?
                            
                                Python List - "reserving" space ( ~ resizing)
                            
                                python operator, no operator for "not in"
                            
                                Python PEP 8 docstring line length [closed]
                            
                                Django: Retrieving IDs of manyToMany fields quickly
                            
                                Why python designed as str(None) return 'None' instead of an empty string?
                            
                                Python Gzip - Appending to file on the fly
                            
                                How to access python package metadata from within the python console?
                            
                                Celery task state always pending
                            
                                How to select range in Pandas using a row
                            
                                Is it more memory-efficient to set variables to `None` in python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With