I have a script that performs a lot of deletions on a dict and eventually iterates over it.
I've managed to reduce it to a simple benchmark:
> py -m timeit -s "a = {i:i for i in range(10000000)};[a.pop(i) for i in range(10000000-1)]" "next(iter(a))"
10 loops, best of 5: 30.8 msec per loop
How come fetching a single key becomes this slow after I've deleted all of the previous entries?
When keys are removed, their slots are not freed but replaced with dummy markers, and upon traversal the iterator keeps skipping these dummies until it finds the next real-valued key. Since there may be lots of such empty slots to walk over without yielding anything, iterating a dictionary is generally slower than iterating its array/list counterpart.
A little benchmark shows me that iterating over a list is definitely faster.
Dictionaries in Python find keys in O(1) on average. But complexity is not the only factor in execution time. For example, accessing a list item by its index is also O(1), yet it is considerably faster than accessing a dictionary by its key.
Lookup is a different matter: searching a list for a value is O(n), so a dictionary lookup can be around 6.6 times faster than a list search even with just 100 items.
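To make the distinction concrete, here is a small sketch of my own (not from the original post) that times both access patterns with the timeit module:

import timeit

setup = "lst = list(range(100)); dct = {i: i for i in range(100)}"

# Positional access: both O(1), but the list needs no hashing or probing.
t_list = timeit.timeit("lst[50]", setup=setup)
t_dict = timeit.timeit("dct[50]", setup=setup)
print(f"list index access: {t_list:.3f}s   dict key access: {t_dict:.3f}s")

# Membership search: 'in' on a list scans linearly (O(n)), while 'in'
# on a dict goes through the hash table (O(1) on average).
t_list_in = timeit.timeit("99 in lst", setup=setup)
t_dict_in = timeit.timeit("99 in dct", setup=setup)
print(f"list search: {t_list_in:.3f}s   dict search: {t_dict_in:.3f}s")

The 6.6x figure above refers to the second comparison, and the gap grows with the number of items, since the list scan is linear.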
Since CPython 3.6, dictionaries are implemented with an internal hash index plus a dense array of entries.
When a key is removed from the dictionary, its entry in that array is not compacted away but replaced with a dummy value marking it as deleted.
Upon iteration, the iterator skips all of these dummy values one by one until it finds the next real item.
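You can observe the dead slots indirectly in CPython with sys.getsizeof: the buffer is not compacted on deletion, so the dict's size stays the same even after nearly every key has been popped. A quick sketch (my own illustration, not part of the answer):

import sys

a = {i: i for i in range(1_000_000)}
before = sys.getsizeof(a)

# Pop every key except the last one, as in the question's benchmark.
for i in range(1_000_000 - 1):
    a.pop(i)

# In CPython, deletion only marks slots as dummies; the buffer is
# reused as-is, so iteration still has to walk past the dead slots.
print(len(a))                      # 1
print(before, sys.getsizeof(a))   # same size before and after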
That's why, if you skip the first value and remove only the rest, a surviving key (0) sits at the very beginning of the entries array, so the iterator finds it immediately and iteration is as fast as over a single-item dictionary:
> py -m timeit -s "a = {i:i for i in range(10000000)};[a.pop(i) for i in range(1,10000000-1)]" "next(iter(a))"
1000000 loops, best of 5: 219 nsec per loop
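This also suggests a practical workaround, sketched below on the question's benchmark (my own addition, not part of the answer): rebuilding the dictionary copies only the live entries into a fresh, compact table, so iteration becomes fast again.

import timeit

setup = (
    "a = {i: i for i in range(10_000_000)};"
    "[a.pop(i) for i in range(10_000_000 - 1)]"
)

# Iterating the original dict has to skip ~10 million dummy slots.
slow = timeit.timeit("next(iter(a))", setup=setup, number=100)

# dict(a) walks the dummies once, but the copy has no dead slots.
fast = timeit.timeit("next(iter(b))", setup=setup + "; b = dict(a)", number=100)

print(f"with dummies: {slow:.4f}s   after dict(a): {fast:.6f}s")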
For more information about the internal dictionary structure, see this wonderful answer.