What is more efficient in terms of memory and speed: d[(first, second)] or d[first][second], where d is either a single dictionary keyed by tuples or a dictionary of dictionaries?
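To make the two layouts concrete, here is a minimal sketch; the keys and values ("alice", the scores) are invented for illustration:

# Layout 1: one flat dictionary keyed by (first, second) tuples.
flat = {("alice", "math"): 91, ("alice", "physics"): 87}
print(flat["alice", "math"])      # one probe: hash the tuple once -> 91

# Layout 2: a dictionary of dictionaries.
nested = {"alice": {"math": 91, "physics": 87}}
print(nested["alice"]["math"])    # two probes: hash "alice", then "math" -> 91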
For repeatedly looking up data with millions of entries, dictionaries are the fastest option in Python: they are the built-in mapping type and are therefore highly optimized.
Dictionaries are Python's implementation of a data structure that is more generally known as an associative array. A dictionary consists of a collection of key-value pairs. Each key-value pair maps the key to its associated value.
If you choose nested dictionaries, setdefault() is a convenient way to build them: it creates an empty dictionary at the first nesting level on demand, so there is no need to define the inner dictionaries explicitly.
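A minimal sketch of that pattern (the keys are again invented for illustration):

d = {}
# setdefault returns d["alice"] if it exists; otherwise it inserts
# the empty dict passed as the default and returns that.
d.setdefault("alice", {})["math"] = 91
d.setdefault("alice", {})["physics"] = 87   # inner dict already exists, so it is reused
print(d)   # {'alice': {'math': 91, 'physics': 87}}

Note that setdefault constructs the throwaway empty dict on every call even when the key already exists; for hot paths, collections.defaultdict(dict) avoids that.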
As of Python version 3.7, dictionaries are ordered. In Python 3.6 and earlier, dictionaries are unordered. When we say that dictionaries are ordered, it means that the items have a defined order, and that order will not change.
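For example, on Python 3.7 or later:

d = {}
d["zebra"] = 1
d["apple"] = 2
print(list(d))   # ['zebra', 'apple']: insertion order, not sorted order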
Here is some very basic test data indicating that, for a very contrived example (storing 'a' a million times using numbers as keys), a dictionary of dictionaries is significantly faster.
$ python -m timeit 'd = {i:{j:"a" for j in range(1000)} for i in range(1000)};a = [d[i][j] for j in range(1000) for i in range(1000)];'
10 loops, best of 3: 316 msec per loop
$ python -m timeit 'd = {(i, j):"a" for j in range(1000) for i in range(1000)};a = [d[i, j] for j in range(1000) for i in range(1000)];'
10 loops, best of 3: 970 msec per loop
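Those figures are from an older interpreter (the "best of 3" output is Python 2 style). On a current Python 3 you can reproduce the comparison with the timeit module, separating the build cost from the lookup cost; a sketch (absolute numbers will vary by machine and version):

import timeit

nested_build = "d = {i: {j: 'a' for j in range(1000)} for i in range(1000)}"
tuple_build  = "d = {(i, j): 'a' for i in range(1000) for j in range(1000)}"

# Cost of building each structure.
print(timeit.timeit(nested_build, number=10))
print(timeit.timeit(tuple_build, number=10))

# Cost of the million lookups alone, with the build moved into setup=.
print(timeit.timeit("[d[i][j] for i in range(1000) for j in range(1000)]",
                    setup=nested_build, number=10))
print(timeit.timeit("[d[i, j] for i in range(1000) for j in range(1000)]",
                    setup=tuple_build, number=10))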
Of course, these tests do not necessarily mean much for what you are actually trying to do. Determine what you'll be storing, and then test.
A little more data:
$ python -m timeit 'a = [(hash(i), hash(j)) for i in range(1000) for j in range(1000)]'
10 loops, best of 3: 304 msec per loop
$ python -m timeit 'a = [hash((i, j)) for i in range(1000) for j in range(1000)]'
10 loops, best of 3: 172 msec per loop
$ python -m timeit 'd = {i:{j:"a" for j in range(1000)} for i in range(1000)}'
10 loops, best of 3: 101 msec per loop
$ python -m timeit 'd = {(i, j):"a" for j in range(1000) for i in range(1000)}'
10 loops, best of 3: 645 msec per loop
Once again, this is clearly not indicative of real-world use, but it seems that the cost of building a dictionary with tuple keys like that is huge, and that is where the dictionary-in-a-dictionary wins out. This surprises me; I was expecting completely different results. I'll have to try a few more things when I have time.
Somewhat surprisingly, the dictionary of dictionaries is faster than the tuple-keyed dictionary in both CPython 2.7 and PyPy 1.8.
I didn't check on space, but you can do that with ps.
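ps measures the whole process; from inside Python, tracemalloc gives a rough per-structure comparison. A sketch (the sizes it reports are approximate and vary by interpreter):

import tracemalloc

tracemalloc.start()
d = {(i, j): "a" for i in range(1000) for j in range(1000)}
current, peak = tracemalloc.get_traced_memory()
print("tuple-keyed dict: ~%.1f MiB" % (current / 2**20))
tracemalloc.stop()

tracemalloc.start()
d = {i: {j: "a" for j in range(1000)} for i in range(1000)}
current, peak = tracemalloc.get_traced_memory()
print("dict of dicts:    ~%.1f MiB" % (current / 2**20))
tracemalloc.stop()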