While running a numerical integrator, I noticed a noticeable difference in speed depending on how I extract the value of the field in a dictionary <pre class="prettyprint"><code>import numpy as np def bad_get(mydict): '''Extract the name field using get()''' output = mydict.get('name', None) return output def good_get(mydict): '''Extract the name field using if-else''' if 'name' in mydict: output = mydict['name'] else: output = None return output name_dict = dict() name_dict['name'] = np.zeros((5000,5000)) </code></pre> On my system, I notice the following difference (using iPython) <pre class="prettyprint"><code>%%timeit bad_get(name_dict) The slowest run took 7.75 times longer than the fastest. This could mean that an intermediate result is being cached. 1000000 loops, best of 3: 247 ns per loop </code></pre> Compared to <pre class="prettyprint"><code>%%timeit good_get(name_dict) 1000000 loops, best of 3: 188 ns per loop </code></pre> This may seem like a small difference, but in for some arrays the difference appears to be even more dramatic. What causes this behavior, and is there some way I should alter my use of the <code>get()</code> function?

Python has to do more work for <code>dict.get()</code>: <ul> <li> <code>get</code> is an attribute, so Python has to look this up, and then bind the descriptor found to the dictionary instance.</li> <li> <code>()</code> is a call, so the current frame has to be pushed on the stack, a call has to be made, then the frame has to be popped again from the stack to continue.</li> </ul> The <code>[...]</code> notation, used with a <code>dict</code>, doesn't require a separate attribute step or frame push and pop. You can see the difference when you use the Python bytecode disassembler <code>dis</code>: <pre class="prettyprint"><code>>>> import dis >>> dis.dis(compile('d[key]', '', 'eval')) 1 0 LOAD_NAME 0 (d) 3 LOAD_NAME 1 (key) 6 BINARY_SUBSCR 7 RETURN_VALUE >>> dis.dis(compile('d.get(key)', '', 'eval')) 1 0 LOAD_NAME 0 (d) 3 LOAD_ATTR 1 (get) 6 LOAD_NAME 2 (key) 9 CALL_FUNCTION 1 12 RETURN_VALUE </code></pre> so the <code>d[key]</code> expression only has to execute a <code>BINARY_SUBSCR</code> opcode, while <code>d.get(key)</code> adds a <code>LOAD_ATTR</code> opcode. <code>CALL_FUNCTION</code> is a lot more expensive than <code>BINARY_SUBSCR</code> on a built-in type (custom types with <code>__getitem__</code> methods still end up doing a function call). If the majority of your keys exist in the dictionary, you could use <code>try...except KeyError</code> to handle missing keys: <pre class="prettyprint"><code>try: return mydict['name'] except KeyError: return None </code></pre> Exception handling is cheap if there are no exceptions.

Why does dict.get(key) run slower than dict[key]

Tags:

performance

python

dictionary

While running a numerical integrator, I noticed a noticeable difference in speed depending on how I extract the value of the field in a dictionary

import numpy as np

def bad_get(mydict):
    '''Extract the name field using get()'''
    output = mydict.get('name', None)
    return output

def good_get(mydict):
    '''Extract the name field using if-else'''
    if 'name' in mydict:
        output = mydict['name']
    else:
        output = None
    return output


name_dict = dict()
name_dict['name'] = np.zeros((5000,5000))

On my system, I notice the following difference (using iPython)

%%timeit
bad_get(name_dict) 

The slowest run took 7.75 times longer than the fastest. This could mean that an intermediate result is being cached.
1000000 loops, best of 3: 247 ns per loop

Compared to

%%timeit
good_get(name_dict)  

1000000 loops, best of 3: 188 ns per loop

This may seem like a small difference, but in for some arrays the difference appears to be even more dramatic. What causes this behavior, and is there some way I should alter my use of the get() function?

780

asked Apr 12 '16 07:04

wil3

1 Answers

Python has to do more work for dict.get():

get is an attribute, so Python has to look this up, and then bind the descriptor found to the dictionary instance.
() is a call, so the current frame has to be pushed on the stack, a call has to be made, then the frame has to be popped again from the stack to continue.

The [...] notation, used with a dict, doesn't require a separate attribute step or frame push and pop.

You can see the difference when you use the Python bytecode disassembler dis:

>>> import dis
>>> dis.dis(compile('d[key]', '', 'eval'))
  1           0 LOAD_NAME                0 (d)
              3 LOAD_NAME                1 (key)
              6 BINARY_SUBSCR
              7 RETURN_VALUE
>>> dis.dis(compile('d.get(key)', '', 'eval'))
  1           0 LOAD_NAME                0 (d)
              3 LOAD_ATTR                1 (get)
              6 LOAD_NAME                2 (key)
              9 CALL_FUNCTION            1
             12 RETURN_VALUE

so the d[key] expression only has to execute a BINARY_SUBSCR opcode, while d.get(key) adds a LOAD_ATTR opcode. CALL_FUNCTION is a lot more expensive than BINARY_SUBSCR on a built-in type (custom types with __getitem__ methods still end up doing a function call).

If the majority of your keys exist in the dictionary, you could use try...except KeyError to handle missing keys:

try:
    return mydict['name']
except KeyError:
    return None

Exception handling is cheap if there are no exceptions.

182

answered Sep 30 '22 19:09

Martijn Pieters

Related questions
                            
                                Is there a function in Python to split a string without ignoring the spaces?
                            
                                How can I capture the stdout output of a child process?
                            
                                Should I Start With Python 3.0? [closed]
                            
                                Set paragraph font in python-docx
                            
                                Boto3 S3: Get files without getting folders
                            
                                What Python GUI APIs Are Out There? [closed]
                            
                                When is the `==` operator not equivalent to the `is` operator? (Python)
                            
                                Django 'if and' template
                            
                                Make Python Program Wait
                            
                                Computing Eulers Totient Function
                            
                                Issue trying to change language from Django template
                            
                                Measure runtime of a Jupyter Notebook code cell
                            
                                What's the reason of the error ValueError: Expected more than 1 value per channel?
                            
                                Most pythonic way of function with no return?
                            
                                Is there a better way to write this "if" boolean evaluation?
                            
                                How to convert unicode accented characters to pure ascii without accents?
                            
                                Factorial function works in Python, returns 0 for Julia
                            
                                Proper way for user authentication with angularjs and flask
                            
                                Why does Python have a maximum recursion depth?
                            
                                String replace doesn't appear to be working

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With