Is there a difference between [] and list() when using id()?

Tags: python, list, python-internals

Can somebody explain the following?

Why is the id the same, but the lists are different?

>>> [] is []
False
>>> id([]) == id([])
True

Is there a difference in list creation?

>>> id(list()) == id(list())
False
>>> id([]) == id([])
True

Why is this happening? I get two different lists. Why not only one, or three or more?

>>> [].__repr__
<method-wrapper '__repr__' of list object at 0x7fd2be868128>
>>> [].__repr__
<method-wrapper '__repr__' of list object at 0x7fd2be868170>
>>> [].__repr__
<method-wrapper '__repr__' of list object at 0x7fd2be868128>
>>> [].__repr__
<method-wrapper '__repr__' of list object at 0x7fd2be868170>

asked Nov 26 '16 17:11

Vlad Okrimenko

1 Answer

You used id() wrong. id([]) takes the memory id of an object that is discarded immediately. After all, nothing is referencing it anymore once id() is done with it. So the next time you use id([]) Python sees an opportunity to re-use the memory and lo and behold, those addresses are indeed the same.

However, this is an implementation detail, one you can't rely on, and Python won't always be able to reuse the memory address.

Note that id() values are only unique for the lifetime of the object; see the documentation:

This is an integer which is guaranteed to be unique and constant for this object during its lifetime. Two objects with non-overlapping lifetimes may have the same id() value.

(Emphasis mine: the second sentence is the one that matters here.)
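To make the lifetime rule concrete, here is a minimal sketch. The names alive, ids_alive and ids_temp are just illustrative. The first half keeps every list alive so their lifetimes overlap; the second half lets each list die right after id() returns, which is the situation in the question.

```python
# Keep five lists alive at the same time, so their lifetimes overlap.
alive = [[] for _ in range(5)]
ids_alive = {id(lst) for lst in alive}
print(len(ids_alive))  # 5: ids are distinct while all five lists exist

# Here each temporary list is discarded as soon as id() returns,
# so the interpreter is free to hand out the same address again.
ids_temp = {id([]) for _ in range(5)}
print(len(ids_temp))  # frequently 1 on CPython, but that is not guaranteed
```

Only the first count is something you can rely on; the second depends entirely on the implementation's memory allocator.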

That id(list()) can't re-use the memory location is probably due to the extra heap mutations caused by pushing the current frame on the stack to call a function, then popping it again when the list() call returns.
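This doesn't prove the explanation above, but you can at least see that the two spellings take different routes through the interpreter. The sketch below uses the standard dis module; opnames is a helper name I made up for the example. On CPython the literal compiles to a single BUILD_LIST instruction, while list() is a name lookup followed by a call (the exact call opcode name varies between versions).

```python
import dis

def opnames(source):
    """Return the bytecode instruction names for an expression."""
    code = compile(source, "<example>", "eval")
    return [ins.opname for ins in dis.get_instructions(code)]

literal_ops = opnames("[]")      # built directly by the interpreter
call_ops = opnames("list()")     # looked up by name, then called

print(literal_ops)
print(call_ops)
```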

Both [] and list() produce a new empty list object, but to compare them you first need to create references to those separate lists (here a and b):

>>> a, b = [], []
>>> a is b
False
>>> id(a) == id(b)
False
>>> a, b = list(), list()
>>> a is b
False
>>> id(a) == id(b)
False

The same happens when you use [].__repr__. The Python interactive interpreter has a special global name, _, that you can use to reference the last result produced:

>>> [].__repr__
<method-wrapper '__repr__' of list object at 0x10e011608>
>>> _
<method-wrapper '__repr__' of list object at 0x10e011608>

That creates an extra reference, so the __repr__ method, and by extension, the empty list you created for it, are still considered active. The memory location is not freed and not available for the next list you create.

But when you execute [].__repr__ again, Python binds _ to the new method object. Suddenly the previous __repr__ method is no longer referenced by anything and can be freed, and so can the list object.

The third time you execute [].__repr__ the first memory location is available again for reuse, so Python does just that:

>>> [].__repr__  # create a new method
<method-wrapper '__repr__' of list object at 0x10e00cb08>
>>> _            # now _ points to the new method
<method-wrapper '__repr__' of list object at 0x10e00cb08>
>>> [].__repr__  # so the old address can be reused
<method-wrapper '__repr__' of list object at 0x10e011608>
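You can watch the same hand-over in a script, where there is no _. This is a rough sketch under a few assumptions: the variable last plays the role of _, TrackedList and make_method are made-up names, and a list subclass is used only because plain lists can't be weakly referenced. The immediate clean-up on rebinding is CPython's reference counting at work; other implementations may free the object later.

```python
import weakref

class TrackedList(list):
    """Hypothetical list subclass; plain lists cannot be weakly referenced."""

freed = []  # names of lists that have been deallocated so far

def make_method(name):
    lst = TrackedList()
    # Record the name once lst is garbage collected.
    weakref.finalize(lst, freed.append, name)
    return lst.__repr__  # the bound method keeps lst alive

last = make_method("first")   # like _ after the first [].__repr__
print(freed)                  # nothing has been freed yet

last = make_method("second")  # rebinding drops the first method and its list
print(freed)                  # on CPython: ['first']
```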

You never have more than two lists alive at once: the previous one (still referenced by _) and the current one. If you want to see more memory locations, use variables to add more references.
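For example, a quick sketch that holds on to every method object (methods and owners are just names picked for the illustration). Because each bound method keeps its list alive through __self__, all four lists exist at once and must have different ids:

```python
# Hold on to each bound method so the list it belongs to stays alive.
methods = [[].__repr__ for _ in range(4)]

for m in methods:
    print(m)  # four different "list object at 0x..." addresses

# Each method-wrapper references its list via __self__.
owners = {id(m.__self__) for m in methods}
print(len(owners))  # 4
```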

answered Oct 09 '22 21:10

Martijn Pieters
