<pre class="prettyprint"><code>>>> d {1: 1, 2: 2, 3: 3} >>> lst = [d, d] >>> c=lst[0] >>> c[1]=5 >>> lst [{1: 5, 2: 2, 3: 3}, {1: 5, 2: 2, 3: 3}] </code></pre> When <code>lst = [d, d]</code>, are <code>lst[0]</code> and <code>lsg[1]</code> both references to the memory block of <code>d</code>, instead of creating two memory blocks and copy the content of <code>d</code> to them respectively? When <code>c=lst[0]</code>, is <code>c</code> just a reference to the memory occupied by <code>lst[0]</code>, instead of creating a new memory block and copy the content from <code>lst[0]</code>? In Python, when is a reference created to point to an existing memory block, and when is a new memory block allocated and then copy? This language feature of Python is different from C. What is the name of this language feature? Thanks.

All variables (and other containers, such as dictionaries, lists, and object attributes) hold references to objects. Memory allocation occurs when the object is instantiated. Simple assignment always creates another reference to the existing object. For example, if you have: <pre class="prettyprint"><code>a = [1, 2, 3] b = a </code></pre> Then <code>b</code> and <code>a</code> point to the same object, a list. You can verify this using the <code>is</code> operator: <pre class="prettyprint"><code>print(b is a) # True </code></pre> If you change <code>a</code>, then <code>b</code> changes too, because they are two names for the same object. <pre class="prettyprint"><code>a.append(4) print(b[3] == 4) # True print(b[3] is a[3]) # also True </code></pre> If you want to create a copy, you must do so explicitly. Here are some ways of doing this: <ul> <li>For lists, use a slice: <code>b = a[:]</code>.</li> <li>For many types, you can use the type name to copy an existing object of that type: <code>b = list(a)</code>. When creating your own classes, this is a good approach to take if you need copy functionality.</li> <li>The <code>copy</code> module has methods that can be used to copy objects (either shallowly or deeply).</li> </ul> For immutable types, such as strings, numbers, and tuples, there is never any need to make a copy. You can only "change" these kinds of values by referencing different ones. The best way of describing this is probably "everything's an object." In C, "primitive" types like integers are treated differently from arrays. In Python, they are not: all values are stored as references to objects—even integers.

This paragraph from the Python tutorial should help clear things up for you: <blockquote> Objects have individuality, and multiple names (in multiple scopes) can be bound to the same object. This is known as aliasing in other languages. This is usually not appreciated on a first glance at Python, and can be safely ignored when dealing with immutable basic types (numbers, strings, tuples). However, aliasing has a possibly surprising effect on the semantics of Python code involving mutable objects such as lists, dictionaries, and most other types. This is usually used to the benefit of the program, since aliases behave like pointers in some respects. For example, passing an object is cheap since only a pointer is passed by the implementation; and if a function modifies an object passed as an argument, the caller will see the change — this eliminates the need for two different argument passing mechanisms as in Pascal. </blockquote> <hr> To answer your individual questions in more detail: When lst = [d, d], are lst[0] and lst[1] both references to the memory block of d, instead of creating two memory blocks and copy the content of d to them respectively? No. They don't refer to the memory block of <code>d</code>. <code>lst[0]</code> and <code>lst[1]</code> are aliasing the same object as <code>d</code>, at that point in time. Proof: If you assign <code>d</code> to a new object after initializing the list, <code>lst[0]</code> and <code>lst[1]</code> will be unchanged. If you mutate the object aliased by <code>d</code>, then the mutation is visible <code>lst[0]</code> and <code>lst[1]</code>, because they alias the same object. When c=lst[0], is c just a reference to the memory occupied by lst[0], instead of creating a new memory block and copy the content from lst[0]? Again no. It's not a reference to the memory occupied by <code>lst[0]</code>. Proof: if you assign <code>lst[0]</code> to a new object, c will be unchanged. If you modify a mutable object (like the dictionary that <code>lst[0]</code> points to) you will see the change in <code>c</code>, because <code>c</code> is referring to the same object, the original dictionary. In Python, when is a reference created to point to an existing memory block, and when is a new memory block allocated and then copy? Python doesn't really work with "memory blocks" in the same way that C does. It is an abstraction away from that. Whenever you create a new object, and assign it to a variable, you've obviously got memory allocated for that object. But you will never work with that memory directly, you work with references to the objects in that memory. Those references are the values that get assigned to symbolic names, AKA variables, AKA aliases. "pass-by-reference" is a concept from pointer-based languages like C and C++, and does not apply to Python. There is a blog post which I believe covers this topic the best. It is often argued whether Python is pass-by-value, pass-by-reference, or pass-by-object-reference. The truth is that it doesn't matter how you think of it, as long as you understand that the entire language specification is just an abstraction for working with names and objects. Java and Ruby have similar execution models, but the Java docs call it pass-by-value while the Ruby docs call it pass-by-reference. The Python docs remain neutral on the subject, so it's best not to speculate and just see things for what they are. This language feature of Python is different from C. What is the name of this language feature? Associating names with objects is known as name binding. Allowing multiple names (in potentially multiple scopes) to be bound to the same object is known as aliasing. You can read more about aliasing in the Python tutorial and on Wikipedia. It might also be helpful for you to read would be the execution model documentation where it talks about name binding and scopes in more detail.

In short; Python is pass-by-reference. Objects are created and memory allocated upon their construction. Referencing objects does not allocate more memory unless you are either creating new objects or expanding existing objects (<code>list.append()</code>) This post Is Python pass-by-reference or pass-by-value covers it very well. As a side note; if you are worried about how memory is allocated in a manage programming language like Python then you're probably using the wrong language and/or prematurely optimizing. Also how memory is managed in Python is implemtnation specific as there are many implementations of Python; CPython (what you are probably using); Jython, IronPython, PyPy, MicroPython, etc.

When is a reference created, and when is a new memory block allocated and then copy?

Tags:

python

>>> d
{1: 1, 2: 2, 3: 3}
>>> lst = [d, d]
>>> c=lst[0]
>>> c[1]=5
>>> lst
[{1: 5, 2: 2, 3: 3}, {1: 5, 2: 2, 3: 3}]

When lst = [d, d], are lst[0] and lsg[1] both references to the memory block of d, instead of creating two memory blocks and copy the content of d to them respectively?

When c=lst[0], is c just a reference to the memory occupied by lst[0], instead of creating a new memory block and copy the content from lst[0]?

In Python, when is a reference created to point to an existing memory block, and when is a new memory block allocated and then copy?

This language feature of Python is different from C. What is the name of this language feature?

Thanks.

386

asked May 15 '15 01:05

Tim

3 Answers

All variables (and other containers, such as dictionaries, lists, and object attributes) hold references to objects. Memory allocation occurs when the object is instantiated. Simple assignment always creates another reference to the existing object. For example, if you have:

a = [1, 2, 3]
b = a

Then b and a point to the same object, a list. You can verify this using the is operator:

print(b is a)    # True

If you change a, then b changes too, because they are two names for the same object.

a.append(4)
print(b[3] == 4)     # True
print(b[3] is a[3])  # also True

If you want to create a copy, you must do so explicitly. Here are some ways of doing this:

For lists, use a slice: b = a[:].
For many types, you can use the type name to copy an existing object of that type: b = list(a). When creating your own classes, this is a good approach to take if you need copy functionality.
The copy module has methods that can be used to copy objects (either shallowly or deeply).

For immutable types, such as strings, numbers, and tuples, there is never any need to make a copy. You can only "change" these kinds of values by referencing different ones.

The best way of describing this is probably "everything's an object." In C, "primitive" types like integers are treated differently from arrays. In Python, they are not: all values are stored as references to objects—even integers.

152

answered Sep 23 '22 01:09

kindall

This paragraph from the Python tutorial should help clear things up for you:

Objects have individuality, and multiple names (in multiple scopes) can be bound to the same object. This is known as aliasing in other languages. This is usually not appreciated on a first glance at Python, and can be safely ignored when dealing with immutable basic types (numbers, strings, tuples). However, aliasing has a possibly surprising effect on the semantics of Python code involving mutable objects such as lists, dictionaries, and most other types. This is usually used to the benefit of the program, since aliases behave like pointers in some respects. For example, passing an object is cheap since only a pointer is passed by the implementation; and if a function modifies an object passed as an argument, the caller will see the change — this eliminates the need for two different argument passing mechanisms as in Pascal.

To answer your individual questions in more detail:

When lst = [d, d], are lst[0] and lst[1] both references to the memory block of d, instead of creating two memory blocks and copy the content of d to them respectively?

No. They don't refer to the memory block of d. lst[0] and lst[1] are aliasing the same object as d, at that point in time. Proof: If you assign d to a new object after initializing the list, lst[0] and lst[1] will be unchanged. If you mutate the object aliased by d, then the mutation is visible lst[0] and lst[1], because they alias the same object.

When c=lst[0], is c just a reference to the memory occupied by lst[0], instead of creating a new memory block and copy the content from lst[0]?

Again no. It's not a reference to the memory occupied by lst[0]. Proof: if you assign lst[0] to a new object, c will be unchanged. If you modify a mutable object (like the dictionary that lst[0] points to) you will see the change in c, because c is referring to the same object, the original dictionary.

In Python, when is a reference created to point to an existing memory block, and when is a new memory block allocated and then copy?

Python doesn't really work with "memory blocks" in the same way that C does. It is an abstraction away from that. Whenever you create a new object, and assign it to a variable, you've obviously got memory allocated for that object. But you will never work with that memory directly, you work with references to the objects in that memory.

Those references are the values that get assigned to symbolic names, AKA variables, AKA aliases. "pass-by-reference" is a concept from pointer-based languages like C and C++, and does not apply to Python. There is a blog post which I believe covers this topic the best.

It is often argued whether Python is pass-by-value, pass-by-reference, or pass-by-object-reference. The truth is that it doesn't matter how you think of it, as long as you understand that the entire language specification is just an abstraction for working with names and objects. Java and Ruby have similar execution models, but the Java docs call it pass-by-value while the Ruby docs call it pass-by-reference. The Python docs remain neutral on the subject, so it's best not to speculate and just see things for what they are.

This language feature of Python is different from C. What is the name of this language feature?

Associating names with objects is known as name binding. Allowing multiple names (in potentially multiple scopes) to be bound to the same object is known as aliasing. You can read more about aliasing in the Python tutorial and on Wikipedia.

It might also be helpful for you to read would be the execution model documentation where it talks about name binding and scopes in more detail.

answered Sep 20 '22 01:09

Shashank

In short; Python is pass-by-reference. Objects are created and memory allocated upon their construction. Referencing objects does not allocate more memory unless you are either creating new objects or expanding existing objects (list.append())

This post Is Python pass-by-reference or pass-by-value covers it very well.

As a side note; if you are worried about how memory is allocated in a manage programming language like Python then you're probably using the wrong language and/or prematurely optimizing. Also how memory is managed in Python is implemtnation specific as there are many implementations of Python; CPython (what you are probably using); Jython, IronPython, PyPy, MicroPython, etc.

answered Sep 22 '22 01:09

James Mills

Related questions
                            
                                How to I get the coordinates of a cell in a geotif?
                            
                                Pythonic way of single list with all variations of sublists
                            
                                Python/Numpy subarray selection
                            
                                Python how to print args variable when it conflicts with args keyword in pdb context
                            
                                Python JSON to CSV - bad encoding, UnicodeDecodeError: 'charmap' codec can't decode byte
                            
                                Can't setFont(Times-Roman) missing the T1 files?
                            
                                git-clean with GitPython
                            
                                python .lower() is not working [duplicate]
                            
                                Pandas Python - convert HH:MM:SS into seconds in aggegate (csv file)
                            
                                Ignoring existing table when ./manage.py migrate
                            
                                Executing code on startup in Django 1.7
                            
                                What do HWSURFACE and DOUBLEBUF do?
                            
                                Is it possible to make sql join on several fields using peewee python ORM?
                            
                                Why import builtins if you never redefine any built-in functions?
                            
                                How to open a simple image using streams in Pillow-Python
                            
                                invoking onclick event with beautifulsoup python
                            
                                Plotting correlation heatmaps with Seaborn FacetGrid
                            
                                Python: matplotlib 'numpy.ndarray' object has no attribute 'has_data'
                            
                                Exponential calculation in Python
                            
                                python: Open file, edit one line, save it as the same file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With