Pandas: Subindexing dataframes: Copies vs views

Tags: python, pandas, chained-assignment

Say I have a dataframe:

    import pandas as pd
    import numpy as np

    foo = pd.DataFrame(np.random.random((10, 5)))

and I create another dataframe from a subset of my data:

    bar = foo.iloc[3:5, 1:4]

Does bar hold a copy of those elements from foo? Is there any way to create a view of that data instead? If so, what happens if I modify data through this view? Does pandas provide any sort of copy-on-write mechanism?


asked Jul 31 '13 02:07

Amelio Vazquez-Reina

1 Answer

The answer is in the pandas docs, in the section "Returning a view versus a copy":

Whenever an array of labels or a boolean vector are involved in the indexing operation, the result will be a copy. With single label / scalar indexing and slicing, e.g. df.ix[3:6] or df.ix[:, 'A'], a view will be returned.
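One way to check which case applies is to ask NumPy whether the result shares memory with the parent frame. The sketch below is a minimal illustration, not part of the quoted docs. It assumes a single-dtype frame (so the data sits in one underlying array) and uses fixed values rather than random ones; the helper name shares_data is made up for this example. It uses .loc and .iloc because .ix has since been removed from pandas.

```python
import numpy as np
import pandas as pd


def shares_data(parent, child):
    """Return True if the two frames are backed by overlapping memory.

    Hypothetical helper for this example; assumes both frames hold a
    single numeric dtype so that to_numpy() does not need to copy.
    """
    return np.shares_memory(parent.to_numpy(), child.to_numpy())


# Fixed values (an assumption, for reproducibility) in a 10x5 float frame.
foo = pd.DataFrame(np.arange(50, dtype=float).reshape(10, 5))

sliced = foo.iloc[3:5, 1:4]   # slicing on both axes
by_labels = foo.loc[[3, 4]]   # array (list) of labels
by_mask = foo[foo[0] > 20]    # boolean vector

print("slice shares memory: ", shares_data(foo, sliced))
print("labels share memory: ", shares_data(foo, by_labels))
print("mask shares memory:  ", shares_data(foo, by_mask))
```

With a single-dtype frame like this one, the slice is expected to report shared memory and the other two are not, which matches the rule quoted above. Frames with mixed dtypes may behave differently, since to_numpy() then has to build a new array.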

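In your example, bar is a view onto a slice of foo. If you want an independent copy instead, call the copy method on the result, e.g. foo.iloc[3:5, 1:4].copy(). Because bar is a view, modifying bar also modifies foo. pandas does not appear to have a copy-on-write mechanism, at least in the version used below (0.12.0.dev).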

See my code example below to illustrate:

    In [1]: import pandas as pd
       ...: import numpy as np
       ...: foo = pd.DataFrame(np.random.random((10,5)))

    In [2]: pd.__version__
    Out[2]: '0.12.0.dev-35312e4'

    In [3]: np.__version__
    Out[3]: '1.7.1'

    In [4]: # DataFrame has copy method
       ...: foo_copy = foo.copy()

    In [5]: bar = foo.iloc[3:5,1:4]

    In [6]: bar == foo.iloc[3:5,1:4] == foo_copy.iloc[3:5,1:4]
    Out[6]:
          1     2     3
    3  True  True  True
    4  True  True  True

    In [7]: # Changing the view
       ...: bar.ix[3,1] = 5

    In [8]: # View and DataFrame still equal
       ...: bar == foo.iloc[3:5,1:4]
    Out[8]:
          1     2     3
    3  True  True  True
    4  True  True  True

    In [9]: # It is now different from a copy of original
       ...: bar == foo_copy.iloc[3:5,1:4]
    Out[9]:
           1     2     3
    3  False  True  True
    4   True  True  True
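The session above was recorded on pandas 0.12. Two things differ on current releases: .ix was removed in pandas 1.0 (use .loc or .iloc), and whether a write to bar reaches foo depends on the pandas version and on whether Copy-on-Write is enabled, so it is safer not to rely on it. The sketch below shows one way to make the intent explicit under those assumptions: take a .copy() when you want independent data, and assign through foo itself when you want to change foo. The fixed values are chosen for reproducibility and are not from the original session.

```python
import numpy as np
import pandas as pd

# Fixed values (an assumption) instead of np.random.random((10, 5)).
foo = pd.DataFrame(np.arange(50, dtype=float).reshape(10, 5))
foo_copy = foo.copy()

# Independent subset: changes to bar never reach foo.
bar = foo.iloc[3:5, 1:4].copy()
bar.loc[3, 1] = 5.0  # .loc replaces the removed .ix
foo_untouched = foo.equals(foo_copy)

# To change the original, assign through foo directly.
foo.loc[3, 1] = 5.0
foo_changed = not foo.equals(foo_copy)

print(foo_untouched, foo_changed)
```

Writing it this way gives the same result whether or not the subset would otherwise have been a view.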

answered Sep 22 '22 13:09

davidshinn
