In pandas, what's the difference between df['column'] and df.column?

Tags: python, pandas

I'm working my way through Pandas for Data Analysis and learning a ton. However, one thing keeps coming up. The book typically refers to columns of a dataframe as df['column']; sometimes, though, it uses df.column without explanation.

I don't understand the difference between the two. Any help would be appreciated.

Below is some code demonstrating what I am talking about:

In [5]:

import pandas as pd

data = {'column1': ['a', 'a', 'a', 'b', 'c'], 
        'column2': [1, 4, 2, 5, 3]}
df = pd.DataFrame(data, columns = ['column1', 'column2'])
df

Out[5]:
  column1  column2
0       a        1
1       a        4
2       a        2
3       b        5
4       c        3
5 rows × 2 columns

df.column:

In [8]:

df.column1
Out[8]:
0    a
1    a
2    a
3    b
4    c
Name: column1, dtype: object

df['column']:

In [9]:

df['column1']
Out[9]:
0    a
1    a
2    a
3    b
4    c
Name: column1, dtype: object

258

asked May 08 '14 15:05

Anton

1 Answer

For setting values, you need to use df['column'] = series.

Once this is done, however, you can refer to that column in the future with df.column, assuming it's a valid Python name. (So df.column works, but a column named 6column would still have to be accessed with df['6column'], since df.6column is not valid syntax.)
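A minimal sketch of the read side, using a small made-up frame. The 'count' column is my own addition to show a second case where attribute access fails: the name is a valid identifier but it collides with an existing DataFrame method, so df.count gives back the method rather than the column.

```python
import pandas as pd

# Hypothetical example frame, not the one from the question.
df = pd.DataFrame({'column1': ['a', 'b'],
                   '6column': [1, 2],
                   'count': [10, 20]})

# For an ordinary column name, both forms return the same Series.
same = df.column1.equals(df['column1'])

# '6column' is not a valid Python identifier, so only brackets work.
six = df['6column'].tolist()

# 'count' collides with the DataFrame.count method: attribute access
# returns the method, bracket access returns the column.
attr_is_method = callable(df.count)
count_values = df['count'].tolist()
```

Bracket access works for every column name, which is probably why it is the form used most of the time.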

I think the subtle difference here is that when you set something with df['column'] = ser, pandas goes ahead and adds it to the columns and does some other bookkeeping (I believe by overriding __setitem__). If you do df.column = ser for a name that isn't already a column, it's just like adding a new attribute to any existing object through __setattr__, and pandas does not create a column from it.
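Here is a rough sketch of the write side, again with made-up column names. It assumes a reasonably recent pandas, where assigning a list-like to a new attribute name emits a UserWarning (silenced below); the last step, updating an existing column through attribute assignment, is behavior I'm adding beyond what the answer above describes.

```python
import warnings
import pandas as pd

# Hypothetical example frame.
df = pd.DataFrame({'column1': ['a', 'b', 'c']})

# Bracket assignment creates a real column.
df['column2'] = [1, 2, 3]

# Attribute assignment with a new name only sets a plain Python
# attribute on the object; no column is created. Newer pandas
# versions warn about this, so the warning is silenced for the demo.
with warnings.catch_warnings():
    warnings.simplefilter('ignore')
    df.column3 = [4, 5, 6]

columns_after = list(df.columns)    # 'column3' is not among them
has_attr = hasattr(df, 'column3')   # but the attribute exists

# For a name that is already a column, attribute assignment
# does update the column's values.
df.column2 = [7, 8, 9]
updated = df['column2'].tolist()
```

So df['column'] = ser is the safe choice whenever the column might not exist yet.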

149

answered Sep 21 '22 03:09

acushner
