Proper way to access a column of a pandas dataframe

Tags:

python

pandas

For example, I have a dataframe like this:

     Date          Open          High           Low         Close  \
0  2009-08-25  20246.789063  20476.250000  20143.509766  20435.240234   

      Adj Close      Volume  
0  20435.240234  1531430000

Using attribute or explicit naming both give me the same output:

sum(data.Date==data['Date']) == data.shape[0]

True

However, I cannot access columns whose names contain whitespace, like 'Adj Close', with df.columnname, but I can with df['columnname'].

Is using df['columnname'] strictly better than using df.columnname?

317

asked Sep 06 '17 02:09

chrisckwong821

1 Answer

Using . as a column accessor is a convenience, and it has many limitations beyond spaces in the name. For example, if a column shares its name with an existing DataFrame attribute or method, the . accessor returns that attribute or method instead of the column. A non-exhaustive list of such names: mean, sum, index, values, to_dict. You also cannot reference columns with numeric headers via the . accessor.

So, yes, ['col'] is strictly better than .col because it is more consistent and reliable.
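
To make those limitations concrete, here is a minimal sketch. The frame and its column names are made up for illustration (they are not the asker's data), chosen so that each column hits one of the cases mentioned: a name with a space, a name that collides with a DataFrame method, and a numeric header.

```python
import pandas as pd

# Hypothetical frame; column names are picked to trigger each limitation.
df = pd.DataFrame({
    "Date": ["2009-08-25"],
    "Adj Close": [20435.240234],  # name contains a space
    "mean": [1.0],                # name collides with DataFrame.mean
    0: [42],                      # numeric header
})

# For a plain, non-colliding name both accessors return the same column.
same_column = df.Date.equals(df["Date"])

# A name that matches a method: the attribute is the method, not the column.
attr_is_method = callable(df.mean)
bracket_is_series = isinstance(df["mean"], pd.Series)

# Names with spaces and numeric headers are reachable only with brackets;
# writing df.Adj Close or df.0 is a syntax error before pandas is involved.
adj_close = df["Adj Close"].iloc[0]
numeric_col = df[0].iloc[0]

print(same_column, attr_is_method, bracket_is_series, adj_close, numeric_col)
```

Bracket access behaves the same way for every column here, which is the consistency the answer refers to.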

170

answered Oct 13 '22 12:10

piRSquared
