KeyError when indexing Pandas dataframe

Tags:

python

pandas

I am trying to read data from a csv file into a pandas dataframe, and access the first column 'Date'

import pandas as pd
df_ticks = pd.read_csv('values.csv', delimiter=',')
print(df_ticks.columns)
df_ticks['Date']

produces the following result

Index([u'Date', u'Open', u'High', u'Low', u'Close', u'Volume'], dtype='object')
KeyError: u'no item named Date'

If I try to access any other column, like 'Open' or 'Volume', it works as expected.


asked May 19 '14 07:05

Simbi

2 Answers

As alko mentioned, there is probably an extra character at the beginning of your file. When calling read_csv, you can pass an encoding that strips this leading character, known as a BOM (byte order mark):

df = pd.read_csv('values.csv', delimiter=',', encoding="utf-8-sig")

A similar question has been asked on Stack Overflow: Pandas seems to ignore first column name when reading tab-delimited data, gives KeyError
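To illustrate why the encoding argument helps, here is a minimal sketch. The in-memory bytes and the column values are made up for the example (they stand in for values.csv), and it assumes the file is UTF-8 with a leading BOM.

```python
import io

import pandas as pd

# Hypothetical sample data: a UTF-8 CSV that starts with a BOM (bytes EF BB BF).
raw = b"\xef\xbb\xbfDate,Open,Close\n2014-05-19,1.0,1.5\n"

# Decoding as plain UTF-8 keeps the BOM as an invisible U+FEFF character
# glued to the first header name.
print(repr(raw.decode("utf-8").split(",")[0]))  # '\ufeffDate'

# "utf-8-sig" drops the BOM while decoding, so the first column is plain 'Date'.
df = pd.read_csv(io.BytesIO(raw), encoding="utf-8-sig")
first_column = df.columns[0]
print(repr(first_column))  # 'Date'
print(df["Date"].tolist())
```

Depending on the pandas version, the default encoding may or may not strip the BOM on its own, so passing encoding="utf-8-sig" explicitly is the safer choice when the file might contain one.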

answered Sep 21 '22 21:09

Guillaume Jacquenot

You most likely have an extra character at the beginning of your file that is prepended to your first column name, 'Date'. Copying and pasting your output into a non-Unicode console produces:

Index([u'?Date', u'Open', u'High', u'Low', u'Close', u'Volume'], dtype='object')
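If a non-Unicode console is not at hand, the hidden character can also be checked for directly. The following is only a sketch: has_utf8_bom is a hypothetical helper, the temporary file stands in for values.csv, and it only looks for a UTF-8 BOM (other encodings use different marker bytes).

```python
import codecs
import os
import tempfile


def has_utf8_bom(path):
    """Return True if the file at `path` starts with the UTF-8 BOM bytes."""
    with open(path, "rb") as f:
        return f.read(len(codecs.BOM_UTF8)) == codecs.BOM_UTF8


# Hypothetical demo file written with a leading BOM.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "values.csv")
    with open(path, "wb") as f:
        f.write(codecs.BOM_UTF8 + b"Date,Open\n2014-05-19,1.0\n")

    bom_found = has_utf8_bom(path)

    # Reading as plain UTF-8 leaves U+FEFF attached to the first header name.
    with open(path, encoding="utf-8") as f:
        header = f.readline().rstrip("\n").split(",")

print(bom_found)
# repr() makes the otherwise invisible character visible.
print([repr(name) for name in header])

# Stripping U+FEFF from each name recovers the expected headers.
cleaned = [name.lstrip("\ufeff") for name in header]
print(cleaned)
```

The same repr() check works on a DataFrame, for example by printing repr(df_ticks.columns[0]).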

answered Sep 22 '22 21:09

alko
