Is there a way to encode the index of my dataframe? I have a dataframe where the index is the name of international conferences. <code>df2= pd.DataFrame(index=df_conf['Conference'], columns=['Citation1991','Citation1992'])</code> I keep getting: <code>KeyError: 'Leitf\xc3\xa4den der angewandten Informatik'</code> whenever my code references a foreign conference name with unknown ascii letters. I tried: <pre class="prettyprint"><code>df.at[x.encode("utf-8"), 'col1'] df.at[x.encode('ascii', 'ignore'), 'col'] </code></pre> Is there a way around it? I tried to see if I could encode the dataframe itself when creating it, but it doesn't seem I can do that either.

If you're not using csv, and you want to encode your string index, this is what worked for me: <pre class="prettyprint"><code>df.index = df.index.str.encode('utf-8') </code></pre>

Just put "u" in front of utf8 strings such that <pre class="prettyprint"><code>df2= pd.DataFrame(index=df_conf[u'Conference'], columns=[u'Citation1991',u'Citation1992']) </code></pre> It will work.

Dataframe encoding

Tags:

pandas

dataframe

python-2.7

Is there a way to encode the index of my dataframe? I have a dataframe where the index is the name of international conferences.

df2= pd.DataFrame(index=df_conf['Conference'], columns=['Citation1991','Citation1992'])

I keep getting: KeyError: 'Leitf\xc3\xa4den der angewandten Informatik'

whenever my code references a foreign conference name with unknown ascii letters.

I tried:

Click to copy

df.at[x.encode("utf-8"), 'col1']

df.at[x.encode('ascii', 'ignore'), 'col']

Is there a way around it? I tried to see if I could encode the dataframe itself when creating it, but it doesn't seem I can do that either.

322

asked May 10 '15 19:05

BKS

3 Answers

If you're not using csv, and you want to encode your string index, this is what worked for me:

Click to copy

df.index = df.index.str.encode('utf-8')

169

answered Oct 17 '22 19:10

BKS

Setting up the encoding should be treated when reading the input file, using the option encoding

Click to copy

df = pd.read_csv('bibliography.csv', delimiter=',', encoding="utf-8")

or if the file uses BOM,

Click to copy

df = pd.read_csv('bibliography.csv', delimiter=',', encoding="utf-8-sig")

answered Oct 17 '22 19:10

Guillaume Jacquenot

Just put "u" in front of utf8 strings such that

Click to copy

df2= pd.DataFrame(index=df_conf[u'Conference'], columns=[u'Citation1991',u'Citation1992'])

It will work.

answered Oct 17 '22 19:10

Marcel Kim

Related questions
                            
                                python-docx style_id error while creating a word document
                            
                                Filter list with regex [duplicate]
                            
                                use __name__ as attribute
                            
                                Meaning of '\0\0' in Python?
                            
                                Python lambda function to calculate factorial of a number
                            
                                Highlighting the shortest path in a Networkx graph
                            
                                How do I make Python 3.5 my default version on MacOS?
                            
                                python: pickle.load() raising EOFError
                            
                                python - Difference between two unix timestamps
                            
                                How do I alias python2 to python3 in a docker container?
                            
                                Getting "newline inside string" while reading the csv file in Python?
                            
                                make not found with Dockerfile and centos:7 image
                            
                                How can I understand a .pyc file content
                            
                                Is there any way to execute a statement before each return statement in python function?
                            
                                Rotating strings in Python
                            
                                Use string variable **kwargs as named argument
                            
                                How to draw a semicircle in Python turtle only
                            
                                How to make an object both a Python2 and Python3 iterator?
                            
                                Alternative to python's .sort() (for inserting into a large list and keeping it sorted)
                            
                                Comparing values in two lists in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Dataframe encoding

Tags:

pandas

dataframe

python-2.7

BKS

People also ask

3 Answers

BKS

Guillaume Jacquenot

Marcel Kim

Recent Activity

Donate For Us