I can read a csv file in which there is a column containing Chinese characters (other columns are English and numbers). However, Chinese characters don't display correctly. see photo below <img src="https://i.stack.imgur.com/nG6oN.png" alt="enter image description here"> I loaded the csv file with <code>pd.read_csv()</code>. Either <code>display(data06_16)</code> or <code>data06_16.head()</code> won't display Chinese characters correctly. I tried to add the following lines into my <code>.bash_profile</code>: <pre class="prettyprint"><code>export LC_ALL=zh_CN.UTF-8 export LANG=zh_CN.UTF-8 export LC_ALL=en_US.UTF-8 export LANG=en_US.UTF-8 </code></pre> but it doesn't help. Also I have tried to add <code>encoding</code> arg to <code>pd.read_csv()</code>: <pre class="prettyprint"><code>pd.read_csv('data.csv', encoding='utf_8') pd.read_csv('data.csv', encoding='utf_16') pd.read_csv('data.csv', encoding='utf_32') </code></pre> These won't work at all. How can I display the Chinese characters properly?

I just remembered that the source dataset was created using <code>encoding='GBK'</code>, so I tried again using <pre class="prettyprint"><code>data06_16 = pd.read_csv("../data/stocks1542monthly.csv", encoding="GBK") </code></pre> Now, I can see all the Chinese characters. Thanks guys!

I see here three possible issues: 1) You can try this: <pre class="prettyprint"><code>import codecs x = codecs.open("testdata.csv", "r", "utf-8") </code></pre> 2) Another possibility can be theoretically this: <pre class="prettyprint"><code>import pandas as pd df = pd.DataFrame(pd.read_csv('testdata.csv',encoding='utf-8')) </code></pre> 3) Maybe you should convert you csv file into utf-8 before importing with Python (for example in Notepad++)? It can be a solution for one-time-import, not for automatic process, of course.

How to display Chinese characters inside a pandas dataframe?

Tags:

python

pandas

csv

encoding

chinese-locale

I can read a csv file in which there is a column containing Chinese characters (other columns are English and numbers). However, Chinese characters don't display correctly. see photo below

enter image description here

I loaded the csv file with pd.read_csv().

Either display(data06_16) or data06_16.head() won't display Chinese characters correctly.

I tried to add the following lines into my .bash_profile:

export LC_ALL=zh_CN.UTF-8
export LANG=zh_CN.UTF-8

export LC_ALL=en_US.UTF-8
export LANG=en_US.UTF-8

but it doesn't help.

Also I have tried to add encoding arg to pd.read_csv():

pd.read_csv('data.csv', encoding='utf_8')
pd.read_csv('data.csv', encoding='utf_16')
pd.read_csv('data.csv', encoding='utf_32')

These won't work at all.

How can I display the Chinese characters properly?

881

asked Sep 03 '16 14:09

Daniel

2 Answers

I just remembered that the source dataset was created using encoding='GBK', so I tried again using

data06_16 = pd.read_csv("../data/stocks1542monthly.csv", encoding="GBK")

Now, I can see all the Chinese characters.

Thanks guys!

141

answered Oct 15 '22 10:10

Daniel

I see here three possible issues:

1) You can try this:

import codecs
x = codecs.open("testdata.csv", "r", "utf-8")

2) Another possibility can be theoretically this:

import pandas as pd
df = pd.DataFrame(pd.read_csv('testdata.csv',encoding='utf-8'))

3) Maybe you should convert you csv file into utf-8 before importing with Python (for example in Notepad++)? It can be a solution for one-time-import, not for automatic process, of course.

answered Oct 15 '22 09:10

vlad.rad

Related questions
                            
                                Over-riding Django-allauth login/ registration urls with custom url/ pages
                            
                                Action with pandas SettingWithCopyWarning
                            
                                ValueError time data 'Fri Mar 11 15:59:57 EST 2016' does not match format '%a %b %d %H:%M:%S %Z %Y'
                            
                                How to apply function to multiple pandas dataframe
                            
                                How can I simulate onclick event in python? [closed]
                            
                                Pytest - run multiple tests from a single file
                            
                                Customize templates in a third party Django app
                            
                                How to implement pre and post increment in Python lists?
                            
                                How do I login to the Django Rest browsable API when I have a custom auth model?
                            
                                Does Spark Dataframe have an equivalent option of Panda's merge indicator?
                            
                                PUT dictionary in dictionary in Python requests
                            
                                Object going out of scope and being garbage collected in PySide/PyQt
                            
                                How to use typeshed with mypy?
                            
                                How to decode a mime part of a message and get a **unicode** string in Python 2.7?
                            
                                Get length of CSV to show progress
                            
                                how to read json file with pandas?
                            
                                how to change the black color to Red with opencv python
                            
                                pandas dataframe fillna() not working?
                            
                                how to download images using google earth engine's python API
                            
                                Python speech recognition error converting mp3 file

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to display Chinese characters inside a pandas dataframe?

Tags:

python

pandas

csv

encoding

chinese-locale

Daniel

People also ask

2 Answers

Daniel

vlad.rad

Recent Activity

Donate For Us