Writing pandas DataFrame to JSON in unicode

Tags:

I'm trying to write a pandas DataFrame containing unicode to json, but the built in .to_json function escapes the characters. How do I fix this?

Example:

import pandas as pd df = pd.DataFrame([['τ', 'a', 1], ['π', 'b', 2]]) df.to_json('df.json')

This gives:

{"0":{"0":"\u03c4","1":"\u03c0"},"1":{"0":"a","1":"b"},"2":{"0":1,"1":2}}

Which differs from the desired result:

{"0":{"0":"τ","1":"π"},"1":{"0":"a","1":"b"},"2":{"0":1,"1":2}}

I have tried adding the force_ascii=False argument:

import pandas as pd df = pd.DataFrame([['τ', 'a', 1], ['π', 'b', 2]]) df.to_json('df.json', force_ascii=False)

But this gives the following error:

UnicodeEncodeError: 'charmap' codec can't encode character '\u03c4' in position 11: character maps to <undefined>

I'm using WinPython 3.4.4.2 64bit with pandas 0.18.0

620

asked Sep 21 '16 09:09

Swier

1 Answers

Opening a file with the encoding set to utf-8, and then passing that file to the .to_json function fixes the problem:

with open('df.json', 'w', encoding='utf-8') as file:     df.to_json(file, force_ascii=False)

gives the correct:

{"0":{"0":"τ","1":"π"},"1":{"0":"a","1":"b"},"2":{"0":1,"1":2}}

Note: it does still require the force_ascii=False argument.

answered Sep 18 '22 19:09

Swier

Related questions
                            
                                Using the class as a type hint for arguments in its methods [duplicate]
                            
                                Python webdriver to handle pop up browser windows which is not an alert
                            
                                Retrieve XY data from matplotlib figure [duplicate]
                            
                                Opening a pdf and reading in tables with python pandas
                            
                                Why does my use of click.argument produce "got an unexpected keyword argument 'help'?
                            
                                Python Requests getting ('Connection aborted.', BadStatusLine("''",)) error
                            
                                Insert or delete a step in scikit-learn Pipeline
                            
                                replace part of the string in pandas data frame
                            
                                How to execute two "aggregate" functions (like sum) concurrently, feeding them from the same iterator?
                            
                                Draw a line at specific position/annotate a Facetgrid in seaborn
                            
                                Dynamically importing Python module
                            
                                How to display picture and get mouse click coordinate on it [closed]
                            
                                Python multiprocessing - How to release memory when a process is done?
                            
                                scipy, lognormal distribution - parameters
                            
                                Getting container/parent object from within python
                            
                                How can I reorder multi-indexed dataframe columns at a specific level
                            
                                Converting (YYYY-MM-DD-HH:MM:SS) date time
                            
                                Why can functions in Python print variables in enclosing scope but cannot use them in assignment?
                            
                                ggplot styles in Python
                            
                                Computing the correlation coefficient between two multi-dimensional arrays

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Writing pandas DataFrame to JSON in unicode

Tags:

python

json

pandas

unicode

Swier

People also ask

1 Answers

Swier

Recent Activity

Donate For Us