python: json.dumps can't handle utf-8?

Below is the test program, including a Chinese character:

    # -*- coding: utf-8 -*-
    import json

    j = {"d":"中", "e":"a"}
    json = json.dumps(j, encoding="utf-8")

    print json

Below is the result. Notice that json.dumps converted the UTF-8 character into its numeric escape code!

{"e": "a", "d": "\u4e2d"}

Why is this broken? Or am I doing something wrong?

675

asked Nov 15 '10 12:11

Bin Chen

2 Answers

Looks like valid JSON to me. If you want json to output a string that has non-ASCII characters in it then you need to pass ensure_ascii=False and then encode manually afterward.
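A minimal sketch of that suggestion, assuming Python 3 (the question uses Python 2, where `print` is a statement and `json.dumps` still accepts `encoding`). The variable names `data`, `escaped`, `readable`, and `utf8_bytes` are only illustrative:

```python
import json

data = {"d": "中", "e": "a"}

# Default behaviour: non-ASCII characters are written as \uXXXX escapes.
escaped = json.dumps(data)

# ensure_ascii=False keeps the character itself in the resulting str.
readable = json.dumps(data, ensure_ascii=False)

# "Encode manually afterward": turn the str into UTF-8 bytes for a file or socket.
utf8_bytes = readable.encode("utf-8")

print(escaped)
print(readable)
print(utf8_bytes)
```

Both `escaped` and `readable` are valid JSON for the same data; they differ only in how the character is written out.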

183

answered Oct 01 '22 16:10

Ignacio Vazquez-Abrams

You should read json.org. The complete JSON specification is in the white box on the right.

There is nothing wrong with the generated JSON. Generators are allowed to generate either UTF-8 strings or plain ASCII strings in which non-ASCII characters are escaped with the \uXXXX notation. In your case, the Python json module chose to escape, and 中 has the escaped notation \u4e2d.

By the way: Any conforming JSON interpreter will correctly unescape this sequence again and give you back the actual character.
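To check that round trip yourself, here is a small sketch, again assuming Python 3 and using Python's own `json.loads` as the conforming parser. The names `escaped_text`, `raw_text`, `from_escaped`, and `from_raw` are made up for the example:

```python
import json

# The exact output from the question, with the character escaped.
escaped_text = '{"e": "a", "d": "\\u4e2d"}'

# The same document with the character written directly.
raw_text = '{"e": "a", "d": "中"}'

from_escaped = json.loads(escaped_text)
from_raw = json.loads(raw_text)

# Both spellings decode to the same Python data.
print(from_escaped["d"])
print(from_escaped == from_raw)
```

The escape only exists in the serialized text; once parsed, the value is the actual character again.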

answered Oct 01 '22 15:10

Boldewyn
