Python: why does str() on some text from a UTF-8 file give a UnicodeEncodeError?

Tags: python, character-encoding

I'm processing a UTF-8 file in Python and have used simplejson to load it into a dictionary. However, I get a UnicodeEncodeError when I try to turn one of the dictionary values into a string:

import simplejson as json  # assumed import: the question says simplejson is used

f = open('my_json.json', 'r')
master_dictionary = json.load(f)
# some json wrangling, then it fails on this line...
mysql_string += " ('" + str(v_dict['code'])

Traceback (most recent call last):
  File "my_file.py", line 25, in <module>
    str(v_dict['code']) + "'), "
UnicodeEncodeError: 'ascii' codec can't encode character u'\xf4' in position 35: ordinal not in range(128)

Why is Python even using ASCII? I thought it used UTF-8 by default, and the input is from a UTF-8 file.

$ file my_json.json 
my_json.json: UTF-8 Unicode English text

What is the problem?

asked Mar 31 '10 16:03

AP257

1 Answer

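Python 2.x uses ASCII as its default codec, regardless of the encoding of the file the text came from. The value here is a unicode object (the u'\xf4' in the traceback shows this), and calling str() on it implicitly encodes it as ASCII, which fails on any non-ASCII character. Use unicode.encode() with an explicit encoding if you want to turn a unicode into a str: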

v_dict['code'].encode('utf-8')
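
To make the difference concrete, here is a minimal sketch that should run on both Python 2.6+ and Python 3. The JSON content and the value "côte" are made up for illustration, since the real my_json.json isn't shown. The explicit encode('ascii') call stands in for what str() does implicitly on a unicode object in Python 2.

```python
# -*- coding: utf-8 -*-
import json

# Hypothetical stand-in for the file's contents (the real file is not shown):
# UTF-8 bytes for {"code": "côte"}, where ô is U+00F4.
raw = b'{"code": "c\xc3\xb4te"}'

# Decode the bytes as UTF-8, then parse. The parsed value is text
# (unicode in Python 2, str in Python 3), not bytes.
v_dict = json.loads(raw.decode('utf-8'))
value = v_dict['code']

# Python 2's str(value) behaves like value.encode('ascii'),
# which cannot represent U+00F4.
try:
    value.encode('ascii')
    ascii_failed = False
except UnicodeEncodeError:
    ascii_failed = True

# Encoding explicitly as UTF-8 succeeds and yields a byte string.
encoded = value.encode('utf-8')
```

The point is that the file's encoding only matters when the bytes are decoded. After that, turning the text back into bytes needs its own explicit encoding.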

answered Oct 19 '22 21:10

Ignacio Vazquez-Abrams
