Python Unicode Encode Error

Tags: python, encode, unicode, ascii

I'm reading and parsing an Amazon XML file, and while the XML file shows a ’ (a right single quotation mark, U+2019), when I try to print it I get the following error:

'ascii' codec can't encode character u'\u2019' in position 16: ordinal not in range(128)

From what I've read online thus far, the error is coming from the fact that the XML file is in UTF-8, but Python wants to handle it as an ASCII encoded character. Is there a simple way to make the error go away and have my program print the XML as it reads?

574

asked Jul 11 '10 19:07

Alex B

1 Answer

Likely, your problem is that you parsed it okay, and now you're trying to print the contents of the XML and you can't because there are some non-ASCII Unicode characters in it. Try encoding your Unicode string as ASCII first:

unicodeData.encode('ascii', 'ignore')

The 'ignore' part tells it to just skip those characters. From the Python docs:

>>> # Python 2: u = unichr(40960) + u'abcd' + unichr(1972)
>>> u = chr(40960) + 'abcd' + chr(1972)
>>> u.encode('utf-8')
b'\xea\x80\x80abcd\xde\xb4'
>>> u.encode('ascii')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeEncodeError: 'ascii' codec can't encode character '\ua000' in position 0: ordinal not in range(128)
>>> u.encode('ascii', 'ignore')
b'abcd'
>>> u.encode('ascii', 'replace')
b'?abcd?'
>>> u.encode('ascii', 'xmlcharrefreplace')
b'&#40960;abcd&#1972;'
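To tie this back to the character in the question, here is a minimal sketch, assuming Python 3. The sample string and the to_ascii helper are made up for illustration; only str.encode, bytes.decode and the error-handler names come from the standard library.

```python
# Hypothetical sample text; U+2019 is the character named in the error message.
text = "Amazon\u2019s best seller"


def to_ascii(value, errors="ignore"):
    """Encode to ASCII with the given error handler, then decode back to str.

    Hypothetical helper, not part of any library.
    """
    return value.encode("ascii", errors).decode("ascii")


dropped = to_ascii(text)                        # character is silently removed
replaced = to_ascii(text, "replace")            # character becomes '?'
escaped = to_ascii(text, "xmlcharrefreplace")   # character becomes an XML entity
utf8_bytes = text.encode("utf-8")               # character is kept, as UTF-8 bytes

print(dropped)
print(replaced)
print(escaped)
print(utf8_bytes)
```

Which handler fits depends on where the output goes: 'ignore' and 'replace' lose information, 'xmlcharrefreplace' keeps it in a form XML and HTML consumers can read, and encoding to UTF-8 keeps the original character if the destination can handle those bytes.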

You might want to read this article: http://www.joelonsoftware.com/articles/Unicode.html, which I found very useful as a basic tutorial on what's going on. After reading it, you'll stop feeling like you're just guessing which commands to use (or at least that's what happened to me).

105

answered Sep 20 '22 07:09

Scott Stafford
