UnicodeEncodeError: 'ascii' codec can't encode character u'\xe4'

Tags:

python

encoding

ascii

I keep getting the following error:

UnicodeEncodeError: 'ascii' codec can't encode character u'\xe4' in position 27: ordinal not in range(128)

I already tried

x.encode("ascii", "ignore")
x.encode("utf-8")
x.decode("utf-8")

However, nothing works.

376

asked Oct 27 '14 17:10

toom

1 Answer

You have to find out which encoding this character has at the source.

I guess it is ISO-8859-1 (Western European languages), in which case it's "ä", but you should check. It could also be Cyrillic or Greek.

See http://en.wikipedia.org/wiki/ISO/IEC_8859-1 for a complete list of characters in this encoding.
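To see why checking matters, here is a small sketch of how the same byte decodes under three ISO-8859 variants. It is written in Python 3 syntax (an assumption, since the session in this answer uses Python 2.7), and the names `raw`, `candidates` and `decoded` are only illustrative.

```python
# One raw byte, as it might arrive from a file or socket.
raw = b"\xe4"

# Candidate source encodings: Latin-1, Cyrillic, Greek.
candidates = ["iso-8859-1", "iso-8859-5", "iso-8859-7"]

# Decode the same byte under each candidate and show the resulting character.
decoded = {name: raw.decode(name) for name in candidates}
for name, char in decoded.items():
    print(name, repr(char), "U+%04X" % ord(char))
```

Each codec maps byte 0xE4 to a different character, so decoding with the wrong one does not raise an error; it silently produces the wrong text.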

Using this information, you can ask Python to convert it.

In Python 2.7:

>>> s = '\xe4'
>>> t = s.decode('iso-8859-1')
>>> print t
ä
>>> for c in t:
...   print ord(c)
...
228
>>> u = t.encode('utf-8')
>>> print u
ä
>>> for c in bytes(u):
...   print ord(c)
...
195
164

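String t is a unicode object: a sequence of code points with no encoding attached. Its single code point is 228 (U+00E4), which happens to equal the ISO-8859-1 byte value. String u is a byte string (str) holding the UTF-8 encoding of that text, in which the character takes 2 bytes. Notice also that print handles the two differently: it encodes t using the terminal's encoding (sys.stdout.encoding), while the bytes of u are written unchanged, so u only displays as "ä" when the terminal expects UTF-8.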
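For readers on Python 3, the same steps might look roughly like the sketch below. It also reproduces the error from the question and shows what the "ignore" and "replace" error handlers return. The variable names are illustrative and not part of the original session.

```python
# Python 3 sketch of the Python 2.7 session above.
raw = b"\xe4"                     # one byte, as read from an ISO-8859-1 source
text = raw.decode("iso-8859-1")   # str: a sequence of Unicode code points
utf8 = text.encode("utf-8")       # bytes: the UTF-8 serialization of the text

print([ord(c) for c in text])     # code points of the decoded text
print(list(utf8))                 # byte values of the UTF-8 form

# Reproduce the error from the question: ASCII has no code for U+00E4.
try:
    text.encode("ascii")
except UnicodeEncodeError as exc:
    error_message = str(exc)
    print(error_message)

# "ignore" silently drops the character; "replace" substitutes "?".
dropped = text.encode("ascii", "ignore")
replaced = text.encode("ascii", "replace")
print(dropped, replaced)
```

Neither error handler preserves the character, so they only hide the problem. Encoding to UTF-8 (or another encoding that contains "ä") at the point where the text leaves the program keeps the data intact.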

123

answered Sep 28 '22 13:09

Mickaël Bucas
