Here are my attempts with error messages. What am I doing wrong? <pre class="prettyprint"><code>string.decode("ascii", "ignore") </code></pre> <blockquote> UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 37: ordinal not in range(128) </blockquote> <pre class="prettyprint"><code>string.encode('utf-8', "ignore") </code></pre> <blockquote> UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 37: ordinal not in range(128) </blockquote>

You can't decode a <code>unicode</code>, and you can't encode a <code>str</code>. Try doing it the other way around.

Guessing at all the things omitted from the original question, but, assuming Python 2.x the key is to read the error messages carefully: in particular where you call 'encode' but the message says 'decode' and vice versa, but also the types of the values included in the messages. In the first example <code>string</code> is of type <code>unicode</code> and you attempted to decode it which is an operation converting a byte string to unicode. Python helpfully attempted to convert the unicode value to <code>str</code> using the default 'ascii' encoding but since your string contained a non-ascii character you got the error which says that Python was unable to encode a unicode value. Here's an example which shows the type of the input string: <pre class="prettyprint"><code>>>> u"\xa0".decode("ascii", "ignore") Traceback (most recent call last): File "<pyshell#7>", line 1, in <module> u"\xa0".decode("ascii", "ignore") UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 0: ordinal not in range(128) </code></pre> In the second case you do the reverse attempting to encode a byte string. Encoding is an operation that converts unicode to a byte string so Python helpfully attempts to convert your byte string to unicode first and, since you didn't give it an ascii string the default ascii decoder fails: <pre class="prettyprint"><code>>>> "\xc2".encode("ascii", "ignore") Traceback (most recent call last): File "<pyshell#6>", line 1, in <module> "\xc2".encode("ascii", "ignore") UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 0: ordinal not in range(128) </code></pre>

string encoding and decoding?

Tags:

python

python-2.7

Here are my attempts with error messages. What am I doing wrong?

Click to copy

string.decode("ascii", "ignore")

UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 37: ordinal not in range(128)

Click to copy

string.encode('utf-8', "ignore")

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 37: ordinal not in range(128)

729

asked Jul 05 '12 07:07

waigani

2 Answers

You can't decode a unicode, and you can't encode a str. Try doing it the other way around.

answered Sep 22 '22 03:09

Ignacio Vazquez-Abrams

Guessing at all the things omitted from the original question, but, assuming Python 2.x the key is to read the error messages carefully: in particular where you call 'encode' but the message says 'decode' and vice versa, but also the types of the values included in the messages.

In the first example string is of type unicode and you attempted to decode it which is an operation converting a byte string to unicode. Python helpfully attempted to convert the unicode value to str using the default 'ascii' encoding but since your string contained a non-ascii character you got the error which says that Python was unable to encode a unicode value. Here's an example which shows the type of the input string:

Click to copy

>>> u"\xa0".decode("ascii", "ignore")  Traceback (most recent call last):   File "<pyshell#7>", line 1, in <module>     u"\xa0".decode("ascii", "ignore") UnicodeEncodeError: 'ascii' codec can't encode character u'\xa0' in position 0: ordinal not in range(128)

In the second case you do the reverse attempting to encode a byte string. Encoding is an operation that converts unicode to a byte string so Python helpfully attempts to convert your byte string to unicode first and, since you didn't give it an ascii string the default ascii decoder fails:

Click to copy

>>> "\xc2".encode("ascii", "ignore")  Traceback (most recent call last):   File "<pyshell#6>", line 1, in <module>     "\xc2".encode("ascii", "ignore") UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 0: ordinal not in range(128)

answered Sep 19 '22 03:09

Duncan

Related questions
                            
                                SQLAlchemy printing raw SQL from create()
                            
                                python dictionary sorting in descending order based on values
                            
                                Python class accessible by iterator and index
                            
                                Is there a need to close files that have no reference to them?
                            
                                Django template comparing string
                            
                                upload file to my dropbox from python script
                            
                                Pyinstaller setting icons don't change
                            
                                python try:except:finally
                            
                                How to format date string via multiple formats in python
                            
                                How to check (in template) if user belongs to a group
                            
                                overriding bool() for custom class [duplicate]
                            
                                Break or exit out of "with" statement?
                            
                                How to find out the current widget size in tkinter?
                            
                                A fast way to find the largest N elements in an numpy array
                            
                                How to retrieve SQL result column value using column name in Python?
                            
                                env: python\r: No such file or directory
                            
                                Django - Getting last object created, simultaneous filters
                            
                                Function to determine if two numbers are nearly equal when rounded to n significant decimal digits
                            
                                What does abstraction mean in programming?
                            
                                How to fix ''UnicodeDecodeError: 'charmap' codec can't decode byte 0x9d in position 29815: character maps to <undefined>''?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

string encoding and decoding?

Tags:

python

python-2.7

waigani

People also ask

2 Answers

Ignacio Vazquez-Abrams

Duncan

Recent Activity

Donate For Us