unicode().decode('utf-8', 'ignore') raising UnicodeEncodeError

Tags:

python

unicode

Here is the code:

>>> z = u'\u2022'.decode('utf-8', 'ignore') Traceback (most recent call last):   File "<stdin>", line 1, in <module>   File "/usr/lib/python2.6/encodings/utf_8.py", line 16, in decode     return codecs.utf_8_decode(input, errors, True) UnicodeEncodeError: 'latin-1' codec can't encode character u'\u2022' in position 0: ordinal not in range(256)

Why is UnicodeEncodeError raised when I am using .decode?

Why is any error raised when I am using 'ignore'?

641

asked Feb 23 '11 20:02

Facundo Casco

1 Answers

When I first started messing around with python strings and unicode, It took me awhile to understand the jargon of decode and encode too, so here's my post from here that may help:

Think of decoding as what you do to go from a regular bytestring to unicode and encoding as what you do to get back from unicode. In other words:

You de-code a str to produce a unicode string (in Python 2)

and en-code a unicode string to produce a str (in Python 2)

So:

unicode_char = u'\xb0'  encodedchar = unicode_char.encode('utf-8')

encodedchar will contain your unicode character, displayed in the selected encoding (in this case, utf-8).

The same principle applies to Python 3. You de-code a bytes object to produce a str object. And you en-code a str object to produce a bytes object.

131

answered Sep 25 '22 03:09

Aphex

Related questions
                            
                                Dynamically limiting queryset of related field
                            
                                'module' object is not callable - calling method in another file
                            
                                Python scikit-learn: exporting trained classifier
                            
                                numpy.r_ is not a function. What is it?
                            
                                Pre-populate an inline FormSet?
                            
                                How to build a single python file from multiple scripts?
                            
                                GridSearch for an estimator inside a OneVsRestClassifier
                            
                                Catch "socket.error: [Errno 111] Connection refused" exception
                            
                                How would I access variables from one class to another?
                            
                                Django equivalent of PHP's form value array/associative array
                            
                                Parentheses in Python Conditionals
                            
                                Merging a Python script's subprocess' stdout and stderr while keeping them distinguishable
                            
                                OpenCV Python: Draw minAreaRect ( RotatedRect not implemented)
                            
                                How to delete an instantiated object Python?
                            
                                Python & Pandas: How to query if a list-type column contains something?
                            
                                Basic method chaining
                            
                                when does Python allocate new memory for identical strings?
                            
                                In Python, heapq.heapify doesn't take cmp or key functions as arguments like sorted does
                            
                                How to detect string byte encoding?
                            
                                multiprocessing.dummy in Python is not utilising 100% cpu

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With