How do I override the str function without raising a UnicodeEncodeError?

I am puzzled that defining __str__ for a class seems to have no effect on using the str function on a class instance. For example, I read in the Django documentation that:

The print statement and the str built-in call __str__() to determine the human-readable representation of an object.

But that doesn't appear to be true. Here's an example from a module where text is always assumed to be unicode:

import six

class Test(object):

    def __init__(self, text):
        self._text = text

    def __str__(self):
        if six.PY3:
            return str(self._text)
        else:
            return unicode(self._text)

    def __unicode__(self):
        if six.PY3:
            return str(self._text)
        else:
            return unicode(self._text)

In Python 2, it gives the following behavior:

>>> a=Test(u'café')
>>> print a.__str__()
café
>>> print a # same error with str(a)
---------------------------------------------------------------------------
UnicodeEncodeError                        Traceback (most recent call last)
<ipython-input-63-202e444820fd> in <module>()
----> 1 str(a)

UnicodeEncodeError: 'ascii' codec can't encode character u'\xe9' in position 3: ordinal not in range(128)

Is there a way to overload the str function?

asked May 06 '16 21:05

Ray Osborn

1 Answer

For Python 2, you are returning the wrong type from the __str__ method. You are returning unicode, while you must return str:

def __str__(self):
    if six.PY3:
        return str(self._text)
    else:
        return self._text.encode('utf8')

Because self._text is not already of type str, you'll need to encode it. When __str__ returns a unicode object instead, Python 2 implicitly encodes the result with the default ASCII codec, which can't handle the non-ASCII é character.
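The difference between the two codecs can be reproduced outside the class. The following is a minimal sketch (the variable names are illustrative, not from the question) that runs on both Python 2 and Python 3:

```python
text = u'caf\xe9'  # the same text as in the question, with e-acute escaped

# Explicit UTF-8 encoding succeeds and produces the bytes that str(a) shows
# in the corrected version.
utf8_bytes = text.encode('utf8')

# The ASCII codec cannot represent the e-acute character. This mirrors the
# implicit encoding Python 2 attempts when __str__ returns unicode.
try:
    text.encode('ascii')
    ascii_failed = False
except UnicodeEncodeError as exc:
    ascii_failed = True
    print(exc)
```

The exception message printed here should resemble the one in the question's traceback, since the same codec is involved.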

Printing the object results in the right output only because my terminal is configured to handle UTF-8:

>>> a = Test(u'café')
>>> str(a)
'caf\xc3\xa9'
>>> print a
café
>>> unicode(a)
u'caf\xe9'
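To see which encoding your own terminal reports, something like the following sketch may help. The fallback to ASCII when no encoding is reported is an assumption of this example, not a rule Python itself guarantees in every context:

```python
import sys

# sys.stdout.encoding can be None (for example when output is piped on
# Python 2), so this sketch falls back to ASCII in that case.
encoding = getattr(sys.stdout, 'encoding', None) or 'ascii'

text = u'caf\xe9'
# 'replace' substitutes '?' for characters the target encoding cannot
# represent, so this call does not raise UnicodeEncodeError.
shown = text.encode(encoding, 'replace')
print(encoding)
```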

Note that there is no __unicode__ method in Python 3; your if six.PY3 in that method is entirely redundant. The following would work too:

class Test(object):
    def __init__(self, text):
        self._text = text

    def __str__(self):
        if six.PY3:
            return self._text
        else:
            return self._text.encode('utf8')

    def __unicode__(self):
        return self._text

However, if you are using the six library, you'd be far better off using the @six.python_2_unicode_compatible decorator and defining only a Python 3 version of the __str__ method:

@six.python_2_unicode_compatible
class Test(object):
    def __init__(self, text):
        self._text = text

    def __str__(self):
        return self._text

This assumes that text is always Unicode. If you are working with Django, you can get the same decorator from the django.utils.encoding module (available in Django versions before 3.0).
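For intuition, here is a rough sketch of what such a decorator does on Python 2. The name py2_unicode_compatible_sketch is hypothetical and this is not six's actual source; in real code, prefer the library's decorator:

```python
import sys

PY2 = sys.version_info[0] == 2


def py2_unicode_compatible_sketch(cls):
    """Hypothetical stand-in for six.python_2_unicode_compatible."""
    if PY2:
        # Keep the text-returning method available as __unicode__ ...
        cls.__unicode__ = cls.__str__
        # ... and make __str__ return UTF-8 encoded bytes instead.
        cls.__str__ = lambda self: self.__unicode__().encode('utf-8')
    # On Python 3 the class is returned unchanged.
    return cls


@py2_unicode_compatible_sketch
class Test(object):
    def __init__(self, text):
        self._text = text

    def __str__(self):
        return self._text


a = Test(u'caf\xe9')
```

On Python 3, str(a) returns the text unchanged. On Python 2, str(a) returns UTF-8 bytes while unicode(a) returns the original text.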

answered Sep 24 '22 02:09

Martijn Pieters

Tags: python, unicode, python-2.x
