Printing a utf-8 encoded string

Tags:

I'm using BeautifulSoup to extract some text from an HTML but I just can't figure out how to print it properly to the screen (or to a file for that matter).

Here's how my class containing the text looks like:

class Thread(object):     def __init__(self, title, author, date, content = u""):         self.title = title         self.author = author         self.date = date         self.content = content         self.replies = []      def __unicode__(self):         s = u""          for k, v in self.__dict__.items():             s += u"%s = %s " % (k, v)          return s      def __repr__(self):         return repr(unicode(self))      __str__ = __repr__

When trying to print an instance of Thread here's what I see on the console:

~/python-tests $ python test.py u'date = 21:01 03/02/11 content =  author = \u05d3"\u05e8 \u05d9\u05d5\u05e0\u05d9 \u05e1\u05d8\u05d0\u05e0\u05e6\'\u05e1\u05e7\u05d5 replies = [] title = \u05de\u05d1\u05e0\u05d4 \u05d4\u05de\u05d1\u05d7\u05df '

Whatever I try I cannot get the output I'd like (the above text should be Hebrew). My end goal is to serialize Thread to a file (using json or pickle) and be able to read it back.

I'm running this with Python 2.6.6 on Ubuntu 10.10.

888

asked Mar 05 '11 10:03

daniel

1 Answers

To output a Unicode string to a file (or the console) you need to choose a text encoding. In Python the default text encoding is ASCII, but to support Hebrew characters you need to use a different encoding, such as UTF-8:

s = unicode(your_object).encode('utf8') f.write(s)

answered Oct 07 '22 07:10

Mark Byers

Related questions
                            
                                Keras: Binary_crossentropy has negative values
                            
                                How to install python in a docker image?
                            
                                The workers in ThreadPoolExecutor is not really daemon
                            
                                django model Form. Include fields from related models
                            
                                Testing Python Decorators?
                            
                                How to make an internal hyperlink in Sphinx documentation [duplicate]
                            
                                are user defined classes mutable
                            
                                Beginner Python: AttributeError: 'list' object has no attribute
                            
                                "Reduce" function for Series
                            
                                Python 3 urllib ignore SSL certificate verification
                            
                                pip install -r requirements.txt [Errno 2] No such file or directory: 'requirements.txt'
                            
                                Google Coding Challenge Question 2020 : Unspecified Words
                            
                                Python mechanize - two buttons of type 'submit'
                            
                                Matplotlib: simultaneous plotting in multiple threads
                            
                                TypeError: 'NoneType' object has no attribute '__getitem__'
                            
                                Difference between Kivy and PY4A
                            
                                Correct way to test for numpy.dtype
                            
                                Pandas ".convert_objects(convert_numeric=True)" deprecated [duplicate]
                            
                                "unstack" a pandas column containing lists into multiple rows [duplicate]
                            
                                Wrapping exceptions in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Printing a utf-8 encoded string

Tags:

python

encoding

unicode

daniel

People also ask

1 Answers

Mark Byers

Recent Activity

Donate For Us