How do convert unicode escape sequences to unicode characters in a python string

Tags:

When I tried to get the content of a tag using "unicode(head.contents[3])" i get the output similar to this: "Christensen Sk\xf6ld". I want the escape sequence to be returned as string. How to do it in python?

918

asked Jun 13 '09 06:06

Vicky

1 Answers

Assuming Python sees the name as a normal string, you'll first have to decode it to unicode:

>>> name 'Christensen Sk\xf6ld' >>> unicode(name, 'latin-1') u'Christensen Sk\xf6ld'

Another way of achieving this:

>>> name.decode('latin-1') u'Christensen Sk\xf6ld'

Note the "u" in front of the string, signalling it is uncode. If you print this, the accented letter is shown properly:

>>> print name.decode('latin-1') Christensen Sköld

BTW: when necessary, you can use de "encode" method to turn the unicode into e.g. a UTF-8 string:

>>> name.decode('latin-1').encode('utf-8') 'Christensen Sk\xc3\xb6ld'

133

answered Sep 22 '22 02:09

Mark van Lent

Related questions
                            
                                Misunderstanding of python os.path.abspath
                            
                                How to send zip files in the python Flask framework?
                            
                                What is the purpose of $HOME/.local
                            
                                A + B without arithmetic operators, Python vs C++
                            
                                How to correctly use scipy's skew and kurtosis functions?
                            
                                plotly.offline.iplot gives a large blank field as its output in Jupyter Notebook/Lab
                            
                                Using Python to program MS Office macros?
                            
                                The "correct" way to define an exception in Python without PyLint complaining
                            
                                Is this the right way to run a shell script inside Python?
                            
                                Cannot concatenate 'str' and 'float' objects?
                            
                                In python, how to capture the stdout from a c++ shared library to a variable
                            
                                Rotate theta=0 on matplotlib polar plot
                            
                                Can I manually trigger signals in Django?
                            
                                Display SVG in IPython notebook from a function
                            
                                Matplotlib control which plot is on top
                            
                                How to quit ipdb while in post-mortem debugging?
                            
                                Django and domain driven design
                            
                                Datetime conversion - How to extract the inferred format?
                            
                                ImportError: No module named 'flask.ext' [duplicate]
                            
                                Writing unit tests in Django / Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do convert unicode escape sequences to unicode characters in a python string

Tags:

python

unicode

python-2.x

Vicky

People also ask

1 Answers

Mark van Lent

Recent Activity

Donate For Us