Python Convert Unicode-Hex utf-8 strings to Unicode strings

2 Answers

s = u'Gaga\xe2\x80\x99s'
t = u'Gaga\u2019s'
x = s.encode('raw-unicode-escape').decode('utf-8')
assert x==t

print(x)

yields

Gaga’s

answered Nov 14 '22 23:11

unutbu

Where ever you decoded the original string, it was likely decoded with latin-1 or a close relative. Since latin-1 is the first 256 codepoints of Unicode, this works:

>>> s = u'Gaga\xe2\x80\x99s'
>>> s.encode('latin-1').decode('utf8')
u'Gaga\u2019s'

answered Nov 14 '22 23:11

Mark Tolonen

Related questions
                            
                                Eclipse external tool for Qt .ui to .py with pyuic
                            
                                Python to delete a row in excel spreadsheet
                            
                                Adding elements to a tuple when I know I shouldn't be able to
                            
                                How to calculate longitude using PyEphem
                            
                                Recursion over a list of lists without isinstance()
                            
                                Can this be written as a python reduce function?
                            
                                How to print file contents with filename before each line?
                            
                                Extending BaseHTTPRequestHandler - getting the posted data
                            
                                working with high precision timestamps in python
                            
                                "if var and var2 == getSomeValue()" in python - if the first is false, is the second statement evaluated?'
                            
                                send data from LabView to Python and get back
                            
                                Python: wrap all functions in a library
                            
                                Why does \w+ match a trailing newline?
                            
                                List of IP addresses in Python to a list of CIDR
                            
                                Why doesnt Pythons += (plus equals) operator modify variables from inner functions?
                            
                                python: how do you concatenate time to a string?
                            
                                Passing JSON data to the front end using Django
                            
                                Python: Find location of data within JSON object, parse the corresponding data
                            
                                Perform a binary search for a string prefix in Python
                            
                                "sys.getrefcount()" return value

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Convert Unicode-Hex utf-8 strings to Unicode strings

Tags:

python

unicode

utf-8

Henry Thornton

People also ask

2 Answers

unutbu

Mark Tolonen

Recent Activity

Donate For Us