I have a unicode string like "Tanım" which is encoded as "Tan%u0131m" somehow. How can i convert this encoded string back to original unicode. Apparently urllib.unquote does not support unicode.

%uXXXX is a non-standard encoding scheme that has been rejected by the w3c, despite the fact that an implementation continues to live on in JavaScript land. The more common technique seems to be to UTF-8 encode the string and then % escape the resulting bytes using %XX. This scheme is supported by urllib.unquote: <pre class="prettyprint"><code>>>> urllib2.unquote("%0a") '\n' </code></pre> Unfortunately, if you really need to support %uXXXX, you will probably have to roll your own decoder. Otherwise, it is likely to be far more preferable to simply UTF-8 encode your unicode and then % escape the resulting bytes. A more complete example: <pre class="prettyprint"><code>>>> u"Tanım" u'Tan\u0131m' >>> url = urllib.quote(u"Tanım".encode('utf8')) >>> urllib.unquote(url).decode('utf8') u'Tan\u0131m' </code></pre>

How to unquote a urlencoded unicode string in python?

2 Answers

%uXXXX is a non-standard encoding scheme that has been rejected by the w3c, despite the fact that an implementation continues to live on in JavaScript land.

The more common technique seems to be to UTF-8 encode the string and then % escape the resulting bytes using %XX. This scheme is supported by urllib.unquote:

>>> urllib2.unquote("%0a") '\n'

Unfortunately, if you really need to support %uXXXX, you will probably have to roll your own decoder. Otherwise, it is likely to be far more preferable to simply UTF-8 encode your unicode and then % escape the resulting bytes.

A more complete example:

>>> u"Tanım" u'Tan\u0131m' >>> url = urllib.quote(u"Tanım".encode('utf8')) >>> urllib.unquote(url).decode('utf8') u'Tan\u0131m'

200

answered Sep 22 '22 21:09

Aaron Maenpaa

def unquote(text):     def unicode_unquoter(match):         return unichr(int(match.group(1),16))     return re.sub(r'%u([0-9a-fA-F]{4})',unicode_unquoter,text)

answered Sep 22 '22 21:09

Markus Jarderot

Related questions
                            
                                Does the SVM in sklearn support incremental (online) learning?
                            
                                SQLite Performance Benchmark -- why is :memory: so slow...only 1.5X as fast as disk?
                            
                                Computing diffs within groups of a dataframe
                            
                                Custom loss function in Keras
                            
                                Python: next() function
                            
                                Resource usage of google Go vs Python and Java on Appengine
                            
                                Time Series Decomposition function in Python
                            
                                Global error handler for any exception
                            
                                What is the difference between __init__.py and __main__.py? [duplicate]
                            
                                Is there an R equivalent of the pythonic "if __name__ == "__main__": main()"?
                            
                                Python: How to show matplotlib in flask [duplicate]
                            
                                Using Numpy Vectorize on Functions that Return Vectors
                            
                                Why is variable1 += variable2 much faster than variable1 = variable1 + variable2?
                            
                                How to rearrange array based upon index array
                            
                                Using Merge on a column and Index in Pandas
                            
                                Returning multiple values from pandas apply on a DataFrame
                            
                                Why is startswith slower than slicing
                            
                                Apply function to pandas groupby
                            
                                Relative import in Python 3 is not working [duplicate]
                            
                                How to handle a broken pipe (SIGPIPE) in python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to unquote a urlencoded unicode string in python?

Tags:

python

character-encoding

unicode

urllib

w3c

hamdiakoguz

People also ask

2 Answers

Aaron Maenpaa

Markus Jarderot

Recent Activity

Donate For Us