I have a list containing URLs with escaped characters in them. Those characters have been set by <code>urllib2.urlopen</code> when it recovers the html page: <pre class="prettyprint"><code>http://www.sample1webpage.com/index.php?title=%E9%A6%96%E9%A1%B5&action=edit http://www.sample1webpage.com/index.php?title=%E9%A6%96%E9%A1%B5&action=history http://www.sample1webpage.com/index.php?title=%E9%A6%96%E9%A1%B5&variant=zh </code></pre> Is there a way to transform them back to their unescaped form in python? P.S.: The URLs are encoded in utf-8

Using <code>urllib</code> package (<code>import urllib</code>) : <h3>Python 2.7</h3> From official documentation : <blockquote> <code>urllib.unquote(string)</code> Replace <code>%xx</code> escapes by their single-character equivalent. Example: <code>unquote('/%7Econnolly/')</code> yields <code>'/~connolly/'</code>. </blockquote> <h3>Python 3</h3> From official documentation : <blockquote> <code>urllib.parse.unquote(string, encoding='utf-8', errors='replace')</code> […] Example: <code>unquote('/El%20Ni%C3%B1o/')</code> yields <code>'/El Niño/'</code>. </blockquote>

Decode escaped characters in URL

Tags:

python

escaping

I have a list containing URLs with escaped characters in them. Those characters have been set by urllib2.urlopen when it recovers the html page:

http://www.sample1webpage.com/index.php?title=%E9%A6%96%E9%A1%B5&action=edit http://www.sample1webpage.com/index.php?title=%E9%A6%96%E9%A1%B5&action=history http://www.sample1webpage.com/index.php?title=%E9%A6%96%E9%A1%B5&variant=zh

Is there a way to transform them back to their unescaped form in python?

P.S.: The URLs are encoded in utf-8

343

asked Nov 15 '11 13:11

Tony

1 Answers

Using urllib package (import urllib) :

Python 2.7

From official documentation :

urllib.unquote(string)

Replace %xx escapes by their single-character equivalent.

Example: unquote('/%7Econnolly/') yields '/~connolly/'.

Python 3

From official documentation :

urllib.parse.unquote(string, encoding='utf-8', errors='replace')

[…]

Example: unquote('/El%20Ni%C3%B1o/') yields '/El Niño/'.

121

answered Nov 15 '22 13:11

Ignacio Vazquez-Abrams

Related questions
                            
                                Function chaining in Python
                            
                                Inheritance of private and protected methods in Python
                            
                                ERROR: Could not build wheels for scipy which use PEP 517 and cannot be installed directly
                            
                                How do I autoformat some Python code to be correctly formatted?
                            
                                Pandas sum by groupby, but exclude certain columns
                            
                                Flask Value error view function did not return a response [duplicate]
                            
                                Slicing a list in Python without generating a copy
                            
                                get UTC timestamp in python with datetime
                            
                                Check if all elements of a list are of the same type
                            
                                python: how to check if a line is an empty line
                            
                                How to execute Python scripts in Windows?
                            
                                How to add multiple values to a dictionary key in python? [closed]
                            
                                Django import error - no module named django.conf.urls.defaults
                            
                                Python : Get size of string in bytes
                            
                                Python: printing a file to stdout
                            
                                Regular expression syntax for "match nothing"?
                            
                                Python - Passing a function into another function
                            
                                Large, persistent DataFrame in pandas
                            
                                Detect python version in shell script
                            
                                Variable defined with with-statement available outside of with-block?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With