Basically, I am trying to download a URL using urllib2 in python.
the code is the following:
import urllib2 req = urllib2.Request('www.tattoo-cover.co.uk') req.add_header('User-agent','Mozilla/5.0') result = urllib2.urlopen(req)
it outputs ValueError and the program crushes for the URL in the example. When I access the url in a browser, it works fine.
Any ideas how to handle the problem?
UPDATE:
thanks for Ben James and sth the problem is detected => add 'http://'
Now the question is refined: Is it possible to handle such cases automatically with some builtin function or I have to do error handling with subsequent string concatenation?
When you enter a URL in a browser without the protocol, it defaults to HTTP. urllib2
won't make that assumption for you; you need to prefix it with http://
.
You have to use a complete URL including the protocol, not just specify a host name.
The correct URL would be http://www.tattoo-cover.co.uk/
.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With