Downloading an image from a URL with Python urllib fails with HTTP Error 403: Forbidden

Tags:

url

python-3.x

image

download

urllib

I want to download an image file from a URL using the Python module "urllib.request". This works for some websites (e.g. mangastream.com), but for another (mangadoom.co) it fails with the error "HTTP Error 403: Forbidden". What could be the problem in the latter case, and how can I fix it?

I am using Python 3.4 on OS X.

import urllib.request

# does not work
img_url = 'http://mangadoom.co/wp-content/manga/5170/886/005.png'
img_filename = 'my_img.png'
urllib.request.urlretrieve(img_url, img_filename)

The error message ends with:

... 
HTTPError: HTTP Error 403: Forbidden

However, it works for another website:

# works
img_url = 'http://img.mangastream.com/cdn/manga/51/3140/006.png'
img_filename = 'my_img.png'
urllib.request.urlretrieve(img_url, img_filename)

I have tried the solutions from the posts below, but none of them work on mangadoom.co.

Downloading a picture via urllib and python

How do I copy a remote image in python?

The solution in the following post does not fit either, because my case is downloading an image: urllib2.HTTPError: HTTP Error 403: Forbidden

Non-Python solutions are also welcome. Any suggestions would be much appreciated.


asked Jan 09 '16 09:01

neobot

2 Answers

This website is blocking the user-agent used by urllib, so you need to change it in your request. Unfortunately I don't think urlretrieve supports this directly.
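As a quick check of this explanation, the User-Agent that urllib sends by default can be inspected without any network access. This is only a minimal sketch; the helper name default_user_agent is hypothetical and not part of urllib.

```python
import urllib.request


def default_user_agent():
    """Return the User-Agent value that a default urllib opener sends."""
    opener = urllib.request.build_opener()
    # opener.addheaders is a list of (header name, value) tuples
    for name, value in opener.addheaders:
        if name.lower() == "user-agent":
            return value
    return None


print(default_user_agent())  # a string of the form "Python-urllib/<version>"
```

Some servers reject requests that carry this default value, which would be consistent with the 403 seen here.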

I recommend the excellent requests library. The code becomes (from here):

import requests
import shutil

r = requests.get('http://mangadoom.co/wp-content/manga/5170/886/005.png', stream=True)
if r.status_code == 200:
    with open("img.png", 'wb') as f:
        r.raw.decode_content = True
        shutil.copyfileobj(r.raw, f)

Note that this website does not seem to forbid the default requests user-agent. But if it needs to be changed, that is easy:

r = requests.get('http://mangadoom.co/wp-content/manga/5170/886/005.png',
                 stream=True, headers={'User-agent': 'Mozilla/5.0'})

Also relevant: changing user-agent in urllib

answered Sep 20 '22 12:09

Benoit Seguin

You can build an opener. Here's the example:

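import urllib.request

# Build an opener that sends a browser-like User-Agent with every request
opener = urllib.request.build_opener()
opener.addheaders = [('User-Agent', 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1941.0 Safari/537.36')]
# Install it globally so that urlretrieve() uses it
urllib.request.install_opener(opener)

url = ''    # URL of the image to download
local = ''  # local filename to save it as
urllib.request.urlretrieve(url, local)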

By the way, the following two snippets are roughly equivalent:

(without an opener)

req = urllib.request.Request(url, data, hdr)
html = urllib.request.urlopen(req)

(with an opener built as above)

html = opener.open(url, data, timeout)

However, urlretrieve() takes no headers argument, so we cannot add a header directly when we use:

urllib.request.urlretrieve()

So in this case, we have to build an opener.
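If installing a global opener is not desirable, one alternative is to skip urlretrieve() and pass a Request carrying its own headers to urlopen(), then copy the response to a file. The following is a minimal sketch, assuming a generic 'Mozilla/5.0' User-Agent is enough for the site in question; make_request and download are hypothetical helper names, not part of urllib.

```python
import shutil
import urllib.request

# Assumption: a generic browser-like value; some sites may require a fuller string.
DEFAULT_UA = "Mozilla/5.0"


def make_request(url, user_agent=DEFAULT_UA):
    """Build a Request that carries a custom User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def download(url, filename, user_agent=DEFAULT_UA):
    """Fetch url with the given User-Agent and write the body to filename."""
    req = make_request(url, user_agent)
    # urlopen raises urllib.error.HTTPError on responses such as 403
    with urllib.request.urlopen(req) as response, open(filename, "wb") as out:
        shutil.copyfileobj(response, out)


# Example call (performs a network request, so it is left commented out):
# download('http://mangadoom.co/wp-content/manga/5170/886/005.png', 'my_img.png')
```

The header here applies only to this one request, so other code that uses urllib in the same process is not affected.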

answered Sep 18 '22 12:09

er.Zhu
