For some reason I get an HTTP Error 403: Forbidden when I try opening the page http://questionablecontent.net. I used to get a robots.txt error, but that has been solved. Additionally, I can't even find their robots.txt file.
I can still view the webpage from Chrome, so what I'm wondering is: does mechanize look different from Chrome even after setting the appropriate headers?
Here is my code (which does not work):

import mechanize
import cookielib  # Python 2; on Python 3 this module is http.cookiejar

br = mechanize.Browser()

# Keep cookies between requests
cj = cookielib.LWPCookieJar()
br.set_cookiejar(cj)

br.set_handle_equiv(True)
br.set_handle_redirect(True)
br.set_handle_robots(False)  # don't fetch or obey robots.txt
br.set_handle_refresh(mechanize._http.HTTPRefreshProcessor(), max_time=1)

# Pretend to be Firefox
br.addheaders = [('User-agent', 'Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.0.1) Gecko/2008071615 Fedora/3.0.1-1.fc9 Firefox/3.0.1')]

br.open('http://questionablecontent.net')  # raises HTTP Error 403: Forbidden
I also tried setting the addheaders to the same headers as my browser (which I found here):
br.addheaders = [('User-agent','Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/28.0.1500.72 Safari/537.36')]
... but that didn't work either.
Finally, I tried using Selenium, and that worked, since it loads the page in Chrome itself and then communicates with Python. However, I would still like to get this working with mechanize, and I'm still unsure how Chrome and mechanize look different to the server.
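For reference, here is roughly what the working Selenium version looks like (a minimal sketch; it assumes chromedriver is installed and on your PATH):

from selenium import webdriver

driver = webdriver.Chrome()   # launches a real Chrome, so the server sees genuine browser traffic
driver.get('http://questionablecontent.net')
html = driver.page_source     # the rendered page HTML
driver.quit()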
The HTTP 403 Forbidden response status code indicates that the server understands the request but refuses to authorize it. This status is similar to 401, but for the 403 Forbidden status code, re-authenticating makes no difference.
Check the requested URL. The most common cause of a 403 Forbidden error is simply inputting an incorrect URL. As discussed before, many tightly secured web servers disallow access to improper URLs. This could be anything from accessing a file directory to accessing a private page meant for other users.
403 Forbidden is used when access to the resource is forbidden to everyone, restricted to a given network, or allowed only over SSL; whatever the reason, as long as it is not related to HTTP authentication.
403 Forbidden indicates that authentication was successful (otherwise the server would return 401 Unauthorized), but the authenticated user does not have access to the resource, e.g. they lack the required roles or permissions.
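If you want to see which of these cases you are hitting, you can catch the error mechanize raises and inspect it (a sketch; mechanize re-exports a urllib2-style HTTPError, and the WWW-Authenticate header is only expected on a 401):

import mechanize

br = mechanize.Browser()
br.set_handle_robots(False)
try:
    br.open('http://questionablecontent.net')
except mechanize.HTTPError as e:
    print(e.code)                              # 403
    print(e.headers.get('WWW-Authenticate'))   # set on 401 responses, normally absent on 403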
The trick is probably in the request headers Selenium is sending. Apart from the user agent header, some servers check other headers as well to ensure a real browser is talking to them. Look at one of my older answers:
urllib2.HTTPError: HTTP Error 403: Forbidden
In your place, I would try adding all the headers your real Chrome browser sends, and then eliminating the unnecessary ones.
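As a starting point, here is a sketch of that approach with mechanize (the header values are the ones Chrome 28 sends and are only illustrative; capture your own from DevTools. Accept-Encoding is left out on purpose, since mechanize will not gunzip a compressed response for you):

import mechanize

br = mechanize.Browser()
br.set_handle_robots(False)

# Mimic a full set of Chrome request headers, not just the user agent
br.addheaders = [
    ('User-Agent', 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/28.0.1500.72 Safari/537.36'),
    ('Accept', 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8'),
    ('Accept-Language', 'en-US,en;q=0.8'),
    ('Connection', 'keep-alive'),
]

response = br.open('http://questionablecontent.net')
print(response.code)  # 200 once the server is satisfied; drop headers one by one to find which matter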