Python HTTP HEAD - dealing with redirects properly?

I can use urllib2 to make HEAD requests like so:

import urllib2
request = urllib2.Request('http://example.com')
request.get_method = lambda: 'HEAD'
urllib2.urlopen(request)

The problem is that when this follows a redirect, it appears to use GET instead of HEAD.

The purpose of this HEAD request is to check the size and content type of the URL I'm about to download so that I can ensure that I don't download some huge document. (The URL is supplied by a random internet user through IRC).

How could I make it use HEAD requests when following redirects?

asked Apr 01 '12 19:04

Krenair

2 Answers

You can do this with the requests library:

>>> import requests
>>> r = requests.head('http://github.com', allow_redirects=True)
>>> r
<Response [200]>
>>> r.history
[<Response [301]>]
>>> r.url
u'https://github.com/'
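
Once the redirects have been followed, the final response's headers can be checked against a size and type limit before downloading anything. A rough sketch of such a check follows; the name is_acceptable, the 5 MB default and the allowed types are all assumptions made up for this example, not part of requests:

```python
def is_acceptable(headers, max_bytes=5 * 1024 * 1024,
                  allowed_types=("text/html", "text/plain")):
    """Return True only if the headers declare an allowed type and a small enough size.

    `headers` is any mapping with a .get() method. The lookup is only
    case-insensitive if the mapping itself is (a plain dict is not).
    """
    content_type = headers.get("Content-Type", "")
    # Drop parameters such as "; charset=utf-8" before comparing.
    media_type = content_type.split(";", 1)[0].strip().lower()
    if media_type not in allowed_types:
        return False

    length = headers.get("Content-Length")
    if length is None or not length.strip().isdigit():
        # No usable size was declared, so treat the size as unknown and refuse.
        return False
    return int(length) <= max_bytes
```

With the response from above, that would be used as:

>>> is_acceptable(r.headers)

Servers can omit or misreport Content-Length, so it is probably still worth capping the number of bytes read during the actual download.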

answered Sep 28 '22 07:09

jterrace

Good question! If you're set on using urllib2, you'll want to look at this answer about the construction of your own redirect handler.

In short (read: blatantly stolen from the previous answer):

import urllib2

class MyHTTPRedirectHandler(urllib2.HTTPRedirectHandler):
    # Called for each 302 response; put custom logic here before delegating
    # to the stock handler, which issues the follow-up request.
    def http_error_302(self, req, fp, code, msg, headers):
        print "Cookie Manip Right Here"
        return urllib2.HTTPRedirectHandler.http_error_302(self, req, fp, code, msg, headers)

    # Reuse the same hook for the other redirect status codes.
    http_error_301 = http_error_303 = http_error_307 = http_error_302

cookieprocessor = urllib2.HTTPCookieProcessor()

# Install an opener that uses the custom redirect handler and tracks cookies.
opener = urllib2.build_opener(MyHTTPRedirectHandler, cookieprocessor)
urllib2.install_opener(opener)

response = urllib2.urlopen("WHEREEVER")  # placeholder: substitute the target URL
print response.read()

print cookieprocessor.cookiejar
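
The handler above only hooks into the redirect; it does not by itself change the method of the follow-up request. One way to keep HEAD across redirects is to override redirect_request and copy the method onto the new request. Here is a minimal sketch of that idea, assuming Python 3, where urllib2 became urllib.request. HeadRedirectHandler and head_following_redirects are names made up for this example:

```python
import urllib.request


class HeadRedirectHandler(urllib.request.HTTPRedirectHandler):
    """Follow redirects, re-issuing HEAD when the original request was HEAD."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        # Let the stock handler validate the redirect and build the new request.
        new_req = super().redirect_request(req, fp, code, msg, headers, newurl)
        if new_req is not None and req.get_method() == "HEAD":
            # Request.get_method() returns this attribute when it is set.
            new_req.method = "HEAD"
        return new_req


def head_following_redirects(url, timeout=10):
    """Send HEAD to `url`, follow redirects with HEAD, return (final_url, headers)."""
    # Passing the subclass replaces the default HTTPRedirectHandler.
    opener = urllib.request.build_opener(HeadRedirectHandler)
    request = urllib.request.Request(url, method="HEAD")
    with opener.open(request, timeout=timeout) as response:
        return response.geturl(), response.headers
```

The returned headers can then be inspected for Content-Length and Content-Type before deciding whether to download.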

Alternatively, as the other answer notes, you can use the Requests library.

answered Sep 28 '22 08:09

MrGomez

Tags: python, redirect, head, urllib2