Python: Get HTTP headers from urllib2.urlopen call?

Tags: python, urllib, forwarding

Does urllib2 fetch the whole page when a urlopen call is made?

I'd like to just read the HTTP response header without getting the page. It looks like urllib2 opens the HTTP connection and then subsequently gets the actual HTML page... or does it just start buffering the page with the urlopen call?

    import urllib2

    myurl = 'http://www.kidsidebyside.org/2009/05/come-and-draw-the-circle-of-unity-with-us/'
    page = urllib2.urlopen(myurl)  # open connection, get headers
    html = page.readlines()        # stream page


asked May 09 '09 14:05

shigeta

2 Answers

Use the response.info() method to get the headers.

From the urllib2 docs:

urllib2.urlopen(url[, data][, timeout])

...

This function returns a file-like object with two additional methods:

geturl() — return the URL of the resource retrieved, commonly used to determine if a redirect was followed

info() — return the meta-information of the page, such as headers, in the form of an httplib.HTTPMessage instance (see Quick Reference to HTTP Headers)

So, for your example, call info() on the object returned by urlopen (page in your code) and step through the result of info().headers, a list of the raw header lines, for what you're looking for.

Note that the major caveat to using httplib.HTTPMessage is documented in Python issue 4773.
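For readers on Python 3, here is a minimal sketch of the same idea. It assumes Python 3, where urllib2.urlopen became urllib.request.urlopen; fetch_headers is a hypothetical helper name and the 10-second timeout is an arbitrary choice, neither comes from the question.

```python
# Python 3 sketch: urllib2.urlopen is urllib.request.urlopen here.
from urllib.request import urlopen


def fetch_headers(url, timeout=10):
    """Return the response headers of a GET request as (name, value) pairs.

    fetch_headers is a hypothetical helper, not part of the standard library.
    """
    # urlopen returns once the status line and headers have been parsed.
    # This function leaves the with block without calling read(), so the
    # body is never read by this code.
    with urlopen(url, timeout=timeout) as response:
        # info() returns the message object holding the headers;
        # items() gives them as a list of (name, value) tuples.
        return response.info().items()
```

Calling fetch_headers(myurl) with the URL from the question should return whatever headers that server sends; the exact names and values depend on the server.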

answered Oct 10 '22 23:10

tolmeda

What about sending a HEAD request instead of a normal GET request? The following snippet (copied from a similar question) does exactly that.

    >>> import httplib
    >>> conn = httplib.HTTPConnection("www.google.com")
    >>> conn.request("HEAD", "/index.html")
    >>> res = conn.getresponse()
    >>> print res.status, res.reason
    200 OK
    >>> print res.getheaders()
    [('content-length', '0'), ('expires', '-1'), ('server', 'gws'), ('cache-control', 'private, max-age=0'), ('date', 'Sat, 20 Sep 2008 06:43:36 GMT'), ('content-type', 'text/html; charset=ISO-8859-1')]
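The same HEAD request can be sketched for Python 3 as well. This assumes Python 3.3 or later, where urllib.request.Request accepts a method argument; head_headers is a hypothetical helper name and the timeout value is an arbitrary choice.

```python
# Python 3 sketch of a HEAD request using urllib.request.
from urllib.request import Request, urlopen


def head_headers(url, timeout=10):
    """Send a HEAD request and return (status, headers).

    head_headers is a hypothetical helper, not part of the standard library.
    headers is a list of (name, value) tuples, so repeated headers are kept.
    """
    # method="HEAD" asks the server for the status line and headers only;
    # a compliant server sends no body in its reply.
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=timeout) as response:
        return response.status, response.headers.items()
```

Unlike the httplib snippet, this version takes a full URL rather than a separate host and path. Some servers answer HEAD differently from GET or reject it, so the result is worth checking against a normal request when the headers matter.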

answered Oct 10 '22 23:10

reto
