Python requests is slow

Tags:

python

download

urllib

python-requests

urllib2

I am developing a download manager and am using the requests module in Python to check whether a link is valid (and, hopefully, to detect broken links). My code for checking a link is below:

import requests

url = 'http://pyscripter.googlecode.com/files/PyScripter-v2.5.3-Setup.exe'
r = requests.get(url, allow_redirects=False)  # this line takes 40 seconds
if r.status_code == 200:
    print("link valid")
else:
    print("link invalid")

The problem is that this check takes approximately 40 seconds, which is far too long. How can I speed this up, perhaps by using urllib2 or something similar?

Note: if I replace url with the actual URL, 'http://pyscripter.googlecode.com/files/PyScripter-v2.5.3-Setup.exe', the request takes about one second, so this appears to be an issue with requests.

220

asked Apr 03 '13 06:04

scandalous

2 Answers

Not all hosts support HEAD requests. You can use this instead:

r = requests.get(url, stream=True)

With stream=True, requests downloads only the headers; the response body is not fetched until you access it. Moreover, if the idea is to download the file afterwards, you don't have to make another request.

See the Requests documentation on streaming responses ("Body Content Workflow") for more information.
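As a rough sketch of how this could be wrapped into a link check: the helper name `link_is_valid`, the optional `session` parameter, and the 10-second timeout are assumptions for illustration, not part of the question. The response is closed explicitly because the body is never read.

```python
import requests


def link_is_valid(url, timeout=10, session=None):
    """Return True if the server answers 200 for url, without reading the body."""
    # 'session' is a hypothetical hook: pass a requests.Session to reuse
    # connections; by default the module-level requests.get is used.
    http = session or requests
    try:
        # stream=True returns as soon as the headers arrive; the body is
        # only fetched if r.content or r.iter_content() is accessed.
        r = http.get(url, stream=True, allow_redirects=False, timeout=timeout)
    except requests.RequestException:
        # Connection errors, timeouts, malformed URLs, and so on.
        return False
    try:
        return r.status_code == 200
    finally:
        # Release the connection, since the body is never consumed.
        r.close()
```

Calling `link_is_valid(url)` with the URL from the question would then replace the `requests.get` / `status_code` check; a timeout is worth setting in any case, since requests waits indefinitely by default.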

163

answered Sep 30 '22 17:09

michaelmeyer

Don't use get, which actually retrieves the file. Use head instead:

r = requests.head(url, allow_redirects=False)

On my machine, this takes the check from 6.9 seconds down to 0.4 seconds.
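Since some hosts do not support HEAD, one possible way to combine the two suggestions on this page is to try HEAD first and fall back to a streamed GET. This is only a sketch: the helper name `check_link`, the `session` parameter, the timeout value, and the choice of 405 and 501 as the "HEAD not supported" signals are assumptions, and some servers may reject HEAD with a different status.

```python
import requests


def check_link(url, timeout=10, session=None):
    """Return the HTTP status code for url, or None if the request failed."""
    # 'session' is a hypothetical hook for passing a requests.Session.
    http = session or requests
    try:
        r = http.head(url, allow_redirects=False, timeout=timeout)
        if r.status_code in (405, 501):
            # Assumed meaning: the server does not accept HEAD, so retry
            # with a GET that defers the body download.
            r.close()
            r = http.get(url, stream=True, allow_redirects=False, timeout=timeout)
        status = r.status_code
        r.close()
        return status
    except requests.RequestException:
        # Connection errors, timeouts, malformed URLs, and so on.
        return None


if __name__ == "__main__":
    url = 'http://pyscripter.googlecode.com/files/PyScripter-v2.5.3-Setup.exe'
    print("link valid" if check_link(url) == 200 else "link invalid")
```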

answered Sep 30 '22 17:09

Jon Clements
