python requests module and connection reuse

Tags:

I am working with python's requests module for HTTP communication, and I am wondering how to reuse already-established TCP connections? The requests module is stateless and if I repeatedly call get for the same URL, wouldn't it create a new connection each time?

Thanks!!

402

asked Jul 21 '14 20:07

gmemon

2 Answers

Global functions like requests.get or requests.post create the requests.Session instance on each call. Connections made with these functions cannot be reused, because you cannot access automatically created session and use it's connection pool for subsequent requests. It's fine to use these functions if you have to do just a few requests. Otherwise you'll want to manage sessions yourself.

Here is a quick display of requests behavior when you use global get function and session.

Preparation, not really relevant to the question:

>>> import logging, requests, timeit >>> logging.basicConfig(level=logging.DEBUG, format="%(message)s")

See, a new connection is established each time you call get:

>>> _ = requests.get("https://www.wikipedia.org") Starting new HTTPS connection (1): www.wikipedia.org >>> _ = requests.get("https://www.wikipedia.org") Starting new HTTPS connection (1): www.wikipedia.org

But if you use the same session for subsequent calls, the connection gets reused:

>>> session = requests.Session() >>> _ = session.get("https://www.wikipedia.org") Starting new HTTPS connection (1): www.wikipedia.org >>> _ = session.get("https://www.wikipedia.org") >>> _ = session.get("https://www.wikipedia.org") >>> _ = session.get("https://www.wikipedia.org")

Performance:

>>> timeit.timeit('_ = requests.get("https://www.wikipedia.org")', 'import requests', number=100) Starting new HTTPS connection (1): www.wikipedia.org Starting new HTTPS connection (1): www.wikipedia.org Starting new HTTPS connection (1): www.wikipedia.org ... Starting new HTTPS connection (1): www.wikipedia.org Starting new HTTPS connection (1): www.wikipedia.org Starting new HTTPS connection (1): www.wikipedia.org 52.74904417991638 >>> timeit.timeit('_ = session.get("https://www.wikipedia.org")', 'import requests; session = requests.Session()', number=100) Starting new HTTPS connection (1): www.wikipedia.org 15.770191192626953

Works much faster when you reuse the session (and thus session's connection pool).

answered Sep 20 '22 00:09

Діма Киричук

The requests module is stateless and if I repeatedly call get for the same URL, wouldnt it create a new connection each time?

The requests module is not stateless; it just lets you ignore the state and effectively use a global singleton state if you choose to do so.*

And it (or, rather, one of the underlying libraries, urllib3) maintains a connection pool keyed by (hostname, port) pair, so it will usually just magically reuse a connection if it can.

As the documentation says:

Excellent news — thanks to urllib3, keep-alive is 100% automatic within a session! Any requests that you make within a session will automatically reuse the appropriate connection!

Note that connections are only released back to the pool for reuse once all body data has been read; be sure to either set stream to False or read the content property of the Response object.

So, what does "if it can" mean? As the docs above imply, if you're keeping streaming response objects alive, their connections obviously can't be reused.

Also, the connection pool is really a finite cache, not infinite, so if you spam out a ton of connections and two of them are to the same server, you won't always reuse the connection, just often. But usually, that's what you actually want.

_{* The particular state relevant here is the transport adapter. Each session gets a transport adapter. You can specify the adapter manually, or you can specify a global default, or you can just use the default global default, which basically just wraps up a urllib3.PoolManager for managing its HTTP connections. For more information, read the docs.}

answered Sep 22 '22 00:09

abarnert

Related questions
                            
                                Generating Symmetric Matrices in Numpy
                            
                                Parsing date with timezone from an email?
                            
                                How to add third-party Java JAR files for use in PySpark
                            
                                python/pandas: convert month int to month name
                            
                                How to explain the int() function to a beginner
                            
                                sort csv by column
                            
                                usleep in Python
                            
                                networkx add_node with specific position
                            
                                How to install SimpleJson Package for Python
                            
                                How do I subtract two dates in Django/Python?
                            
                                How do you set a conditional in python based on datatypes?
                            
                                Writing UTF-8 String to MySQL with Python
                            
                                Bottle framework and OOP, using method instead of function
                            
                                Python - Download Images from google Image search?
                            
                                Running a Python script outside of Django
                            
                                differences between "d = dict()" and "d = {}"
                            
                                Possible to append multiple lists at once? (Python)
                            
                                Convert percent string to float in pandas read_csv
                            
                                In Python, is it better to use list comprehensions or for-each loops?
                            
                                Find the root of the git repository where the file lives

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

python requests module and connection reuse

Tags:

python

python-requests

keep-alive

gmemon

People also ask

2 Answers

Діма Киричук

abarnert

Recent Activity

Donate For Us