Python3 urllib.request will not close connections immediately

Tags:

I've got the following code to run a continuous loop to fetch some content from a website:

from http.cookiejar import CookieJar
from urllib import request

cj = CookieJar()
cp = request.HTTPCookieProcessor(cj)
hh = request.HTTPHandler()
opener = request.build_opener(cp, hh)

while True:
    # build url
    req = request.Request(url=url)
    p = opener.open(req)
    c = p.read()
    # process c
    p.close()
    # check for abort condition, or continue

The contents are correctly read. But for some reason, the TCP connections won't close. I'm observing the active connection count from a dd-wrt router interface, and it goes up consistently. If the script continue to run, it'll exhaust the 4096 connection limit of the router. When this happens, the script simply enter waiting state (the router won't allow new connections, but timeout hasn't hit yet). After couple minutes, those connections will be closed and the script can resume again.

I was able to observe the state of those hanging connections from the router. They share the same state: TIME_WAIT .

I'm expecting this script to use no more than 1 TCP connection simultaneously. What am I doing wrong?

I'm using Python 3.4.2 on Mac OS X 10.10.

267

asked Nov 09 '14 08:11

He Shiming

1 Answers

Through some research, I discovered the cause of this problem: the design of TCP protocol . In a nutshell, when you disconnect, the connection isn't dropped immediately, it enters 'TIME_WAIT' state, and will time out after 4 minutes. Unlike what I was expecting, the connection doesn't immediately disappear.

According to this question, it's also not possible to forcefully drop a connection (without restarting the network stack).

It turns out in my particular case, like this question stated, a better option would be to use a persistent connection, a.k.a. HTTP keep-alive. As I'm querying the same server, this will work.

177

answered Oct 16 '22 21:10

He Shiming

Related questions
                            
                                Adding a colorbar to a pcolormesh with polar projection
                            
                                Explanation of pylint bad-format-string
                            
                                Do matplotlib.contourf levels depend on the amount of colors in the colormap?
                            
                                How to build interactive menu for command-line application in python? [closed]
                            
                                Python: counting how many times a given line is executed
                            
                                What is "revirtual" in this answer?
                            
                                vim python navigate to imported files
                            
                                Why am I allowed pickle instancemethods that are Theano functions, but not normal instancemethods?
                            
                                Python: Getting all the items out of a `threading.local`
                            
                                Logarithmically scaled minor tick marks on a matplotlib colorbar?
                            
                                Send JSON response from Sqlite queries in Python
                            
                                Setting word wrap on QLabel breaks size constrains for the window
                            
                                argparse conflict resolver for options in subcommands turns keyword argument into positional argument
                            
                                pylab/networkx; no node labels displayed after update
                            
                                numpy: is it possible to preserve the dtype of columns when using column_stack
                            
                                Which timezone does Django use in DateField's auto_now_add?
                            
                                decorating a class function with a callable instance
                            
                                Change object's attribute on session commit - Flask SQLAlchemy
                            
                                Attach generated PDF in Mailgun message Django/Python
                            
                                Multiple values in single column of a pandas DataFrame

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python3 urllib.request will not close connections immediately

Tags:

python

python-3.x

macos

urllib

He Shiming

People also ask

1 Answers

He Shiming

Recent Activity

Donate For Us