TCP keep-alive gets involved after TCP zero-window and closes the connection erroneously

Tags:

We're seeing this pattern happen a lot between two RHEL 6 boxes that are transferring data via a TCP connection. The client issues a TCP Window Full, 0.2s later the client sends TCP Keep-Alives, to which the server responds with what look like correctly shaped responses. The client is unsatisfied by this however and continues sending TCP Keep-Alives until it finally closes the connection with an RST nearly 9s later.

This is despite the RHEL boxes having the default TCP Keep-Alive configuration:

net.ipv4.tcp_keepalive_time = 7200
net.ipv4.tcp_keepalive_probes = 9
net.ipv4.tcp_keepalive_intvl = 75

...which declares that this should only occur until 2hrs of silence. Am I reading my PCAP wrong (relevant packets available on request)?

Below is Wireshark screenshot of the pattern, with my own packet notes in the middle.

Wireshark screenshot

945

asked Nov 09 '15 22:11

Martin Cowie

1 Answers

Actually, these "keep-alive" packets are not used for TCP keep-alive! They are used for window size updates detection.

Wireshark treats them as keep-alive packets just because these packets look like keep-alive packet.

A TCP keep-alive packet is simply an ACK with the sequence number set to one less than the current sequence number for the connection.

(We assume that ip 10.120.67.113 refers to host A, 10.120.67.132 refers to host B.) In packet No.249511, A acks seq 24507484. In next packet(No.249512), B send seq 24507483(24507484-1).

enter image description here

Why there are so many "keep-alive" packets, what are they used for?

A sends data to B, and B replies zero-window size to tell A that he temporarily can't receive data anymore. In order to assure that A knows when B can receive data again, A send "keep-alive" packet to B again and again with persistence timer, B replies to A with his window size info (In our case, B's window size has always been zero).

And the normal TCP exponential backoff is used when calculating the persist timer. So we can see that A send its first "keep-alive" packet after 0.2s, send its second packet after 0.4s, the third is sent after 0.8, the fouth is sent after 1.6s...

This phenomenon is related to TCP flow control.

105

answered Oct 15 '22 07:10

cosven

Related questions
                            
                                How can I make net.Read wait for input in golang?
                            
                                What is duplicate ACK when does it occur?
                            
                                TCP/IP Protocol stack without an OS
                            
                                twisted - get OS-chosen listen port
                            
                                Difference between resolving a query and creating an endpoint with IP and port (in boost asio)
                            
                                Ncat "bad file descriptor" error upon client connection
                            
                                Python - How to check if socket is still connected
                            
                                The trait bound `(): futures::Future` is not satisfied when using TcpConnectionNew
                            
                                Exception from lambda expressions
                            
                                Sending data to logstash via tcp
                            
                                Gauging a web browser's bandwidth
                            
                                Why is the maximum port range 65535 in the TCP/IP Suite?
                            
                                "Repair" network connections programmatically/from command line
                            
                                When *exactly* is a socket ready to write?
                            
                                Tcp connections hang on CLOSE_WAIT status
                            
                                How to mock an outgoing Socket connection?
                            
                                Socket servers, SocketAsyncEventArgs and concurrent connections in .Net
                            
                                S3 Upload with pycurl interrupts
                            
                                Youtube Video Streaming protocol
                            
                                Can't use ServerSocket on Android

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

TCP keep-alive gets involved after TCP zero-window and closes the connection erroneously

Tags:

tcp

wireshark

keep-alive

Martin Cowie

People also ask

1 Answers

cosven

Recent Activity

Donate For Us