How to use SO_KEEPALIVE option properly to detect that the client at the other end is down?

Tags:

I was trying to learn the usage of option SO_KEEPALIVE in socket programming in C language under Linux environment.

I created a server socket and used my browser to connect to it. It was successful and I was able to read the GET request, but I got stuck on the usage of SO_KEEPALIVE.

I checked this link [email protected] but I could not find any example which shows how to use it.

As soon as I detect the client's request on accept() function I set the SO_KEEPALIVE option value 1 on the client socket. Now I don't know, how to check if the client is down, how to change the time interval between the probes sent etc.

I mean, how will I get the signal that the client is down? (Without reading or writing at the client - I thought I will get some signal when probes are not replied back from client), how should I program it after setting the option SO_KEEPALIVE on).

Also if suppose the probes are sent every 3 secs and the client goes down in between I will not get to know that client is down and I may get SIGPIPE.

Anyways importantly I wanna know how to use SO_KEEPALIVE in the code.

627

asked Mar 25 '11 16:03

Durin

3 Answers

To modify the number of probes or the probe intervals, you write values to the /proc filesystem like

 echo 600 > /proc/sys/net/ipv4/tcp_keepalive_time  echo 60 > /proc/sys/net/ipv4/tcp_keepalive_intvl  echo 20 > /proc/sys/net/ipv4/tcp_keepalive_probes

Note that these values are global for all keepalive enabled sockets on the system, You can also override these settings on a per socket basis when you set the setsockopt, see section 4.2 of the document you linked.

You can't "check" the status of the socket from userspace with keepalive. Instead, the kernel is simply more aggressive about forcing the remote end to acknowledge packets, and determining if the socket has gone bad. When you attempt to write to the socket, you will get a SIGPIPE if keepalive has determined remote end is down.

122

answered Oct 24 '22 20:10

bdk

You'll get the same result if you enable SO_KEEPALIVE, as if you don't enable SO_KEEPALIVE - typically you'll find the socket ready and get an error when you read from it.

You can set the keepalive timeout on a per-socket basis under Linux (this may be a Linux-specific feature). I'd recommend this rather than changing the system-wide setting. See the man page for tcp for more info.

Finally, if your client is a web browser, it's quite likely that it will close the socket fairly quickly anyway - most of them will only hold keepalive (HTTP 1.1) connections open for a relatively short time (30s, 1 min etc). Of course if the client machine has disappeared or network down (which is what SO_KEEPALIVE is really useful for detecting), then it won't be able to actively close the socket.

answered Oct 24 '22 21:10

MarkR

As already discussed, SO_KEEPALIVE makes the kernel more aggressive about continually verifying the connection even when you're not doing anything, but does not change or enhance the way the information is delivered to you. You'll find out when you try to actually do something (for example "write"), and you'll find out right away since the kernel is now just reporting the status of a previously set flag, rather than having to wait a few seconds (or much longer in some cases) for network activity to fail. The exact same code logic you had for handling the "other side went away unexpectedly" condition will still be used; what changes is the timing (not the method).

Virtually every "practical" sockets program in some way provides non-blocking access to the sockets during the data phase (maybe with select()/poll(), or maybe with fcntl()/O_NONBLOCK/EINPROGRESS&EWOULDBLOCK, or if your kernel supports it maybe with MSG_DONTWAIT). Assuming this is already done for other reasons, it's trivial (sometimes requiring no code at all) to in addition find out right away about a connection dropping. But if the data phase does not already somehow provide non-blocking access to the sockets, you won't find out about the connection dropping until the next time you try to do something.

(A TCP socket connection without some sort of non-blocking behaviour during the data phase is notoriously fragile, as if the wrong packet encounters a network problem it's very easy for the program to then "hang" indefinitely, and there's not a whole lot you can do about it.)

answered Oct 24 '22 21:10

Chuck Kollars

Related questions
                            
                                Hiding longjmps in C++ interface to C code
                            
                                Disable application after expiry date for trial
                            
                                lemon parser parsing 0 token
                            
                                Why do we need *.lib files? [closed]
                            
                                Optimize log entropy calculation in sparse matrix
                            
                                getting source code for linux's /bin/ss tool [closed]
                            
                                Message Truncated in MPI_Recv
                            
                                Why printf() when printing multiple strings (%s) leaves newline and how to solve this?
                            
                                Python with C libraries
                            
                                "Bad file descriptor" error when reading from pipe as stdin
                            
                                Getting The Memory Address Of A DLL Function
                            
                                Creating a new GSource in GLib
                            
                                Why do those two ways to set a variable to all 1s lead to different results?
                            
                                How generate pseudo-random numbers in uniform and gaussian distribution without float/double numbers?
                            
                                Virtual Memory on OSX/iOS versus Windows commit/reserve behaviour
                            
                                CUDA Dynamic Parallelism MakeFile
                            
                                The brace in C language char array initialization
                            
                                Turn simple C program into server using netcat
                            
                                C heap address changes between runs while other addresses persist
                            
                                Why does returning a floating-point value change its value?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to use SO_KEEPALIVE option properly to detect that the client at the other end is down?

Tags:

c

linux

sockets

keep-alive

Durin

People also ask

3 Answers

bdk

MarkR

Chuck Kollars

Recent Activity

Donate For Us