How to hand-over a TCP listening socket with minimal downtime?

Some background:

I have an application listening on a TCP socket. It is started and shut down with a regular System V style init script.

My problem is that it needs some time to start up before it is ready to service the TCP socket. It's not too long, perhaps only 5 seconds, but that's 5 seconds too long when a restart needs to be performed during a workday. It's also crucial that existing connections remain open and are finished normally.

Reasons for a restart of the application are patches, upgrades, and the like. I unfortunately find myself in the position that, every once in a while, I need to do this kind of thing in production.

The question:

I'm looking for a way to do a neat hand-over of the TCP listening socket, from one process to another, and as a result get only a split second of downtime. I'd like existing connections / sockets to remain open and finish processing in the old process, while the new process starts servicing new connectinos.

Is there some proven method of doing this using BSD-sockets? (Bonus points for an EventMachine solution.)

Are there perhaps open-source libraries out there implementing this, that I can use as is, or use as a reference? (Again, non-Ruby and non-EventMachine solutions are appreciated too!)

577

asked Feb 05 '10 11:02

Stéphan Kochen

1 Answers

There are a couple of ways to do this with no downtime, with appropriate modifications to the server program.

One is to implement a restart capability in the server itself, for example upon receipt of a certain signal or other message. The program would then exec its new version, passing it the file descriptor number of the listening socket e.g. as an argument. This socket would have the FD_CLOEXEC flag clear (the default) so that it would be inherited. Since the other sockets will continue to be serviced by the original process and should not be passed on to the new process, the flag should be set on those e.g. using fcntl(). After forking and execing the new process, the original process can go ahead and close the listening socket without any interruption to the service, since the new process is now listening on that socket.

An alternative method, if you do not want the old server to have to fork and exec the new server itself, would be to use a Unix-domain socket to communicate between the old and new server process. A new server process could check for such a socket in a well-known location in the file system when it is starting. If present, the new server would connect to this socket and request that the old server transfer its listening socket as ancillary data using SCM_RIGHTS. An example of this is given at the end of cmsg(3).

answered Nov 14 '22 19:11

mark4o

Related questions
                            
                                How to check in PHP, if a socket still connected, if I don't have the socket handler?
                            
                                How to handle a Thread Issue in ZeroMQ + Ruby?
                            
                                Android VpnService protect socket that's stored in native code?
                            
                                How to make 2 clients connect each other directly, after having both connected a meeting-point server?
                            
                                Best practice to detect a client disconnection in .NET?
                            
                                How do I handle this pointer in getaddrinfo?
                            
                                Very Strange Problem sending data via Sockets in C#
                            
                                How to receive multicast data on a multihomed server's non-default interface
                            
                                Check for incoming data in Java Socket
                            
                                Java <-> C Bridge
                            
                                Do I need a PORT when joining a multicast group or just the IP?
                            
                                Sending an ArrayList<String> from the server side to the client side over TCP using socket?
                            
                                ZeroMQ securely over the internet
                            
                                How to know the end of FTP Welcome message
                            
                                Zookeeper Network Ensemble does not start appropiately
                            
                                How can I determine if a non-blocking socket is really connected?
                            
                                Decode video in Cuda using a socket / memory instead of a file
                            
                                HTTP multipart/form-data. What happends when binary data has no string representation?
                            
                                Good Client Socket Pool
                            
                                How do I save sockets in a hash and loop over them from another thread?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to hand-over a TCP listening socket with minimal downtime?

Tags:

sockets

eventmachine

high-availability