The socket API is the de-facto standard for TCP/IP and UDP/IP communications (that is, networking code as we know it). However, one of its core functions, <code>accept()</code> is a bit magical. To borrow a semi-formal definition: <blockquote> accept() is used on the server side. It accepts a received incoming attempt to create a new TCP connection from the remote client, and creates a new socket associated with the socket address pair of this connection. </blockquote> In other words, <code>accept</code> returns a new socket through which the server can communicate with the newly connected client. The old socket (on which <code>accept</code> was called) stays open, on the same port, listening for new connections. How does <code>accept</code> work? How is it implemented? There's a lot of confusion on this topic. Many people claim accept opens a new port and you communicate with the client through it. But this obviously isn't true, as no new port is opened. You actually can communicate through the same port with different clients, but how? When several threads call <code>recv</code> on the same port, how does the data know where to go? I guess it's something along the lines of the client's address being associated with a socket descriptor, and whenever data comes through <code>recv</code> it's routed to the correct socket, but I'm not sure. It'd be great to get a thorough explanation of the inner-workings of this mechanism.

Your confusion lies in thinking that a socket is identified by Server IP : Server Port. When in actuality, sockets are uniquely identified by a quartet of information: <code>Client IP : Client Port</code> and <code>Server IP : Server Port</code> So while the Server IP and Server Port are constant in all accepted connections, the client side information is what allows it to keep track of where everything is going. Example to clarify things: Say we have a server at <code>192.168.1.1:80</code> and two clients, <code>10.0.0.1</code> and <code>10.0.0.2</code>. <code>10.0.0.1</code> opens a connection on local port <code>1234</code> and connects to the server. Now the server has one socket identified as follows: <pre class="prettyprint"><code>10.0.0.1:1234 - 192.168.1.1:80 </code></pre> Now <code>10.0.0.2</code> opens a connection on local port <code>5678</code> and connects to the server. Now the server has two sockets identified as follows: <pre class="prettyprint"><code>10.0.0.1:1234 - 192.168.1.1:80 10.0.0.2:5678 - 192.168.1.1:80 </code></pre>

How does the socket API accept() function work?

Tags:

networking

tcp

sockets

The socket API is the de-facto standard for TCP/IP and UDP/IP communications (that is, networking code as we know it). However, one of its core functions, accept() is a bit magical.

To borrow a semi-formal definition:

accept() is used on the server side. It accepts a received incoming attempt to create a new TCP connection from the remote client, and creates a new socket associated with the socket address pair of this connection.

In other words, accept returns a new socket through which the server can communicate with the newly connected client. The old socket (on which accept was called) stays open, on the same port, listening for new connections.

How does accept work? How is it implemented? There's a lot of confusion on this topic. Many people claim accept opens a new port and you communicate with the client through it. But this obviously isn't true, as no new port is opened. You actually can communicate through the same port with different clients, but how? When several threads call recv on the same port, how does the data know where to go?

I guess it's something along the lines of the client's address being associated with a socket descriptor, and whenever data comes through recv it's routed to the correct socket, but I'm not sure.

It'd be great to get a thorough explanation of the inner-workings of this mechanism.

925

asked Jan 28 '09 19:01

Eli Bendersky

2 Answers

Your confusion lies in thinking that a socket is identified by Server IP : Server Port. When in actuality, sockets are uniquely identified by a quartet of information:

Client IP : Client Port and Server IP : Server Port

So while the Server IP and Server Port are constant in all accepted connections, the client side information is what allows it to keep track of where everything is going.

Example to clarify things:

Say we have a server at 192.168.1.1:80 and two clients, 10.0.0.1 and 10.0.0.2.

10.0.0.1 opens a connection on local port 1234 and connects to the server. Now the server has one socket identified as follows:

10.0.0.1:1234 - 192.168.1.1:80

Now 10.0.0.2 opens a connection on local port 5678 and connects to the server. Now the server has two sockets identified as follows:

10.0.0.1:1234 - 192.168.1.1:80   10.0.0.2:5678 - 192.168.1.1:80

104

answered Oct 02 '22 12:10

17 of 26

Just to add to the answer given by user "17 of 26"

The socket actually consists of 5 tuple - (source ip, source port, destination ip, destination port, protocol). Here the protocol could TCP or UDP or any transport layer protocol. This protocol is identified in the packet from the 'protocol' field in the IP datagram.

Thus it is possible to have to different applications on the server communicating to to the same client on exactly the same 4-tuples but different in protocol field. For example

Apache at server side talking on (server1.com:880-client1:1234 on TCP) and World of Warcraft talking on (server1.com:880-client1:1234 on UDP)

Both the client and server will handle this as protocol field in the IP packet in both cases is different even if all the other 4 fields are same.

answered Oct 02 '22 13:10

Methos

Related questions
                            
                                Viewing my IIS hosted site on other machines on my network
                            
                                How do I find out which computer is the domain controller in Windows programmatically?
                            
                                Java socket API: How to tell if a connection has been closed?
                            
                                Getting name of windows computer running python script?
                            
                                TCP vs UDP on video stream
                            
                                Regular expression that matches valid IPv6 addresses
                            
                                Parsing IPv6 extension headers containing unknown extensions
                            
                                Best TCP port number range for internal applications [closed]
                            
                                How do I check for a network connection?
                            
                                How can I kill whatever process is using port 8080 so that I can vagrant up?
                            
                                How can I check if an ip is in a network in Python?
                            
                                How should one go about choosing a default TCP/IP port for a new service?
                            
                                Easiest way to read from a URL into a string in .NET
                            
                                Can TCP and UDP sockets use the same port?
                            
                                Getting MAC Address
                            
                                How to overcome root domain CNAME restrictions?
                            
                                How do I access the host machine from the guest machine? [closed]
                            
                                Find UNC path of a network drive?
                            
                                How to monitor network calls made from iOS Simulator
                            
                                Wireshark localhost traffic capture [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With