Will read() ever block after select()?

I'm reading a stream of data through a TCP/IP socket. The load is very uneven: sometimes large bursts of data arrive every second, sometimes no data comes for an hour. After a long period of inactivity (no data from the remote server, but the connection is still up) my program should take some action.

I'm implementing the timeout with select(). It tells me whether data is ready, but I don't know how much I can read without causing read() to block. Blocking is unacceptable, as it may last far longer than the timeout I need.

For efficiency, the stream is read into a large buffer, and read() is called with that buffer's size.

Will read() block after select() if the buffer to be filled is larger than the amount of data currently available on the socket?

asked Mar 18 '11 12:03

Basilevs

3 Answers

In principle it should not block (that is what select() is for!), but in practice it might, in exceptional cases. Normally, read() returns as many bytes as are available, up to the maximum you've specified, without waiting for the buffer to fill. The result can even be zero bytes, which is valid and on a stream socket means the peer has closed the connection. Either way, it should never block after select() has reported readiness.
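To make the normal case concrete, here is a minimal sketch of a short read. It assumes a local AF_UNIX stream socket pair as a stand-in for the TCP connection, and read_after_select() is a hypothetical helper name, not something from the question.

```c
#include <sys/select.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <unistd.h>

/* Writes `avail` bytes into one end of a local socket pair, waits for the
 * other end with select(), then asks read() for up to `bufsize` bytes.
 * Returns whatever read() returned, or -1 on any failure or timeout.
 * Assumption: an AF_UNIX stream socket stands in for the TCP socket. */
ssize_t read_after_select(size_t avail, size_t bufsize)
{
    static char buf[65536];
    static const char payload[64] = "hello";   /* remainder is zero-filled */
    int sv[2];
    ssize_t n = -1;

    if (avail > sizeof payload || bufsize > sizeof buf)
        return -1;
    if (socketpair(AF_UNIX, SOCK_STREAM, 0, sv) < 0)
        return -1;

    if (write(sv[0], payload, avail) == (ssize_t)avail) {
        fd_set rfds;
        struct timeval tv = { .tv_sec = 1, .tv_usec = 0 };

        FD_ZERO(&rfds);
        FD_SET(sv[1], &rfds);
        /* The socket is still in blocking mode here. */
        if (select(sv[1] + 1, &rfds, NULL, NULL, &tv) == 1)
            n = read(sv[1], buf, bufsize);
    }
    close(sv[0]);
    close(sv[1]);
    return n;
}
```

With 5 bytes queued and a 4096-byte buffer, the call should come back promptly with 5 rather than waiting for the buffer to fill. This only illustrates the ordinary case; it says nothing about the spurious-readiness case from the man page.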

Nevertheless, see the Linux select man page:

Under Linux, select() may report a socket file descriptor as "ready for reading", while nevertheless a subsequent read blocks. This could for example happen when data has arrived but upon examination has wrong checksum and is discarded. There may be other circumstances in which a file descriptor is spuriously reported as ready. Thus it may be safer to use O_NONBLOCK on sockets that should not block.

answered Oct 23 '22 05:10

Damon

There is O_NONBLOCK, which can be set with fcntl() and F_SETFL and makes read() return immediately instead of blocking.
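A minimal sketch of that, assuming you already have an open descriptor; set_nonblocking() is just an illustrative name:

```c
#include <fcntl.h>

/* Adds O_NONBLOCK to the descriptor's existing status flags, leaving the
 * other flags untouched.
 * Returns 0 on success, -1 on failure (errno is set by fcntl). */
int set_nonblocking(int fd)
{
    int flags = fcntl(fd, F_GETFL, 0);

    if (flags == -1)
        return -1;
    return fcntl(fd, F_SETFL, flags | O_NONBLOCK) == -1 ? -1 : 0;
}
```

Reading the current flags first with F_GETFL avoids clearing flags that were already set on the descriptor. After this, a read() with nothing available fails with EAGAIN (or EWOULDBLOCK) instead of waiting.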

answered Oct 23 '22 06:10

ony

A blocking file descriptor will block on read() until there is something to read, which could be one byte or your entire request. A non-blocking descriptor won't block on read() if there is nothing to read; the call fails immediately with EAGAIN (or EWOULDBLOCK) instead. select() is not read(). It puts the process to sleep and monitors the file descriptor(s), including non-blocking ones. When there is activity on one of the descriptors (or the timeout expires), select() returns and you can read your data, or do something else in the case of a timeout.

So you have two separate issues. (1) You want to "take some actions" when there is no data. That's the select() timeout. (2) Once you have data (as notified by select()) you don't want to block on a read. That's the non-blocking mode. When you get EAGAIN on the non-blocking read, you loop back to select(); when select() times out, you "take some actions" and then loop back to select().
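One pass of that loop might look roughly like the sketch below. The names wait_and_drain() and enum wait_result are made up for illustration, the 4096-byte buffer is arbitrary, and the data is simply counted and discarded where a real program would process it.

```c
#include <errno.h>
#include <fcntl.h>
#include <sys/select.h>
#include <sys/types.h>
#include <unistd.h>

enum wait_result { GOT_DATA, TIMED_OUT, SPURIOUS_WAKEUP, PEER_CLOSED, FAILED };

/* Waits up to timeout_sec for fd to become readable, then reads until the
 * descriptor reports EAGAIN. *total receives the bytes read in this call.
 * Assumptions: fd < FD_SETSIZE and fd already has O_NONBLOCK set; a
 * blocking fd is rejected with FAILED because the drain loop would stall. */
enum wait_result wait_and_drain(int fd, int timeout_sec, size_t *total)
{
    char buf[4096];
    fd_set rfds;
    struct timeval tv;
    int flags, ready;

    *total = 0;
    flags = fcntl(fd, F_GETFL, 0);
    if (flags == -1 || !(flags & O_NONBLOCK))
        return FAILED;

    do {
        /* Re-initialise both arguments: select() may modify them.
         * Simplification: a signal restarts the full timeout. */
        FD_ZERO(&rfds);
        FD_SET(fd, &rfds);
        tv.tv_sec = timeout_sec;
        tv.tv_usec = 0;
        ready = select(fd + 1, &rfds, NULL, NULL, &tv);
    } while (ready == -1 && errno == EINTR);

    if (ready == -1)
        return FAILED;
    if (ready == 0)
        return TIMED_OUT;            /* issue (1): take the inactivity action */

    for (;;) {                       /* issue (2): read without blocking */
        ssize_t n = read(fd, buf, sizeof buf);

        if (n > 0) {
            *total += (size_t)n;     /* a real program would process buf here */
            continue;
        }
        if (n == 0)
            return PEER_CLOSED;      /* end of stream */
        if (errno == EINTR)
            continue;
        if (errno == EAGAIN || errno == EWOULDBLOCK)
            return *total > 0 ? GOT_DATA : SPURIOUS_WAKEUP;
        return FAILED;
    }
}
```

The caller would invoke this in a loop: on TIMED_OUT it takes its inactivity action, and on GOT_DATA or SPURIOUS_WAKEUP it simply goes back to waiting. PEER_CLOSED and FAILED end the loop.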

answered Oct 23 '22 06:10

Duck

Tags: linux, sockets
