C fgets versus fgetc for reading a line

Tags: c, io, stdio, fgets, fgetc

I need to read a line of text (terminated by a newline) without making assumptions about its length. So I now face two possibilities:

- Use fgets, check after each call whether the last character read is a newline, and keep appending to a buffer until it is
- Read each character using fgetc and occasionally realloc the buffer

Intuition tells me the fgetc variant might be slower, but then again I don't see how fgets can do it without examining every character (and my intuition isn't always that good). The lines are quite long, so performance is important.

I would like to know the pros and cons of each approach. Thank you in advance.

asked Mar 03 '11 20:03

nc3b

2 Answers

I suggest using fgets() coupled with dynamic memory allocation. Alternatively, you can investigate the interface to getline(), which is in the POSIX 2008 standard and available on more recent Linux machines; it does the memory allocation for you. You need to keep track of the buffer length as well as its address, so you might even create a structure to hold both.
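As a rough sketch of the fgets() approach, assuming a starting capacity of 128 bytes and a doubling growth policy (both arbitrary choices): the names struct line_buf and read_line_fgets() are hypothetical, introduced only for this illustration. The structure keeps the buffer address and its capacity together, as suggested above.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper type: buffer address and capacity kept together. */
struct line_buf {
    char  *data;  /* malloc()ed storage, or NULL before first use */
    size_t cap;   /* bytes allocated for data */
};

/*
 * Read one line (including the '\n', if present) from fp into lb.
 * Returns the line length, or -1 at EOF with nothing read or if
 * allocation fails. Assumes the line fits in an int-sized fgets()
 * request and contains no embedded NUL bytes (strlen() is used).
 */
long read_line_fgets(struct line_buf *lb, FILE *fp)
{
    size_t len = 0;

    if (lb->data == NULL) {
        lb->cap = 128;                    /* arbitrary starting size */
        lb->data = malloc(lb->cap);
        if (lb->data == NULL)
            return -1;
    }

    /* Each pass lets fgets() fill the unused tail of the buffer. */
    while (fgets(lb->data + len, (int)(lb->cap - len), fp) != NULL) {
        len += strlen(lb->data + len);
        if (len > 0 && lb->data[len - 1] == '\n')
            return (long)len;             /* complete line */

        /* No newline yet. If the buffer is full, double it and go on;
         * otherwise we are at EOF and the next fgets() returns NULL. */
        if (len + 1 == lb->cap) {
            size_t new_cap = lb->cap * 2;
            char *tmp = realloc(lb->data, new_cap);
            if (tmp == NULL)
                return -1;
            lb->data = tmp;
            lb->cap = new_cap;
        }
    }
    return len > 0 ? (long)len : -1;      /* last line without '\n', or EOF */
}
```

The same struct line_buf can be reused for every line of a file, so the buffer only grows to the size of the longest line seen; the caller frees lb.data once at the end.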

Although fgetc() also works, it is marginally fiddlier - but only marginally so. Under the covers, fgetc() uses the same mechanisms as fgets(). However, the internals of fgets() may be able to exploit faster operations - analogous to strchr() - that are not available when you call fgetc() directly.
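For comparison, here is a minimal sketch of the fgetc() variant. The function name read_line_fgetc(), the 128-byte starting size, and the doubling policy are assumptions made for illustration, not part of any library.

```c
#include <stdio.h>
#include <stdlib.h>

/*
 * Hypothetical helper: read one line with fgetc(), doubling the buffer
 * whenever it fills up. Returns a malloc()ed, NUL-terminated string that
 * the caller must free, or NULL at EOF with nothing read or if allocation
 * fails. On success, *len_out receives the length (including any '\n').
 */
char *read_line_fgetc(FILE *fp, size_t *len_out)
{
    size_t cap = 128;                     /* arbitrary starting size */
    size_t len = 0;
    char *buf = malloc(cap);
    int c;                                /* int, so EOF is distinguishable */

    if (buf == NULL)
        return NULL;

    while ((c = fgetc(fp)) != EOF) {
        if (len + 1 == cap) {             /* keep one byte for the '\0' */
            char *tmp = realloc(buf, cap * 2);
            if (tmp == NULL) {
                free(buf);
                return NULL;
            }
            buf = tmp;
            cap *= 2;
        }
        buf[len++] = (char)c;
        if (c == '\n')
            break;                        /* end of line reached */
    }

    if (len == 0) {                       /* EOF before any character */
        free(buf);
        return NULL;
    }
    buf[len] = '\0';
    *len_out = len;
    return buf;
}
```

The bookkeeping is about the same as with fgets(); the difference is one library call per character instead of one per chunk, which is where any speed difference would come from.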

answered Sep 29 '22 23:09

Jonathan Leffler

Does your environment provide the getline(3) function? If so, I'd say go for that.

The big advantage I see is that it allocates the buffer itself (if you want), and will realloc() the buffer you pass in if it's too small. (This means that any buffer you pass in must have been obtained from malloc().)

This gets rid of some of the pain of fgets/fgetc, and you can hope that whoever wrote the C library that implements it took care of making it efficient.

Bonus: the man page on Linux has a nice example of how to use it in an efficient manner.
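A minimal sketch of the calling pattern, assuming a POSIX 2008 environment: the helper longest_line() is hypothetical and exists only to show how one buffer is reused across calls. Starting with a NULL pointer and a size of 0 asks getline() to allocate the buffer itself.

```c
#define _POSIX_C_SOURCE 200809L  /* expose getline() on POSIX systems */
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>

/*
 * Hypothetical helper: count the lines in fp and return the length of the
 * longest one (the length includes the trailing '\n' when present).
 */
size_t longest_line(FILE *fp, size_t *line_count)
{
    char   *line = NULL;   /* NULL with cap 0: getline() allocates for us */
    size_t  cap = 0;       /* updated by getline() whenever it reallocates */
    ssize_t n;
    size_t  longest = 0;

    *line_count = 0;
    while ((n = getline(&line, &cap, fp)) != -1) {
        if ((size_t)n > longest)
            longest = (size_t)n;
        (*line_count)++;
    }

    free(line);            /* one free() for the buffer reused in every call */
    return longest;
}
```

Because the same line and cap are passed on every iteration, the buffer is only reallocated when a line is longer than any seen so far.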

answered Sep 30 '22 00:09

Mat
