I assume everyone here is familiar with the adage that all text files should end with a newline. I've known of this "rule" for years but I've always wondered — why?

Because that’s how the POSIX standard defines a line: <blockquote> <dl> <dt>3.206 Line</dt> <dd>A sequence of zero or more non- <newline> characters plus a terminating <newline> character.</dd> </dl> </blockquote> Therefore, lines not ending in a newline character aren't considered actual lines. That's why some programs have problems processing the last line of a file if it isn't newline terminated. There's at least one hard advantage to this guideline when working on a terminal emulator: All Unix tools expect this convention and work with it. For instance, when concatenating files with <code>cat</code>, a file terminated by newline will have a different effect than one without: <pre class="prettyprint"><code>$ more a.txt foo $ more b.txt bar$ more c.txt baz $ cat {a,b,c}.txt foo barbaz</code></pre> And, as the previous example also demonstrates, when displaying the file on the command line (e.g. via <code>more</code>), a newline-terminated file results in a correct display. An improperly terminated file might be garbled (second line). For consistency, it’s very helpful to follow this rule – doing otherwise will incur extra work when dealing with the default Unix tools. <hr> Think about it differently: If lines aren’t terminated by newline, making commands such as <code>cat</code> useful is much harder: how do you make a command to concatenate files such that <ol> <li>it puts each file’s start on a new line, which is what you want 95% of the time; but</li> <li>it allows merging the last and first line of two files, as in the example above between <code>b.txt</code> and <code>c.txt</code>?</li> </ol> Of course this is solvable but you need to make the usage of <code>cat</code> more complex (by adding positional command line arguments, e.g. <code>cat a.txt --no-newline b.txt c.txt</code>), and now the command rather than each individual file controls how it is pasted together with other files. This is almost certainly not convenient. … Or you need to introduce a special sentinel character to mark a line that is supposed to be continued rather than terminated. Well, now you’re stuck with the same situation as on POSIX, except inverted (line continuation rather than line termination character). <hr> Now, on non POSIX compliant systems (nowadays that’s mostly Windows), the point is moot: files don’t generally end with a newline, and the (informal) definition of a line might for instance be “text that is separated by newlines” (note the emphasis). This is entirely valid. However, for structured data (e.g. programming code) it makes parsing minimally more complicated: it generally means that parsers have to be rewritten. If a parser was originally written with the POSIX definition in mind, then it might be easier to modify the token stream rather than the parser — in other words, add an “artificial newline” token to the end of the input.

Why should text files end with a newline?

1 Answers

Because that’s how the POSIX standard defines a line:

3.206 Line

A sequence of zero or more non- <newline> characters plus a terminating <newline> character.

Therefore, lines not ending in a newline character aren't considered actual lines. That's why some programs have problems processing the last line of a file if it isn't newline terminated.

There's at least one hard advantage to this guideline when working on a terminal emulator: All Unix tools expect this convention and work with it. For instance, when concatenating files with cat, a file terminated by newline will have a different effect than one without:

$ more a.txt foo $ more b.txt bar$ more c.txt baz $ cat {a,b,c}.txt foo barbaz

And, as the previous example also demonstrates, when displaying the file on the command line (e.g. via more), a newline-terminated file results in a correct display. An improperly terminated file might be garbled (second line).

For consistency, it’s very helpful to follow this rule – doing otherwise will incur extra work when dealing with the default Unix tools.

Think about it differently: If lines aren’t terminated by newline, making commands such as cat useful is much harder: how do you make a command to concatenate files such that

it puts each file’s start on a new line, which is what you want 95% of the time; but
it allows merging the last and first line of two files, as in the example above between b.txt and c.txt?

Of course this is solvable but you need to make the usage of cat more complex (by adding positional command line arguments, e.g. cat a.txt --no-newline b.txt c.txt), and now the command rather than each individual file controls how it is pasted together with other files. This is almost certainly not convenient.

… Or you need to introduce a special sentinel character to mark a line that is supposed to be continued rather than terminated. Well, now you’re stuck with the same situation as on POSIX, except inverted (line continuation rather than line termination character).

_{Now, on non POSIX compliant systems (nowadays that’s mostly Windows), the point is moot: files don’t generally end with a newline, and the (informal) definition of a line might for instance be “text that is separated by newlines” (note the emphasis). This is entirely valid. However, for structured data (e.g. programming code) it makes parsing minimally more complicated: it generally means that parsers have to be rewritten. If a parser was originally written with the POSIX definition in mind, then it might be easier to modify the token stream rather than the parser — in other words, add an “artificial newline” token to the end of the input.}

answered Oct 14 '22 11:10

Konrad Rudolph

Related questions
                            
                                Generate an integer that is not among four billion given ones
                            
                                Rename a file in C#
                            
                                How to upload a file in Django? [closed]
                            
                                How to read all files in a folder from Java?
                            
                                How do I save a String to a text file using Java?
                            
                                TypeError: a bytes-like object is required, not 'str' when writing to a file in Python3
                            
                                Convert DOS line endings to Linux line endings in Vim
                            
                                Limit file format when using <input type="file">?
                            
                                Writing a list to a file with Python
                            
                                How to get full path of a file?
                            
                                "Cross origin requests are only supported for HTTP." error when loading a local file
                            
                                Is there a way to check if a file is in use?
                            
                                How to create a file in memory for user to download, but not through server?
                            
                                How can I check file size in Python?
                            
                                How do I remove/delete a folder that is not empty?
                            
                                How to get file creation & modification date/times?
                            
                                Remove a symlink to a directory
                            
                                How to move a file in Python?
                            
                                Download a single folder or directory from a GitHub repo
                            
                                How do I create a Java string from the contents of a file?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why should text files end with a newline?

Tags:

file

unix

newline

text-files

Will Robertson

People also ask

1 Answers

Konrad Rudolph

Recent Activity

Donate For Us