I usually use the read command to read an input file to the shell script line by line. An example code such as the one below yields a wrong result if a new line isn't inserted at the end of the last line in the input file, blah.txt. <pre class="prettyprint"><code>#!/bin/sh while read line do echo $line done <blah.txt </code></pre> So if the input file reads something like - <pre class="prettyprint"><code>One Two Three Four </code></pre> and I do not hit return after Four, the script fails to read the last line, and prints <pre class="prettyprint"><code>One Two Three </code></pre> Now if I leave an extra blank line after Four, like, <pre class="prettyprint"><code>One Two Three Four //blank line </code></pre> the output prints all the lines, including Four. However, this is not the case when I read a line using the <code>cat</code> command; all lines including the last get printed without me having to add an extra blank line at the end. Anyone has ideas on why this happens? The scripts I create will mostly be run by others, so it isn't necessary they're going to add an extra blank line at the end of every input file. I've been trying to figure this out for ages; I'd appreciate it if you have any solutions(of course, the <code>cat</code> command is one, but I'd like to know the reason behind read not working as well).

<code>read</code> reads until it finds a newline character or the end of file, and returns a non-zero exit code if it encounters an end-of-file. So it's quite possible for it to both read a line and return a non-zero exit code. Consequently, the following code is not safe if the input might not be terminated by a newline: <pre class="prettyprint"><code>while read LINE; do # do something with LINE done </code></pre> because the body of the <code>while</code> won't be executed on the last line. Technically speaking, a file not terminated with a newline is not a text file, and text tools may fail in odd ways on such a file. However, I'm always reluctant to fall back on that explanation. One way to solve the problem is to test if what was read is non-empty (<code>-n</code>): <pre class="prettyprint"><code>while read -r LINE || [[ -n $LINE ]]; do # do something with LINE done </code></pre> Other solutions include using <code>mapfile</code> to read the file into an array, piping the file through some utility which is guaranteed to terminate the last line properly (<code>grep .</code>, for example, if you don't want to deal with blank lines), or doing the iterative processing with a tool like <code>awk</code> (which is usually my preference). Note that <code>-r</code> is almost certainly needed in the <code>read</code> builtin; it causes <code>read</code> to not reinterpret <code>\</code>-sequences in the input.

Reading input files by line using read command in shell scripting skips last line

Q: How read file line by line in shell script and store each line in a variable?

We use the read command with -r argument to read the contents without escaping the backslash character. We read the content of each line and store that in the variable line and inside the while loop we echo with a formatted -e argument to use special characters like \n and print the contents of the line variable.

Tags:

I usually use the read command to read an input file to the shell script line by line. An example code such as the one below yields a wrong result if a new line isn't inserted at the end of the last line in the input file, blah.txt.

#!/bin/sh

while read line
do
echo $line
done <blah.txt

So if the input file reads something like -

One 
Two
Three
Four

and I do not hit return after Four, the script fails to read the last line, and prints

One
Two
Three

Now if I leave an extra blank line after Four, like,

One 
Two
Three
Four
//blank line

the output prints all the lines, including Four. However, this is not the case when I read a line using the cat command; all lines including the last get printed without me having to add an extra blank line at the end.

Anyone has ideas on why this happens? The scripts I create will mostly be run by others, so it isn't necessary they're going to add an extra blank line at the end of every input file.

I've been trying to figure this out for ages; I'd appreciate it if you have any solutions(of course, the cat command is one, but I'd like to know the reason behind read not working as well).

601

asked Jun 24 '13 04:06

Caife

1 Answers

read reads until it finds a newline character or the end of file, and returns a non-zero exit code if it encounters an end-of-file. So it's quite possible for it to both read a line and return a non-zero exit code.

Consequently, the following code is not safe if the input might not be terminated by a newline:

while read LINE; do
  # do something with LINE
done

because the body of the while won't be executed on the last line.

Technically speaking, a file not terminated with a newline is not a text file, and text tools may fail in odd ways on such a file. However, I'm always reluctant to fall back on that explanation.

One way to solve the problem is to test if what was read is non-empty (-n):

while read -r LINE || [[ -n $LINE ]]; do
  # do something with LINE
done

Other solutions include using mapfile to read the file into an array, piping the file through some utility which is guaranteed to terminate the last line properly (grep ., for example, if you don't want to deal with blank lines), or doing the iterative processing with a tool like awk (which is usually my preference).

Note that -r is almost certainly needed in the read builtin; it causes read to not reinterpret \-sequences in the input.

171

answered Oct 01 '22 17:10

rici

Related questions
                            
                                Calling `[UIView -systemLayoutSizeFittingSize:]` on a UITableViewCell always fails
                            
                                Cannot implicitly convert type '.List<AnonymousType#1>' to '.List<WebApplication2.Customer>'
                            
                                JEE7: Do EJB and CDI beans support container-managed transactions?
                            
                                Best Way to store configuration setting inside my asp.net mvc application
                            
                                Error compiling a Groovy project using @Grab annotation
                            
                                Cumsum reset at NaN
                            
                                CSS placing one image on top of another
                            
                                Strange array initialize expression?
                            
                                Is there a SCP alternative for PowerShell?
                            
                                How to convert a formatted local time to epoch?
                            
                                NodeJS required module not available in other modules
                            
                                What is the difference between the :before_save and :before_update Active Record callbacks?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With