How do I iterate through each line of a text file with Bash? With this script: <pre class="prettyprint"><code>echo "Start!" for p in (peptides.txt) do echo "${p}" done </code></pre> I get this output on the screen: <pre class="prettyprint"><code>Start! ./runPep.sh: line 3: syntax error near unexpected token `(' ./runPep.sh: line 3: `for p in (peptides.txt)' </code></pre> (Later I want to do something more complicated with <code>$p</code> than just output to the screen.) <hr> The environment variable SHELL is (from env): <pre class="prettyprint"><code>SHELL=/bin/bash </code></pre> <code>/bin/bash --version</code> output: <pre class="prettyprint"><code>GNU bash, version 3.1.17(1)-release (x86_64-suse-linux-gnu) Copyright (C) 2005 Free Software Foundation, Inc. </code></pre> <code>cat /proc/version</code> output: <pre class="prettyprint"><code>Linux version 2.6.18.2-34-default (geeko@buildhost) (gcc version 4.1.2 20061115 (prerelease) (SUSE Linux)) #1 SMP Mon Nov 27 11:46:27 UTC 2006 </code></pre> The file peptides.txt contains: <pre class="prettyprint"><code>RKEKNVQ IPKKLLQK QYFHQLEKMNVK IPKKLLQK GDLSTALEVAIDCYEK QYFHQLEKMNVKIPENIYR RKEKNVQ VLAKHGKLQDAIN ILGFMK LEDVALQILL </code></pre>

One way to do it is: <pre class="prettyprint"><code>while read p; do echo "$p" done <peptides.txt </code></pre> As pointed out in the comments, this has the side effects of trimming leading whitespace, interpreting backslash sequences, and skipping the last line if it's missing a terminating linefeed. If these are concerns, you can do: <pre class="prettyprint"><code>while IFS="" read -r p || [ -n "$p" ] do printf '%s\n' "$p" done < peptides.txt </code></pre> <hr> Exceptionally, if the loop body may read from standard input, you can open the file using a different file descriptor: <pre class="prettyprint"><code>while read -u 10 p; do ... done 10<peptides.txt </code></pre> Here, 10 is just an arbitrary number (different from 0, 1, 2).

<pre class="prettyprint"><code>cat peptides.txt | while read line do # do something with $line here done </code></pre> and the one-liner variant: <pre class="prettyprint"><code>cat peptides.txt | while read line; do something_with_$line_here; done </code></pre> These options will skip the last line of the file if there is no trailing line feed. You can avoid this by the following: <pre class="prettyprint"><code>cat peptides.txt | while read line || [[ -n $line ]]; do # do something with $line here done </code></pre>

Option 1a: While loop: Single line at a time: Input redirection <pre class="prettyprint"><code>#!/bin/bash filename='peptides.txt' echo Start while read p; do echo "$p" done < "$filename" </code></pre> Option 1b: While loop: Single line at a time: Open the file, read from a file descriptor (in this case file descriptor #4). <pre class="prettyprint"><code>#!/bin/bash filename='peptides.txt' exec 4<"$filename" echo Start while read -u4 p ; do echo "$p" done </code></pre>

This is no better than other answers, but is one more way to get the job done in a file without spaces (see comments). I find that I often need one-liners to dig through lists in text files without the extra step of using separate script files. <pre class="prettyprint"><code>for word in $(cat peptides.txt); do echo $word; done </code></pre> This format allows me to put it all in one command-line. Change the "echo $word" portion to whatever you want and you can issue multiple commands separated by semicolons. The following example uses the file's contents as arguments into two other scripts you may have written. <pre class="prettyprint"><code>for word in $(cat peptides.txt); do cmd_a.sh $word; cmd_b.py $word; done </code></pre> Or if you intend to use this like a stream editor (learn sed) you can dump the output to another file as follows. <pre class="prettyprint"><code>for word in $(cat peptides.txt); do cmd_a.sh $word; cmd_b.py $word; done > outfile.txt </code></pre> I've used these as written above because I have used text files where I've created them with one word per line. (See comments) If you have spaces that you don't want splitting your words/lines, it gets a little uglier, but the same command still works as follows: <pre class="prettyprint"><code>OLDIFS=$IFS; IFS=$'\n'; for line in $(cat peptides.txt); do cmd_a.sh $line; cmd_b.py $line; done > outfile.txt; IFS=$OLDIFS </code></pre> This just tells the shell to split on newlines only, not spaces, then returns the environment back to what it was previously. At this point, you may want to consider putting it all into a shell script rather than squeezing it all into a single line, though. Best of luck!

Looping through the content of a file in Bash

Tags:

linux

bash

loops

io

unix

How do I iterate through each line of a text file with Bash?

With this script:

echo "Start!"
for p in (peptides.txt)
do
    echo "${p}"
done

I get this output on the screen:

Start!
./runPep.sh: line 3: syntax error near unexpected token `('
./runPep.sh: line 3: `for p in (peptides.txt)'

(Later I want to do something more complicated with $p than just output to the screen.)

The environment variable SHELL is (from env):

SHELL=/bin/bash

/bin/bash --version output:

GNU bash, version 3.1.17(1)-release (x86_64-suse-linux-gnu)
Copyright (C) 2005 Free Software Foundation, Inc.

cat /proc/version output:

Linux version 2.6.18.2-34-default (geeko@buildhost) (gcc version 4.1.2 20061115 (prerelease) (SUSE Linux)) #1 SMP Mon Nov 27 11:46:27 UTC 2006

The file peptides.txt contains:

RKEKNVQ
IPKKLLQK
QYFHQLEKMNVK
IPKKLLQK
GDLSTALEVAIDCYEK
QYFHQLEKMNVKIPENIYR
RKEKNVQ
VLAKHGKLQDAIN
ILGFMK
LEDVALQILL

907

asked Oct 05 '09 17:10

Peter Mortensen

4 Answers

One way to do it is:

while read p; do
  echo "$p"
done <peptides.txt

As pointed out in the comments, this has the side effects of trimming leading whitespace, interpreting backslash sequences, and skipping the last line if it's missing a terminating linefeed. If these are concerns, you can do:

while IFS="" read -r p || [ -n "$p" ]
do
  printf '%s\n' "$p"
done < peptides.txt

Exceptionally, if the loop body may read from standard input, you can open the file using a different file descriptor:

while read -u 10 p; do
  ...
done 10<peptides.txt

Here, 10 is just an arbitrary number (different from 0, 1, 2).

answered Oct 28 '22 16:10

Bruno De Fraine

cat peptides.txt | while read line 
do
   # do something with $line here
done

and the one-liner variant:

cat peptides.txt | while read line; do something_with_$line_here; done

These options will skip the last line of the file if there is no trailing line feed.

You can avoid this by the following:

cat peptides.txt | while read line || [[ -n $line ]];
do
   # do something with $line here
done

answered Oct 28 '22 15:10

Warren Young

Option 1a: While loop: Single line at a time: Input redirection

#!/bin/bash
filename='peptides.txt'
echo Start
while read p; do 
    echo "$p"
done < "$filename"

Option 1b: While loop: Single line at a time:
Open the file, read from a file descriptor (in this case file descriptor #4).

#!/bin/bash
filename='peptides.txt'
exec 4<"$filename"
echo Start
while read -u4 p ; do
    echo "$p"
done

213

answered Oct 28 '22 17:10

Stan Graves

This is no better than other answers, but is one more way to get the job done in a file without spaces (see comments). I find that I often need one-liners to dig through lists in text files without the extra step of using separate script files.

for word in $(cat peptides.txt); do echo $word; done

This format allows me to put it all in one command-line. Change the "echo $word" portion to whatever you want and you can issue multiple commands separated by semicolons. The following example uses the file's contents as arguments into two other scripts you may have written.

for word in $(cat peptides.txt); do cmd_a.sh $word; cmd_b.py $word; done

Or if you intend to use this like a stream editor (learn sed) you can dump the output to another file as follows.

for word in $(cat peptides.txt); do cmd_a.sh $word; cmd_b.py $word; done > outfile.txt

I've used these as written above because I have used text files where I've created them with one word per line. (See comments) If you have spaces that you don't want splitting your words/lines, it gets a little uglier, but the same command still works as follows:

OLDIFS=$IFS; IFS=$'\n'; for line in $(cat peptides.txt); do cmd_a.sh $line; cmd_b.py $line; done > outfile.txt; IFS=$OLDIFS

This just tells the shell to split on newlines only, not spaces, then returns the environment back to what it was previously. At this point, you may want to consider putting it all into a shell script rather than squeezing it all into a single line, though.

Best of luck!

139