How to join lines not starting with specific pattern to the previous line in UNIX?

Tags:

Please take a look at the sample file and the desired output below to understand what I am looking for.

It can be done with loops in a shell script but I am struggling to get an awk/sed one liner.

SampleFile.txt

These are leaves.
These are branches.
These are greenery which gives
oxygen, provides control over temperature
and maintains cleans the air.
These are tigers
These are bears
and deer and squirrels and other animals.
These are something you want to kill
Which will see you killed in the end.
These are things you must to think to save your tomorrow.

Desired output

These are leaves.
These are branches.
These are greenery which gives oxygen, provides control over temperature and maintains cleans the air.
These are tigers
These are bears and deer and squirrels and other animals.
These are something you want to kill Which will see you killed in the end.
These are things you must to think to save your tomorrow.

453

asked Jun 21 '16 16:06

instinct246

2 Answers

With sed:

sed ':a;N;/\nThese/!s/\n/ /;ta;P;D' infile

resulting in

These are leaves.
These are branches.
These are greenery which gives oxygen, provides control over temperature and maintains cleans the air.
These are tigers
These are bears and deer and squirrels and other animals.
These are something you want to kill Which will see you killed in the end.
These are things you must to think to save your tomorrow.

Here is how it works:

sed '
:a                   # Label to jump to
N                    # Append next line to pattern space
/\nThese/!s/\n/ /    # If the newline is NOT followed by "These", append
                     # the line by replacing the newline with a space
ta                   # If we changed something, jump to label
P                    # Print part until newline
D                    # Delete part until newline
' infile

The N;P;D is the idiomatic way of keeping multiple lines in the pattern space; the conditional branching part takes care of the situation where we append more than one line.

This works with GNU sed; for other seds like the one found in Mac OS, the oneliner has to be split up so branching and label are in separate commands, the newlines may have to be escaped, and we need an extra semicolon:

sed -e ':a' -e 'N;/'$'\n''These/!s/'$'\n''/ /;ta' -e 'P;D;' infile

_{This last command is untested; see this answer for differences between different seds and how to handle them.}

Another alternative is to enter the newlines literally:

sed -e ':a' -e 'N;/\
These/!s/\
/ /;ta' -e 'P;D;' infile

But then, by definition, it's no longer a one-liner.

181

answered Oct 16 '22 07:10

Benjamin W.

Please try the following:

awk 'BEGIN {accum_line = "";} /^These/{if(length(accum_line)){print accum_line; accum_line = "";}} {accum_line = accum_line " " $0;} END {if(length(accum_line)){print accum_line; }}' < data.txt

The code consists of three parts:

The block marked by BEGIN is executed before anything else. It's useful for global initialization
The block marked by END is executed when the regular processing finished. It is good for wrapping the things. Like printing the last collected data if this line has no These at the beginning (this case)
The rest is the code performed for each line. First, the pattern is searched for and the relevant things are done. Second, data collection is done regardless of the string contents.

answered Oct 16 '22 06:10

GMichael

Related questions
                            
                                Bash Scripting - shell command output redirection
                            
                                Is there a simple way in linux to strip a website of text from command line?
                            
                                Alternative to scp, transferring files between linux machines by opening parallel connections
                            
                                Spaces in path names giving trouble with Find in Bash. Any *simple* work-around?
                            
                                Bulk renaming of files based on lookup
                            
                                using bash command in perl
                            
                                Perl line runs 30 times quicker with single quotes than with double quotes
                            
                                Linux: Bash: what does mkdir return
                            
                                Need to remove the count from the output when using "uniq -c" command
                            
                                bash parse filename
                            
                                How to find the Set - Subset of two files from the command line?
                            
                                How to iterate through all ASCII characters in Bash?
                            
                                Create PostgreSQL backup files with timestamp
                            
                                bash awk first 1st column and 3rd column with everything after
                            
                                what does ## in shell script means [duplicate]
                            
                                Removing Lines and columns with all zeros
                            
                                Running vi within a bash script and executing vi commands to edit another file
                            
                                bash round minutes to 5
                            
                                How can I *only* get the number of bytes available on a disk in bash?
                            
                                How to get the complete calling command of a BASH script from inside the script (not just the arguments)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to join lines not starting with specific pattern to the previous line in UNIX?

Tags:

bash

shell

unix

sed

awk

instinct246

People also ask

2 Answers

Benjamin W.

GMichael

Recent Activity

Donate For Us