How to combine three consecutive lines of text file in sed?

Tags:

sed

I have a file, that consists of a repeating sequence of three lines, that I want to merge together. Put in other words, I'd like to replace every but third \n into space. E.g. I'd like the transform input

 href="file:///home/adam/MyDocs/some_file.pdf"
 visited="2013-06-02T20:40:06Z"
 exec="'firefox %u'"
 href="file:///home/adam/Desktop/FreeRDP-WebConnect-1.0.0.167-Setup.exe"
 visited="2013-06-03T08:50:37Z"
 exec="'firefox %u'"
 href="file:///home/adam/Friends/contact.txt"
 visited="2013-06-03T16:01:16Z"
 exec="'gedit %u'"
 href="file:///home/adam/Pictures/Screenshot%20from%202013-06-03%2019:10:36.png"
 visited="2013-06-03T17:10:36Z"
 exec="'eog %u'"

into

 href="file:///home/adam/MyDocs/some_file.pdf" visited="2013-06-02T20:40:06Z" exec="'firefox %u'"
 href="file:///home/adam/Desktop/FreeRDP-WebConnect-1.0.0.167-Setup.exe" visited="2013-06-03T08:50:37Z" exec="'firefox %u'"
 href="file:///home/adam/Friends/contact.txt" visited="2013-06-03T16:01:16Z" exec="'gedit %u'"
 href="file:///home/adam/Pictures/Screenshot%20from%202013-06-03%2019:10:36.png" visited="2013-06-03T17:10:36Z" exec="'eog %u'"

Unfortunately the file is rather long, so I'd prefer not to load the whole file into memory and not to write to result back into file - just print the concatenated lines into the standard output so I can pipe it further.

I know that potentially sed might just work for it, but after I had given it a honest try, I am still at square one; the learning curve is just too steep for me. :-(

I did a rough benchmarking and I found out, that the sed variant is almost twice as fast.

time awk '{ printf "%s", $0; if (NR % 3 == 0) print ""; else printf " " }' out.txt >/dev/null

real    0m1.893s
user    0m1.860s
sys     0m0.028s

and

time cat out.txt | sed 'N;N;s/\n/ /g' > /dev/null

real    0m1.360s
user    0m1.264s
sys    0m0.236s

It is interesting: why does sed require more kernel time than awk?

The out.txt is 200MB long and the processor is Intel(R) Core(TM) i7-3610QM CPU @ 2.30GHz on Linux-Mint 14 with kernel 3.8.13-030813-generic.

I need this in my effort to parse the recently-used.xbel, the recently opened files list in the Cinnamon

If you came here for this specific problem, this line should help you:

xpath -q  -e "//bookmark[*]/@href | //bookmark[*]/@visited | //bookmark[*]/info/metadata/bookmark:applications[1]/bookmark:application[1]/@exec"  recently-used.xbel | sed 's/href="\(.*\)"/"\1"/;N;s/visited="\(.*\)"/\1/;N;s/exec="\(.*\)"/"\1"/;s/\n/ /g' | xargs -n3 whatever-script-you-write

215

asked Jun 03 '13 21:06

Adam Ryczkowski

2 Answers

how about this:

 sed 'N;N;s/\n/ /g' file

147

answered Oct 22 '22 16:10

Kent

You can use awk to do this pretty easily:

awk '{ printf "%s", $0; if (NR % 3 == 0) print ""; else printf " " }' file

The basic idea is "print each line folowed by a space, unless it's every third line, in which case print a newline".

answered Oct 22 '22 17:10

Carl Norum

Related questions
                            
                                How can I remove all but the last 10 lines from a file?
                            
                                How to escape the ampersand character while using sed
                            
                                strip the last and first character from a String
                            
                                sed command to replace multiple spaces into single spaces
                            
                                Extract lines between two line numbers in shell
                            
                                How to check if the sed command replaced some string? [duplicate]
                            
                                Sed replace with the output of a bash command using capture group as argument
                            
                                how to use sed to extract lines in specified order?
                            
                                Invalid preceding regular expression given by sed
                            
                                How to use a bash script variable with sed [duplicate]
                            
                                Regex Space character in Sed
                            
                                Replace a letter with another from the last word from the last two lines of a text file
                            
                                Remove files not containing a specific string
                            
                                How do I replace a token with the result of `pwd` in sed?
                            
                                how to replace all lines between two points and subtitute it with some text in sed
                            
                                How can I delete all lines before a specific string from a number of files
                            
                                remove white space in bash using sed
                            
                                sed replace last line matching pattern
                            
                                Extracting part of a string to a variable in bash
                            
                                Delete lines from file with SED or AWK

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With