I can't figure how to tell sed dot match new line: <code>echo -e "one\ntwo\nthree" | sed 's/one.*two/one/m'</code> I expect to get: <blockquote> one three </blockquote> instead I get original: <blockquote> one two three </blockquote>

If you use a GNU <code>sed</code>, you may match any character, including line break chars, with a mere <code>.</code>, see : <blockquote> <code>.</code> Matches any character, including newline. </blockquote> All you need to use is a <code>-z</code> option: <pre class="prettyprint"><code>echo -e "one\ntwo\nthree" | sed -z 's/one.*two/one/' # => one # three </code></pre> See the online <code>sed</code> demo. However, <code>one.*two</code> might not be what you need since <code>*</code> is always greedy in POSIX regex patterns. So, <code>one.*two</code> will match the leftmost <code>one</code>, then any 0 or more chars as many as possible, and then the rightmost <code>two</code>. If you need to remove <code>one</code>, then any 0+ chars as few as possible, and then the leftmost <code>two</code>, you will have to use <code>perl</code>: <pre class="prettyprint"><code>perl -i -0 -pe 's/one.*?two//sg' file # Non-Unicode version perl -i -CSD -Mutf8 -0 -pe 's/one.*?two//sg' file # S&R in a UTF8 file </code></pre> The <code>-0</code> option enables the slurp mode so that the file could be read as a whole and not line-by-line, <code>-i</code> will enable inline file modification, <code>s</code> will make <code>.</code> match any char including line break chars, and <code>.*?</code> will match any 0 or more chars as few as possible due to a non-greedy <code>*?</code>. The <code>-CSD -Mutf8</code> part make sure your input is decoded and output re-encoded back correctly.

<code>sed</code> is line-based tool. I don't think these is an option. You can use <code>h/H</code>(hold), <code>g/G</code>(get). <pre class="prettyprint"><code>$ echo -e 'one\ntwo\nthree' | sed -n '1h;1!H;${g;s/one.*two/one/p}' one three </code></pre> Maybe you should try <code>vim</code> <pre class="prettyprint"><code>:%s/one\_.*two/one/g </code></pre>

This might work for you: <pre class="prettyprint"><code><<<$'one\ntwo\nthree' sed '/two/d' </code></pre> or <pre class="prettyprint"><code><<<$'one\ntwo\nthree' sed '2d' </code></pre> or <pre class="prettyprint"><code><<<$'one\ntwo\nthree' sed 'n;d' </code></pre> or <pre class="prettyprint"><code><<<$'one\ntwo\nthree' sed 'N;N;s/two.//' </code></pre> <code>Sed</code> does match all characters (including the <code>\n</code>) using a dot <code>.</code> but usually it has already stripped the <code>\n</code> off, as part of the cycle, so it no longer present in the pattern space to be matched. Only certain commands (<code>N</code>,<code>H</code> and <code>G</code>) preserve newlines in the pattern/hold space. <ol> <li> <code>N</code> appends a newline to the pattern space and then appends the next line.</li> <li> <code>H</code> does exactly the same except it acts on the hold space.</li> <li> <code>G</code> appends a newline to the pattern space and then appends whatever is in the hold space too.</li> </ol> The hold space is empty until you place something in it so: <pre class="prettyprint"><code>sed G file </code></pre> will insert an empty line after each line. <pre class="prettyprint"><code>sed 'G;G' file </code></pre> will insert 2 empty lines etc etc.

how to tell sed "dot match new line"

3 Answers

If you use a GNU sed, you may match any character, including line break chars, with a mere ., see :

.
Matches any character, including newline.

All you need to use is a -z option:

echo -e "one\ntwo\nthree" | sed -z 's/one.*two/one/'
# => one
#    three

See the online sed demo.

However, one.*two might not be what you need since * is always greedy in POSIX regex patterns. So, one.*two will match the leftmost one, then any 0 or more chars as many as possible, and then the rightmost two. If you need to remove one, then any 0+ chars as few as possible, and then the leftmost two, you will have to use perl:

perl -i -0 -pe 's/one.*?two//sg' file             # Non-Unicode version
perl -i -CSD -Mutf8 -0 -pe 's/one.*?two//sg' file # S&R in a UTF8 file

The -0 option enables the slurp mode so that the file could be read as a whole and not line-by-line, -i will enable inline file modification, s will make . match any char including line break chars, and .*? will match any 0 or more chars as few as possible due to a non-greedy *?. The -CSD -Mutf8 part make sure your input is decoded and output re-encoded back correctly.

196

answered Sep 22 '22 17:09

Wiktor Stribiżew

sed is line-based tool. I don't think these is an option.
You can use h/H(hold), g/G(get).

$ echo -e 'one\ntwo\nthree' | sed -n '1h;1!H;${g;s/one.*two/one/p}'
one
three

Maybe you should try vim

:%s/one\_.*two/one/g

answered Sep 23 '22 17:09

kev

This might work for you:

<<<$'one\ntwo\nthree' sed '/two/d'

<<<$'one\ntwo\nthree' sed '2d'

<<<$'one\ntwo\nthree' sed 'n;d'

<<<$'one\ntwo\nthree' sed 'N;N;s/two.//'

Sed does match all characters (including the \n) using a dot . but usually it has already stripped the \n off, as part of the cycle, so it no longer present in the pattern space to be matched.

Only certain commands (N,H and G) preserve newlines in the pattern/hold space.

N appends a newline to the pattern space and then appends the next line.
H does exactly the same except it acts on the hold space.
G appends a newline to the pattern space and then appends whatever is in the hold space too.

The hold space is empty until you place something in it so:

sed G file

will insert an empty line after each line.

sed 'G;G' file

will insert 2 empty lines etc etc.

answered Sep 21 '22 17:09

potong

Related questions
                            
                                vim/vi/sed: Act on a certain number of lines from the end of the file
                            
                                Remove first N lines of a file in place in unix command line
                            
                                sed -i option is not working on solaris
                            
                                sed extract digits
                            
                                Find and remove DOS line endings on Ubuntu
                            
                                Scripts for listing all the distinct characters in a text file
                            
                                SED to remove a Line with REGEX Pattern
                            
                                UNIX: Using egrep or sed to find the line with the first occurrence of a string?
                            
                                Cleaner way to write multiple sed commands?
                            
                                Delete line from file at specified line number in bourne shell [duplicate]
                            
                                Sed error : bad option in substitution expression
                            
                                How to split file on first empty line in a portable way in shell (e.g. using sed)?
                            
                                I want to use "awk" or sed to print all the lines that start with "comm=" in a file
                            
                                Format output in columns
                            
                                Add prefix to each word of each line in bash
                            
                                Remove multi-line comments
                            
                                Why does this regex run differently in sed than in Perl/Ruby?
                            
                                sed: change values of properties of an environment in a .yml file
                            
                                How portable is it to use semi-colons as command separators in sed?
                            
                                How can I get Octave to change the variables in my input files?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how to tell sed "dot match new line"

Tags:

sed

theta

People also ask

3 Answers

Wiktor Stribiżew

kev

potong

Recent Activity

Donate For Us