SED to remove a Line with REGEX Pattern

Tags:

regex

bash

unix

sed

I've got hundreds of files with thousands of lines each, and I need to delete the lines that follow a pattern, so I turned to sed with a regex. The structure of the files is something like this:

A,12121212121212,foo,bar,lorem
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,21212121212121,foo,bar,lorem
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,9999,88888,77777

I need to delete all the lines that start with "A" and end with "lorem".

Expected output:

C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,9999,88888,77777

I've written this regex:

^(A).*(lorem)

And it matches in my text editors (Sublime, UltraEdit).

In the UNIX shell I ran:

sed '/^(A).*(lorem)/d' file.txt

But somehow it doesn't work: it prints the whole file, and I can't figure out why.

Can someone help me, please?

asked Oct 25 '16 13:10

Imkls

3 Answers

$ sed '/^A.*lorem$/d' file.txt

^A: starts with an A
.*: stuff in the middle
lorem$: ends with lorem
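
To check the command against the question's data, here is a minimal sketch that recreates the sample input in a scratch file and runs the same sed expression on it. The `sample` and `result` variable names and the use of `mktemp` are only for illustration; in practice you would point sed at your own file.

```shell
# Recreate the sample input from the question in a scratch file.
sample=$(mktemp)
cat > "$sample" <<'EOF'
A,12121212121212,foo,bar,lorem
C,32JL,JL
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,21212121212121,foo,bar,lorem
C,32JL,JL
C,32JL,JL
C,32JL,JL
A,9999,88888,77777
EOF

# Delete every line that starts with A and ends with lorem.
# sed writes to standard output; the input file itself is left unchanged.
result=$(sed '/^A.*lorem$/d' "$sample")
printf '%s\n' "$result"

rm -f "$sample"
```

The two `A,...,lorem` lines are dropped, while `A,9999,88888,77777` survives because it does not end with `lorem`.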

answered Oct 18 '22 16:10

James Brown

The others gave you correct solutions but didn't explain why your regex didn't work. The parentheses were unnecessary, but with other tools or languages the same regex might very well have given you the expected result.

It didn't work with sed because sed uses POSIX basic regular expressions (BRE) by default, where the grouping characters are \( and \), while a plain ( or ) matches the literal character. There were no parentheses in your input text, so nothing matched.

Your regular expression would have worked with GNU's sed -r or BSD's sed -E. That flag switches to POSIX extended regular expressions (ERE), where ( and ) group and \( and \) match literal parentheses.

In conclusion, the following commands all do the same thing:

sed '/^A.*lorem$/d' file.txt
sed -r '/^(A).*(lorem)$/d' file.txt (with GNU sed)
sed -E '/^(A).*(lorem)$/d' file.txt (with BSD sed and modern GNU sed)
sed '/^\(A\).*\(lorem\)$/d' file.txt
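
The difference between the two regex flavours is easier to see on input that actually contains parentheses. The sketch below uses three made-up lines (the second one, with literal parentheses, is hypothetical and not from the question) and assumes a sed that accepts -E, as GNU and BSD sed do.

```shell
# Three test lines: a plain one, one with literal parentheses, and a C line.
input=$(printf '%s\n' 'A,1,lorem' '(A),1,(lorem)' 'C,32JL,JL')

# Default BRE: ( and ) are literal characters, so only the line that
# really contains "(A)" and "(lorem)" is deleted.
bre_literal=$(printf '%s\n' "$input" | sed '/^(A).*(lorem)$/d')

# BRE with \( \): these are grouping operators, so the plain line is deleted.
bre_group=$(printf '%s\n' "$input" | sed '/^\(A\).*\(lorem\)$/d')

# ERE (-E): ( and ) group, giving the same result as the BRE \( \) form.
ere_group=$(printf '%s\n' "$input" | sed -E '/^(A).*(lorem)$/d')

printf 'BRE, literal parens:\n%s\n\n' "$bre_literal"
printf 'BRE, escaped groups:\n%s\n\n' "$bre_group"
printf 'ERE, plain groups:\n%s\n' "$ere_group"
```

The first command is the one from the question: it runs without error but deletes nothing from input that has no parentheses in it, which is why the whole file was printed.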

answered Oct 18 '22 17:10

Aaron

Remove the parentheses.

Using your code, the appropriate one-liner becomes:

sed '/^A.*lorem/d' file.txt

If you want to be more rigorous, you can look at James's answer, which more correctly anchors the end of the regex:

sed '/^A.*lorem$/d' file.txt

Both will work on your sample input.
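
The two forms only differ on lines where lorem appears somewhere other than the end. A small sketch with a hypothetical extra line (`A,2,lorem,bar`, which is not in the question's data) shows the difference:

```shell
# The second line starts with A and contains lorem, but does not end with it.
input=$(printf '%s\n' 'A,1,foo,lorem' 'A,2,lorem,bar' 'C,32JL,JL')

# Without the $ anchor, any A line containing lorem is deleted.
unanchored=$(printf '%s\n' "$input" | sed '/^A.*lorem/d')

# With the $ anchor, only A lines that end with lorem are deleted.
anchored=$(printf '%s\n' "$input" | sed '/^A.*lorem$/d')

printf 'unanchored:\n%s\n\n' "$unanchored"
printf 'anchored:\n%s\n' "$anchored"
```

If your real files can contain lorem in a middle field, the anchored version is the one that matches the stated requirement.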

answered Oct 18 '22 15:10

Chem-man17
