I was wondering how to match a line not containing a specific word using Python-style Regex (Just use Regex, not involve Python functions)? Example: <pre class="prettyprint"><code>PART ONE OVERVIEW 1 Chapter 1 Introduction 3 </code></pre> I want to match lines that do not contain the word "PART"？

This should work: <pre class="prettyprint"><code>/^((?!PART).)*$/ </code></pre> If you only wanted to exclude it from the beginning of the line (I know you don't, but just FYI), you could use this: <pre class="prettyprint"><code>/^(?!PART)/ </code></pre> <h3>Edit (by request): Why this pattern works</h3> The <code>(?!...)</code> syntax is a negative lookahead, which I've always found tough to explain. Basically, it means "whatever follows this point must not match the regular expression <code>/PART/</code>." The site I've linked explains this far better than I can, but I'll try to break this down: <pre class="prettyprint"><code>^ #Start matching from the beginning of the string. (?!PART) #This position must not be followed by the string "PART". . #Matches any character except line breaks (it will include those in single-line mode). $ #Match all the way until the end of the string. </code></pre> The <code>((?!xxx).)*</code> idiom is probably hardest to understand. As we saw, <code>(?!PART)</code> looks at the string ahead and says that whatever comes next can't match the subpattern <code>/PART/</code>. So what we're doing with <code>((?!xxx).)*</code> is going through the string letter by letter and applying the rule to all of them. Each character can be anything, but if you take that character and the next few characters after it, you'd better not get the word PART. The <code>^</code> and <code>$</code> anchors are there to demand that the rule be applied to the entire string, from beginning to end. Without those anchors, any piece of the string that didn't begin with PART would be a match. Even PART itself would have matches in it, because (for example) the letter A isn't followed by the exact string PART. Since we do have <code>^</code> and <code>$</code>, if PART were anywhere in the string, one of the characters would match <code>(?=PART).</code> and the overall match would fail. Hope that's clear enough to be helpful.

How to match a line not containing a word [duplicate]

Tags:

regex

I was wondering how to match a line not containing a specific word using Python-style Regex (Just use Regex, not involve Python functions)?

Example:

PART ONE OVERVIEW 1  Chapter 1 Introduction 3

I want to match lines that do not contain the word "PART"？

751

asked Jun 07 '11 00:06

Tim

1 Answers

This should work:

/^((?!PART).)*$/

If you only wanted to exclude it from the beginning of the line (I know you don't, but just FYI), you could use this:

/^(?!PART)/

Edit (by request): Why this pattern works

The (?!...) syntax is a negative lookahead, which I've always found tough to explain. Basically, it means "whatever follows this point must not match the regular expression /PART/." The site I've linked explains this far better than I can, but I'll try to break this down:

^         #Start matching from the beginning of the string.     (?!PART)  #This position must not be followed by the string "PART". .         #Matches any character except line breaks (it will include those in single-line mode). $         #Match all the way until the end of the string.

The ((?!xxx).)* idiom is probably hardest to understand. As we saw, (?!PART) looks at the string ahead and says that whatever comes next can't match the subpattern /PART/. So what we're doing with ((?!xxx).)* is going through the string letter by letter and applying the rule to all of them. Each character can be anything, but if you take that character and the next few characters after it, you'd better not get the word PART.

The ^ and $ anchors are there to demand that the rule be applied to the entire string, from beginning to end. Without those anchors, any piece of the string that didn't begin with PART would be a match. Even PART itself would have matches in it, because (for example) the letter A isn't followed by the exact string PART.

Since we do have ^ and $, if PART were anywhere in the string, one of the characters would match (?=PART). and the overall match would fail. Hope that's clear enough to be helpful.

answered Sep 23 '22 00:09

Justin Morgan

Related questions
                            
                                How to find overlapping matches with a regexp?
                            
                                How to use grep to get anything just after `name=`?
                            
                                Multi-selection with regex (sublime text 2)
                            
                                Warning: preg_replace(): Unknown modifier 'g'
                            
                                In Javascript, how can I perform a global replace on string with a variable inside '/' and '/g'?
                            
                                Regex for Comma delimited list
                            
                                Python and regular expression with Unicode
                            
                                Split Ruby regex over multiple lines
                            
                                Chrome dev tools: any way to exclude requests whose URL matches a regex?
                            
                                Regex any ASCII character
                            
                                How can non-ASCII characters be removed from a string?
                            
                                JS regex to split by line
                            
                                regexes: How to access multiple matches of a group? [duplicate]
                            
                                How do I deal with special characters like \^$.?*|+()[{ in my regex?
                            
                                How do I return a string from a regex match in python? [duplicate]
                            
                                Using sed to delete all lines between two matching patterns
                            
                                Java regex to extract text between tags
                            
                                Escape dot in a regex range
                            
                                RegEx to split camelCase or TitleCase (advanced)
                            
                                How to match hyphens with Regular Expression?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With