How does this regex work when finding the last occurrence of a word?

Tags:

regex

I came across a regex like the following:

foo(?!.*foo)

if it is fed with foo bar bar foo, it will find the last occurrence of foo. I know it uses a mechanism called negative lookahead which means it will match a word which not end with characters after the ?!. But how does the regex here works?

512

asked May 20 '14 04:05

photosynthesis

3 Answers

Slightly different answer from sshashank (because the word containing in his answer doesn't work for me and in regex you have to be pedantic—it's all about precision.) I'm 100% sure sshashank knows this and only phrased it that way for brevity.

The regex matches foo, not followed (i.e., negative lookahead (?!) by this:

{{{any number of any characters (i.e., .*) then the characters foo}}}

If the lookahead fails, the portion corresponding to .* does not contain foo. foo comes later.

See this automatic translation:

NODE                     EXPLANATION
--------------------------------------------------------------------------------
  foo                      'foo'
--------------------------------------------------------------------------------
  (?!                      look ahead to see if there is not:
--------------------------------------------------------------------------------
    .*                       any character except \n (0 or more times
                             (matching the most amount possible))
--------------------------------------------------------------------------------
    foo                      'foo'
--------------------------------------------------------------------------------
  )                        end of look-ahead

The same in different words from regex101:

/foo(?!.*foo)/

foo matches the characters foo literally (case sensitive)
(?!.*foo) Negative Lookahead - Assert that it is impossible to match the regex below
    .* matches any character (except newline)
        Quantifier: Between zero and unlimited times, as many times as possible, giving back as needed [greedy]
    foo matches the characters foo literally (case sensitive)

What does RegexBuddy have to say?

foo(?!.*foo)

foo(?!.*foo)

Match the character string “foo” literally (case sensitive) foo
Assert that it is impossible to match the regex below starting at this position (negative lookahead) (?!.*foo)
- Match any single character that is NOT a line break character (line feed, carriage return, next line, line separator, paragraph separator) .*
  - Between zero and unlimited times, as many times as possible, giving back as needed (greedy) *
- Match the character string “foo” literally (case sensitive) foo

131

answered Apr 02 '23 20:04

zx81

It matches foo only if it is not followed (?!) by any more text (.*) containing foo in it.

answered Apr 02 '23 20:04

sshashank124

Negative lookahead is essential if you want to match something not followed by something else.

Short explanation:

foo(?!.*foo) matches foo when not followed by any character except \n and `foo`

For example, say you have the following two strings.

foobar
barfoo

And the regular expression:

foo(?!bar)

This matches foo when not followed by bar so it would match the string barfoo here.

answered Apr 02 '23 19:04

hwnd

Related questions
                            
                                regex matching an open and close tag and a certain text patterns inside that tag [duplicate]
                            
                                Find and Replace Whole Words (not substrings) in Emacs
                            
                                Regular expression in index function
                            
                                Why this javascript regex doesn't work?
                            
                                Regular expression: Match string between two slashes if the string itself contains escaped slashes
                            
                                List all files not starting with a number
                            
                                How do I remove all punctuation that follows a string?
                            
                                simple .htaccess redirect : how to redirect with parameters?
                            
                                Build a dictionary from successful regex matches in python
                            
                                How to make this regex allow spaces c#
                            
                                How to concatenate $1 with number in a regex
                            
                                Why is (new RegExp("\\w") === /\w/) false in JS?
                            
                                regex to match word boundary beginning with special characters
                            
                                Ruby alphanumeric check
                            
                                javascript regex, word not followed and not preceded by specific char
                            
                                How to convert some character into five digit unicode one in Python 3.3?
                            
                                Django url pattern regex to pass a email as a parameter in the url
                            
                                Regex for matching just the first occurrence of a comma in each line [closed]
                            
                                Sed: How to replace a string found after a specific pattern is located in a file
                            
                                Javascript: Regex to escape parentheses and spaces

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With