Difference in match due to the position of the negative lookahead?

Tags:

javascript

string

regex

php

lookahead

I am confused about several things in regular expressions and am trying to work through them. I have the following string:

{start}do or die{end}extended string

My two different regexes, where I only changed the position of the dot:

(.(?!{end}))* //returns: {start}do or di
                                      //^ See here
((?!{end}).)* //returns: {start}do or die
                                      //^ See here

Why does the first regex leave out the last "e"?

Also, how does this negative lookahead make the * quantifier non-greedy? I mean, why can't it consume characters beyond {end}?

523

asked Jul 17 '15 18:07

AL-zami

2 Answers

A negative lookahead asserts that the pattern inside it, which in your case is {end}, cannot match at the current position. And . matches any character except a newline.

So with your first regex:

(.(?!{end}))*

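It leaves out the e because the lookahead is checked after the dot has consumed a character: once the dot has matched e, the text that follows is {end}, so the negative lookahead fails and that e cannot be part of the match. In your second regex the lookahead is checked before the dot consumes a character, so the dot keeps matching up to the position where {end} begins, and the e is included.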
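To check the two results, here is a minimal JavaScript sketch. The variable names are made up for illustration, and the braces are escaped (an assumption on my part, so the patterns stay valid under the `u` flag). The third pattern is an addition that is not in the question: it makes the star lazy, which suggests the original star is still greedy and simply stops at the first position where one more repetition is not allowed.

```javascript
const text = "{start}do or die{end}extended string";

// Dot first: the lookahead is tested AFTER each consumed character.
const dotFirst = /(.(?!\{end\}))*/;
// Lookahead first: the lookahead is tested BEFORE each consumed character.
const lookaheadFirst = /((?!\{end\}).)*/;
// Same as lookaheadFirst but with a lazy star, for comparison only.
const lazyLookaheadFirst = /((?!\{end\}).)*?/;

const dotFirstMatch = text.match(dotFirst)[0];
const lookaheadFirstMatch = text.match(lookaheadFirst)[0];
const lazyMatch = text.match(lazyLookaheadFirst)[0];

console.log(dotFirstMatch);             // {start}do or di
console.log(lookaheadFirstMatch);       // {start}do or die
console.log(JSON.stringify(lazyMatch)); // "" (a lazy star takes as little as possible)
```

The greedy star takes as many repetitions as it can, and the lookahead only decides where "as many as it can" ends.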

128

answered Sep 30 '22 19:09

Rizier123

I have worked out how the regex engine processes each of the two regexes.

First, for (.(?!{end}))* the engine proceeds as follows:

"{start}do or die{end}extended string"
 ^ The dot matches "{". The lookahead then checks whether {end} follows; it does not, so "{" is included.
"{start}do or die{end}extended string"
  ^ The dot matches "s". The lookahead checks whether {end} follows; it does not, so "s" is included.

....
.... and so on ...
"{start}do or die{end}extended string"
                ^ The dot matches "e", but {end} follows it, so the negative lookahead fails and "e" is excluded.
So the match we get is "{start}do or di".
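The walk-through above can be replayed with a small hand-written loop. This is only a sketch of the order of checks, not how a real regex engine is implemented. The function name `traceDotThenLookahead` is hypothetical, and it simplifies the dot to "anything except \n".

```javascript
// Simulates (.(?!{end}))* from position 0: consume one character,
// then check that the stop text does NOT start right after it.
function traceDotThenLookahead(text, stop) {
  let pos = 0;
  const steps = [];
  while (pos < text.length) {
    const ch = text[pos];
    if (ch === "\n") break; // simplified rule for what "." refuses to match
    if (text.startsWith(stop, pos + 1)) {
      // The lookahead fails after the dot consumed ch, so ch is given back.
      steps.push(`"${ch}" rejected: "${stop}" follows it`);
      break;
    }
    steps.push(`"${ch}" accepted`);
    pos += 1;
  }
  return { matched: text.slice(0, pos), steps };
}

const firstTrace = traceDotThenLookahead("{start}do or die{end}extended string", "{end}");
console.log(firstTrace.steps.join("\n"));
console.log(firstTrace.matched); // {start}do or di
```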

For the second regex, ((?!{end}).)*, the engine proceeds as follows:

"{start}do or die{end}extended string"
 ^ The lookahead tries to match {end} here and fails, so the dot consumes "{".

"{start}do or die{end}extended string"
  ^ The lookahead tries to match {end} here and fails again, so the dot consumes "s".

....
.... and so on ...
"{start}do or die{end}extended string"
                ^ The lookahead tries to match {end} here and fails, so the dot consumes "e".
"{start}do or die{end}extended string"
                 ^ {end} matches here, so the negative lookahead fails and the repetition stops.

So we end up with the match "{start}do or die".
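The same kind of hand-written loop can replay this second order of checks. Again this is a sketch under the same simplifications (the dot is treated as "anything except \n"), and `traceLookaheadThenDot` is a hypothetical name.

```javascript
// Simulates ((?!{end}).)* from position 0: first check that the stop text
// does NOT start at the current position, then consume one character.
function traceLookaheadThenDot(text, stop) {
  let pos = 0;
  const steps = [];
  while (pos < text.length) {
    if (text.startsWith(stop, pos)) {
      // The lookahead fails before the dot gets to consume anything.
      steps.push(`position ${pos}: "${stop}" starts here, repetition stops`);
      break;
    }
    const ch = text[pos];
    if (ch === "\n") break; // simplified rule for what "." refuses to match
    steps.push(`position ${pos}: "${ch}" consumed`);
    pos += 1;
  }
  return { matched: text.slice(0, pos), steps };
}

const secondTrace = traceLookaheadThenDot("{start}do or die{end}extended string", "{end}");
console.log(secondTrace.steps.join("\n"));
console.log(secondTrace.matched); // {start}do or die
```

The only difference from the first order is where the stop text is tested: at the current position instead of one character ahead, which is why the final "e" survives here.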

answered Sep 30 '22 20:09

AL-zami
