How do the following quantifiers differ - with respect of scenarios, speed, etc.
?
, ??
and ?+
all match 0 or 1 times.*
, *? and
*+` all match 0 or more times.+
, +?
and ++
all match 1 or more times.?
, *
and +
are greedy.??
, *?
and +?
are reluctant/lazy. ?+
, *+
and ++
are possessive.Can anyone help me to understand what these terms mean? Why are there three variations of each quantifier for the same job?
The lazy mode of quantifiers is an opposite to the greedy mode. It means: “repeat minimal number of times”. We can enable it by putting a question mark '?' after the quantifier, so that it becomes *? or +? or even ?? for '?' .
Like a greedy quantifier, a possessive quantifier repeats the token as many times as possible. Unlike a greedy quantifier, it does not give up matches as the engine backtracks. With a possessive quantifier, the deal is all or nothing.
Greedy Quantifier (Default) Greedy quantifiers try to match the longest text that matches a given pattern. Greedy quantifiers work by first reading the entire string before trying any match. If the whole text doesn't match, remove the last character and try again, repeating the process until a match is found.
So the difference between the greedy and the non-greedy match is the following: The greedy match will try to match as many repetitions of the quantified pattern as possible. The non-greedy match will try to match as few repetitions of the quantified pattern as possible.
Take the string
aaaab
and see how the following regexes match it:
Regex Submatches
group 1 group 2 group3
(a?)(a*)(ab) a aa ab
(a??)(a*)(ab) aaa ab
(a?+)(a*)(ab) a aa ab
(a*)(a?)(ab) aaa ab
(a*?)(a?)(ab) aa a ab
(a*+)(a?)(ab) aaaa <Match fails!>
(a+)(a*)(ab) aaa ab
(a+?)(a*)(ab) a aa ab
(a++)(a*)(ab) aaaa <Match fails!>
Explanation:
a?
tries to match one a
, but it's prepared to match nothing if that's necessary for the whole match to succeed.a??
tries to match nothing, but it's prepared to match one a
if that's necessary for the whole match to succeed.a?+
tries to match one a
. If it can do that, it will not back down to match nothing if that were necessary for the overall match to succeed. If it can't match an a
, then it will gladly match nothing, though.a*
tries to match as many a
s as it can, but it's prepared to match fewer a
s, even nothing if that's necessary for the whole match to succeed.a*?
tries to match nothing, but it's prepared to match just as many a
s as is absolutely necessary in order for the whole match to succeed, but not more.a*+
tries to match as many a
s as it can. If it can do that, it will not back down to match fewer a
s if that were necessary for the overall match to succeed. If it can't match even a single a
, then it will gladly match nothing, though.a+
tries to match as many a
s as it can, but it's prepared to match fewer a
s (but at least one) if that's necessary for the whole match to succeed.a+?
tries to match only one a
, but it's prepared to match just as many a
s as is absolutely necessary in order for the whole match to succeed, but not more.a++
tries to match as many a
s as it can. If it can do that, it will not back down to match fewer a
s if that were necessary for the overall match to succeed. If it can't match even a single a
, then the regex fails.If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With