Regex capture order: wrong alternative matched after greedy pattern

Tags:

regex

I have this pattern:

(\w+)(sin|in|pak|red)$

And the replacement pattern is this:

$1tak

The problem is that this word:

setesin

will be transformed to:

setestak

instead of

setetak

For some reason, in always takes precedence to sin in the pattern.

How can I enforce the pattern to follow that order?

455

asked Nov 28 '16 13:11

1 Answers

Use a lazy quantifier:

(\w+?)(sin|in|pak|red)$
    ^

See the regex demo

The \w+ contains a greedy quantifier that: 1) grabs as many chars as it can (and note it can match s, i, all letters, digits and underscores) and then backtracks (yielding one char after another moving from right to left), trying to accommodate for the subsequent patterns. Since the in is found first, it is matched, and the whole group is considered matched, the regex goes on to check the end of string with $. A lazy quantifier will have the regex engine skip the \w+? after matching 1 word char, and other patterns will be tried, moving from left to right.

101

answered Oct 13 '22 01:10

Wiktor Stribiżew

Related questions
                            
                                Why does making this getter nullable cause a compile error?
                            
                                Parse a soap XML to a C# class
                            
                                Can I disable ViewCell.ContextActions based on a condition
                            
                                OCR TesseractEngine
                            
                                Override ToString in NUnit without access to source code
                            
                                Registry.SetValue not working for x86
                            
                                Passing dynamically created SQL Parameters into dapper as an anonymous type
                            
                                LINQ to Entities DateTime Compare
                            
                                GetType on Nullable Boolean
                            
                                Failed to allocate a managed memory buffer of 268435456 bytes. The amount of available memory may be low
                            
                                Assert "at least one item in the result collection matches predicate"
                            
                                PowerShell Not Returning to command line after running "dotnet run ..."
                            
                                Do I have to unsubscribe from button events after using it in c#?
                            
                                Xml Requests do not work
                            
                                Serializing anonymous types
                            
                                Xamarin Forms Collapsable StackLayout
                            
                                Unity3d development: JNI ERROR (app bug): accessed stale local reference 0x200001 (index 0 in a table of size 0)
                            
                                The type or namespace name `UnityEditor' could not be found
                            
                                Add drop down in excel using EPplus
                            
                                Can't send Content-Type header with c# HttpClient

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Regex capture order: wrong alternative matched after greedy pattern

Tags:

c#

regex

Cornwell

People also ask

1 Answers

Wiktor Stribiżew

Recent Activity

Donate For Us