Regex Pattern Catastrophic backtracking

Tags:

I have the regex shown below used in one of my old Java systems which is causing backtracking issues lately. Quite often the backtracking threads cause the CPU of the machine to hit the upper limit and it does not return back until the application is restarted.

Could any one suggest a better way to rewrite this pattern or a tool which would help me to do so?

Pattern:

^\[(([\p{N}]*\]\,\[[\p{N}]*)*|[\p{N}]*)\]$

Values working:

[1234567],[89023432],[124534543],[4564362],[1234543],[12234567],[124567],[1234567],[1234567]

Catastrophic backtracking values — if anything is wrong in the values (an extra brace added at the end):

[1234567],[89023432],[124534543],[4564362],[1234543],[12234567],[124567],[1234567],[1234567]]

663

asked Apr 20 '15 14:04

Achilles

1 Answers

Never use * when + is what you mean. The first thing I noticed about your regex is that almost everything is optional. Only the opening and closing square brackets are required, and I'm pretty sure you don't want to treat [] as a valid input.

One of the biggest causes of runaway backtracking is to have two or more alternatives that can match the same things. That's what you've got with the |[\p{N}]* part. The regex engine has to try every conceivable path through the string before it gives up, so all those \p{N}* constructs get into an endless tug-of-war over every group of digits.

But there's no point trying to fix those problems, because the overall structure is wrong. I think this is what you're looking for:

^\[\p{N}+\](?:,\[\p{N}+\])*$

After it consumes the first token ([1234567]), if the next thing in the string is not a comma or the end of the string, it fails immediately. If it does see a comma, it must go on to match another complete token ([89023432]), or it fails immediately.

That's probably the most important thing to remember when you're creating a regex: if it's going to fail, you want it to fail as quickly as possible. You can use features like atomic groups and possessive quantifiers toward that end, but if you get the structure of the regex right, you rarely need them. Backtracking is not inevitable.

197

answered Oct 28 '22 20:10

Alan Moore

Related questions
                            
                                Missing scheme (IllegalArgumentException) while using java.nio.file.Paths interface
                            
                                What is the difference in converting string buffer to string using .toString(), String.valueOf() and + " "
                            
                                What is the significance of "key password" in keystore using keytool
                            
                                Android - View.requestLayout doesn't work in OnLayoutChangeListener
                            
                                Why java.util.Objects private constructor throws assertionError
                            
                                NoClassDefFoundError: org/hibernate/annotations/common/reflection/MetadataProvider
                            
                                How to make simple workflow from existing code?
                            
                                Why are we allowed to have a final main method in java?
                            
                                How can I have JAX-RS return a Java 8 LocalDateTime property as a JavaScript-style Date String?
                            
                                Gradle: how to list all "given tests"
                            
                                Jackson prefers private constructor over @JsonCreator when deserializing a class with @JsonValue
                            
                                Is it possible to enable -Werror for JavaCompile in gradle?
                            
                                IntelliJ how do I generate a new class?
                            
                                Unit Testing /login in Spring MVC using MockMvc
                            
                                java.lang.NoClassDefFoundError: javax/mail/MessagingException unsolved
                            
                                How to know affected rows in Cassandra(CQL)?
                            
                                "Unfortunately, Launcher has stopped" on Android Nexus 6 emulator
                            
                                Execution failed for task ':app:dexDebug' Android Studio
                            
                                Constructor Inside Inner Static Class in java?
                            
                                Reading line breaks in CSV which are quoted in the file in FlatfileItemReader of spring batch

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Regex Pattern Catastrophic backtracking

Tags:

java

regex

backtracking

Achilles

People also ask

1 Answers

Alan Moore

Recent Activity

Donate For Us