Java Regex Capturing Groups

People also ask

What is regex capturing group?

Capturing groups are a way to treat multiple characters as a single unit. They are created by placing the characters to be grouped inside a set of parentheses. For example, the regular expression (dog) creates a single group containing the letters "d", "o", and "g".
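As a rough illustration of "a single unit", the sketch below applies a + quantifier to the group (dog), so the quantifier repeats the whole word rather than only the last letter. The class and method names (CapturingGroupBasics, wholeMatch, groupOne) and the sample input are made up for this example.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class CapturingGroupBasics {
    // "(dog)+" applies the + quantifier to the whole group, not just to "g".
    private static final Pattern DOGS = Pattern.compile("(dog)+");

    // Text of the whole match (group 0), or null when nothing matches.
    static String wholeMatch(String input) {
        Matcher m = DOGS.matcher(input);
        return m.find() ? m.group() : null;
    }

    // Text captured by group 1, or null when nothing matches.
    static String groupOne(String input) {
        Matcher m = DOGS.matcher(input);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        System.out.println(wholeMatch("hotdogdog!")); // dogdog
        System.out.println(groupOne("hotdogdog!"));   // dog
    }
}
```

When a group is repeated, group 1 holds the text from its last repetition, which is why it prints a single "dog" here.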

What is non capturing group in regex?

Non-capturing groups are important constructs within Java Regular Expressions. They create a sub-pattern that functions as a single unit but does not save the matched character sequence.

What is first capturing group in regex?

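In a pattern such as (abc), the first group matches abc. Parentheses group the regex between them. In Java they are written unescaped; escaped parentheses \( and \) match literal parenthesis characters instead. The parentheses capture the text matched by the regex inside them into a numbered group that can be reused with a numbered backreference. They also allow you to apply regex operators to the entire grouped regex.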
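A small sketch of a numbered backreference in Java, where \1 (written "\\1" in a string literal) must match the same text that group 1 captured. The class name BackreferenceDemo and the method hasRepeat are hypothetical names chosen for this example.

```java
import java.util.regex.Pattern;

class BackreferenceDemo {
    // (abc) is group 1; \\1 must then match the same text group 1 captured.
    private static final Pattern REPEATED = Pattern.compile("(abc)\\1");

    // True when the input contains "abc" immediately followed by "abc".
    static boolean hasRepeat(String input) {
        return REPEATED.matcher(input).find();
    }

    public static void main(String[] args) {
        System.out.println(hasRepeat("abcabc")); // true
        System.out.println(hasRepeat("abcabd")); // false
    }
}
```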

How do you use non capturing groups?

Sometimes you want to use parentheses to group parts of an expression together, but you don't want the group to capture anything from the substring it matches. To do this use (?: and ) to enclose the group.
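For instance, the sketch below uses (?: and ) to group two alternatives without capturing them, so the only numbered group is the name that follows. The pattern, the class name NonCapturingDemo, and the sample input are assumptions made for illustration.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class NonCapturingDemo {
    // (?:Mr|Ms) groups the alternatives without capturing them;
    // (\\w+) is the only capturing group, so it is group 1.
    private static final Pattern TITLE_NAME = Pattern.compile("(?:Mr|Ms)\\. (\\w+)");

    // Number of capturing groups in the pattern (group 0 is not counted).
    static int capturingGroupCount() {
        return TITLE_NAME.matcher("").groupCount();
    }

    // The name following the title, or null when nothing matches.
    static String surname(String input) {
        Matcher m = TITLE_NAME.matcher(input);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        System.out.println(capturingGroupCount());         // 1
        System.out.println(surname("Dear Ms. Lovelace,")); // Lovelace
    }
}
```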

The issue you're having is with the type of quantifier. You're using a greedy quantifier in your first group (index 1; index 0 represents the whole match), which means it will match as much as it can. Since it is .* (any character), it will consume as many characters as possible while still letting the following groups match.

In short, your 1st group .* matches anything as long as the next group \\d+ can match something (in this case, the last digit).

As for the 3rd group, it will match anything after the last digit.
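To see the greedy behaviour concretely, here is a minimal sketch. It assumes the pattern in question was (.*)(\\d+)(.*), which is what the answers on this page imply; the class name GreedyGroupsDemo and the helper groups are hypothetical.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class GreedyGroupsDemo {
    // Returns the text of groups 1..n for the first match, or an empty array.
    static String[] groups(String regex, String input) {
        Matcher m = Pattern.compile(regex).matcher(input);
        if (!m.find()) {
            return new String[0];
        }
        String[] out = new String[m.groupCount()];
        for (int i = 1; i <= m.groupCount(); i++) {
            out[i - 1] = m.group(i);
        }
        return out;
    }

    public static void main(String[] args) {
        String line = "This order was placed for QT3000! OK?";
        // Greedy (.*) keeps everything up to the last digit it can spare.
        for (String g : groups("(.*)(\\d+)(.*)", line)) {
            System.out.println("[" + g + "]");
        }
        // [This order was placed for QT300]
        // [0]
        // [! OK?]
    }
}
```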

If you change it to a reluctant quantifier in your 1st group, you'll get the result I suppose you are expecting, that is, the 3000 part.

Note the question mark in the 1st group.


// Needs: import java.util.regex.Matcher; import java.util.regex.Pattern;
String line = "This order was placed for QT3000! OK?";
// (.*?) is reluctant: it stops as soon as (\\d+) can start matching.
Pattern pattern = Pattern.compile("(.*?)(\\d+)(.*)");
Matcher matcher = pattern.matcher(line);
while (matcher.find()) {
    System.out.println("group 1: " + matcher.group(1));
    System.out.println("group 2: " + matcher.group(2));
    System.out.println("group 3: " + matcher.group(3));
}

Output:


group 1: This order was placed for QT
group 2: 3000
group 3: ! OK?

More info on Java Pattern here.

Finally, the capturing groups are delimited by round brackets, and provide a very useful way to use back-references (amongst other things), once your Pattern is matched to the input.

In Java 6 groups can only be referenced by their order (beware of nested groups and the subtlety of ordering).
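The ordering subtlety with nested groups is that groups are numbered by the position of their opening parenthesis, counted from left to right. A small sketch, using a made-up class name NestedGroupOrder and a toy pattern:

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class NestedGroupOrder {
    // Opening parentheses, left to right:
    // 1: ((a)(b(c)))   2: (a)   3: (b(c))   4: (c)
    private static final Pattern NESTED = Pattern.compile("((a)(b(c)))");

    // Text captured by the given group number when matching "abc".
    static String group(int index) {
        Matcher m = NESTED.matcher("abc");
        return m.matches() ? m.group(index) : null;
    }

    public static void main(String[] args) {
        for (int i = 1; i <= 4; i++) {
            System.out.println("group " + i + ": " + group(i));
        }
        // group 1: abc
        // group 2: a
        // group 3: bc
        // group 4: c
    }
}
```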

In Java 7 it's much easier, as you can use named groups.
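A sketch of the named-group syntax available since Java 7, applied to the same sample line. The group names prefix, number, and rest, and the class name NamedGroupsDemo, are chosen here for illustration only.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class NamedGroupsDemo {
    // (?<name>...) defines a named capturing group; it still gets a number too.
    private static final Pattern ORDER =
            Pattern.compile("(?<prefix>.*?)(?<number>\\d+)(?<rest>.*)");

    // Text captured by the named group, or null when nothing matches.
    static String named(String groupName, String input) {
        Matcher m = ORDER.matcher(input);
        return m.find() ? m.group(groupName) : null;
    }

    public static void main(String[] args) {
        String line = "This order was placed for QT3000! OK?";
        System.out.println(named("number", line)); // 3000
        System.out.println(named("rest", line));   // ! OK?
    }
}
```

Referring to groups by name means the lookup keeps working if another group is later added in front of it.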

This is totally OK.

Group 0 (m.group(0)) always holds the whole region matched by your regular expression. In this case, it's the whole string.
Quantifiers are greedy by default, meaning that the first group captures as much as possible without preventing the overall match. The (.*)(\\d+) part (the first part of your regex) puts ...QT300 in the first group and the 0 in the second.
You can quickly fix this by making the first group non-greedy: change (.*) to (.*?).

For more info on greedy vs. lazy, check this site.

Your understanding is correct. However, if we walk through:

(.*) will swallow the whole string;
it will need to give back characters so that (\\d+) is satisfied (which is why 0 is captured, and not 3000);
the last (.*) will then capture the rest.

I am not sure what the original intent of the author was, however.

From the doc:

Capturing groups are indexed from left to right, starting at one. Group zero denotes the entire pattern, so the expression m.group(0) is equivalent to m.group().

So capture group 0 returns the whole line.
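A quick way to check this, sketched under the assumption that the pattern is (.*)(\\d+)(.*) as discussed on this page. The class name GroupZeroDemo and its helper methods are hypothetical.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class GroupZeroDemo {
    // Text of group 0 for the first match, or null when nothing matches.
    static String groupZero(String regex, String input) {
        Matcher m = Pattern.compile(regex).matcher(input);
        return m.find() ? m.group(0) : null;
    }

    // True when group(0) and group() return the same text for the first match.
    static boolean zeroEqualsNoArg(String regex, String input) {
        Matcher m = Pattern.compile(regex).matcher(input);
        return m.find() && m.group(0).equals(m.group());
    }

    public static void main(String[] args) {
        String line = "This order was placed for QT3000! OK?";
        String regex = "(.*)(\\d+)(.*)";
        System.out.println(groupZero(regex, line).equals(line)); // true
        System.out.println(zeroEqualsNoArg(regex, line));        // true
    }
}
```

Note that groupCount() does not include group 0, so for this pattern it reports 3 even though group(0) is available.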
