Lazy (ungreedy) matching multiple groups using regex

I would like to grab the contents between each pair of <tag></tag> tags.

<tag>
This is one block of text
</tag>

<tag>
This is another one
</tag>

The regex I have come up with is

/<tag>(.*)<\/tag>/m

However, it appears to be greedy: the group captures everything up to the very last </tag>. I would like it to be as lazy as possible, so that every time it sees a closing tag, it treats that as the end of a match group and starts over.

How can I write the regex so that I will be able to get multiple matches in the given scenario?

I have included a sample of what I am describing in the following link

http://rubular.com/r/JW5M3rnqIE

Note: This is not XML, nor is it really based on any existing standard format. I won't need anything sophisticated like a full-fledged library that comes with a nice parser.

370

asked Oct 14 '12 18:10

MxLDevs

1 Answer

Use this regex pattern:

/<tag>(.*?)<\/tag>/im

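The lazy (non-greedy) quantifier is .*?, not .*. It stops at the first closing tag it can reach instead of the last one. In Ruby, the m flag lets . match newlines, and the i flag makes the match case-insensitive.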
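To see the difference side by side, here is a minimal sketch that runs both quantifiers against input modeled on the question's sample. The constant names are only illustrative, and the input string is an assumption about what the real data looks like.

```ruby
# Sample input modeled on the two <tag> blocks from the question (assumed shape).
SAMPLE = "<tag>\nThis is one block of text\n</tag>\n\n<tag>\nThis is another one\n</tag>\n"

# Greedy: .* first consumes as much as possible, then backtracks
# until it finds the LAST </tag> in the input.
GREEDY_CAPTURE = SAMPLE[/<tag>(.*)<\/tag>/m, 1]

# Lazy: .*? consumes as little as possible, so the group ends
# at the FIRST </tag> after the opening tag.
LAZY_CAPTURE = SAMPLE[/<tag>(.*?)<\/tag>/m, 1]

p GREEDY_CAPTURE # spans both blocks, including the inner </tag> and <tag>
p LAZY_CAPTURE   # only the first block's text
```

Note that String#[] with a regex only returns the first match, so this shows the capture boundary but not how to collect every block.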

To find multiple occurrences, use:

string.scan(/<tag>(.*?)<\/tag>/im)
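As a rough sketch of how the scan call behaves on the sample from the question: when the pattern contains a capture group, scan returns one array per match, so the result is nested. The helper name tag_contents is hypothetical, and stripping the surrounding whitespace is an assumption about what the asker wants.

```ruby
# Hypothetical helper, not part of the original answer.
def tag_contents(input)
  # scan returns one array per match holding that match's capture groups,
  # so a single group yields [["..."], ["..."]]. flatten unwraps the nesting,
  # and strip removes the newlines around each block (an assumed preference).
  input.scan(/<tag>(.*?)<\/tag>/im).flatten.map(&:strip)
end

sample = <<~TEXT
  <tag>
  This is one block of text
  </tag>

  <tag>
  This is another one
  </tag>
TEXT

p tag_contents(sample)
# => ["This is one block of text", "This is another one"]
```

If the raw captures (including the leading and trailing newlines) are needed, dropping the map(&:strip) step leaves them untouched.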

100

answered Oct 05 '22 01:10

Ωmega


Tags: java, regex, php, ruby, perl