regexes: How to access multiple matches of a group? [duplicate]

Tags:

I am putting together a fairly complex regular expression. One part of the expression matches strings such as '+a', '-57' etc. A + or a - followed by any number of letters or numbers. I want to match 0 or more strings matching this pattern.

This is the expression I came up with:

([\+-][a-zA-Z0-9]+)*

If I were to search the string '-56+a' using this pattern I would expect to get two matches:

+a and -56

However, I only get the last match returned:

>>> m = re.match("([\+-][a-zA-Z0-9]+)*", '-56+a') >>> m.groups() ('+a',)

Looking at the python docs I see that:

If a group matches multiple times, only the last match is accessible:
>>> m = re.match(r"(..)+", "a1b2c3")  # Matches 3 times. >>> m.group(1)                        # Returns only the last match. 'c3' 

So, my question is: how do you access multiple group matches?

305

asked Feb 20 '11 22:02

Tom Scrace

1 Answers

Drop the * from your regex (so it matches exactly one instance of your pattern). Then use either re.findall(...) or re.finditer (see here) to return all matches.

Update:

It sounds like you're essentially building a recursive descent parser. For relatively simple parsing tasks, it is quite common and entirely reasonable to do that by hand. If you're interested in a library solution (in case your parsing task may become more complicated later on, for example), have a look at pyparsing.

196

answered Oct 07 '22 01:10

phooji

Related questions
                            
                                Error trying to install Postgres for python (psycopg2)
                            
                                Find the date for the first Monday after a given date
                            
                                Get all text inside a tag in lxml
                            
                                How can I convert radians to degrees with Python?
                            
                                How can I denote unused function arguments?
                            
                                inverting image in Python with OpenCV
                            
                                Debugging the error "gcc: error: x86_64-linux-gnu-gcc: No such file or directory"
                            
                                Find Monday's date with Python
                            
                                SSL: CERTIFICATE_VERIFY_FAILED with Python3
                            
                                Python urllib2, basic HTTP authentication, and tr.im
                            
                                Scikit-learn: How to obtain True Positive, True Negative, False Positive and False Negative
                            
                                Intersecting two dictionaries
                            
                                Memory error when using pandas read_csv
                            
                                When and how to use Tornado? When is it useless?
                            
                                matplotlib: can I create AxesSubplot objects, then add them to a Figure instance?
                            
                                Python remove set from set
                            
                                Pandas timeseries plot setting x-axis major and minor ticks and labels
                            
                                how to convert 2d list to 2d numpy array?
                            
                                Mocking Functions Using Python Mock
                            
                                Is 'file' a keyword in python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

regexes: How to access multiple matches of a group? [duplicate]

Tags:

python

regex

Tom Scrace

People also ask

1 Answers

phooji

Recent Activity

Donate For Us