RegEx with multiple groups?

Tags: python, regex

I'm getting confused returning multiple groups in Python. My RegEx is this:

lun_q = 'Lun:\s*(\d+\s?)*'

And my string is

s = '''Lun:                     0 1 2 3 295 296 297 298'''

I get back a match object and then want to look at the groups, but all it shows is the last number (298):

>>> r.groups()
(u'298',)

Why isn't it returning one group per number (0, 1, 2, 3, 295, etc.)?

410

asked Feb 10 '11 22:02

joslinm

2 Answers

Your regex only contains a single pair of parentheses (one capturing group), so you only get one group in your match. If you use a repetition operator on a capturing group (+ or *), the group gets "overwritten" each time the group is repeated, meaning that only the last match is captured.
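To make the "overwritten" behaviour concrete, here is a minimal sketch using the pattern and string from the question. The second pattern is not from the question: it is an illustrative variant that wraps the whole repetition in the capturing group, so the single group holds the full run of numbers.

```python
import re

s = 'Lun: 0 1 2 3 295 296 297 298'

# One pair of capturing parentheses means one group, however often it repeats.
m = re.match(r'Lun:\s*(\d+\s?)*', s)
print(m.groups())      # only the final repetition is kept

# Illustrative variant: capture the whole repetition instead of each pass.
m_all = re.match(r'Lun:\s*((?:\d+\s?)*)', s)
print(m_all.group(1))  # the full run of numbers as one string
```

The number of groups is fixed by the number of capturing parentheses in the pattern, not by how many times they match.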

In your example here, you're probably better off using .split(), in combination with a regex:

import re

lun_q = r'Lun:\s*(\d+(?:\s+\d+)*)'
s = '''Lun: 0 1 2 3 295 296 297 298'''

r = re.search(lun_q, s)

if r:
    luns = r.group(1).split()

    # optionally, also convert luns from strings to integers
    luns = [int(lun) for lun in luns]

161

answered Sep 23 '22 23:09

Ben Blank

Another approach would be to use the regex you have to validate your data and then use a more specific regex that targets each item you wish to extract using a match iterator.

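import re

s = '''Lun: 0 1 2 3 295 296 297 298'''

# First validate the overall shape of the line and capture the list of numbers
lun_validate_regex = re.compile(r'Lun:\s*((\d+)(\s\d+)*)')
match = lun_validate_regex.match(s)
if match:
    # Then pick out each number with a token-level regex.
    # \d+ is used rather than \d{1,3} so a longer number is not split in two.
    token_regex = re.compile(r"\d+")
    match_iterator = token_regex.finditer(match.group(1))
    for token_match in match_iterator:
        # do something brilliant
        print(token_match.group())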
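If you need the numbers in more than one place, the validate-then-extract idea can be wrapped in a small function. This is only a sketch: `extract_luns`, `LINE_RE` and `TOKEN_RE` are made-up names, and returning an empty list for a line that fails validation is an assumption about what the caller wants. The validation pattern here also allows runs of whitespace between numbers.

```python
import re

# Hypothetical names, chosen for illustration only.
LINE_RE = re.compile(r'Lun:\s*(\d+(?:\s+\d+)*)')   # validates the whole line
TOKEN_RE = re.compile(r'\d+')                      # extracts one number at a time

def extract_luns(line):
    """Return the numbers on a 'Lun:' line as ints, or [] if the line doesn't match."""
    match = LINE_RE.match(line)
    if match is None:
        return []
    return [int(tok.group()) for tok in TOKEN_RE.finditer(match.group(1))]

print(extract_luns('Lun:                     0 1 2 3 295 296 297 298'))
print(extract_luns('Size: 42'))
```

The first call prints the eight numbers as integers; the second prints an empty list because the line never passes validation.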

answered Sep 21 '22 23:09

pokstad
