I want a regex that matches any set of digits, with one possible dot. If there is another dot and more digits after it, do an overlapping match with the previous digits, the dot, and the following digits. example string = <code>'aa323aa232.02.03.23.99aa87..0.111111.mm'</code> desired results = <code>[323, 232.02, 02.03, 03.23, 23.99, 87, 0.111111]</code> currently using: <pre class="prettyprint"><code>import re i = 'aa323aa232.02.03.23.99aa87..0.111111.mm' matches = re.findall(r'(?=(\d+\.{0,1}\d+))', i) print matches </code></pre> output: <pre class="prettyprint"><code>['323', '23', '232.02', '32.02', '2.02', '02.03', '2.03', '03.23', '3.23', '23.99', '3.99', '99', '87', '0.111111', '111111', '11111', '1111', '111', '11'] </code></pre>

This uses a lookahead assertion for capturing, and then another expression for gobbling characters following your rules: <pre class="prettyprint"><code>>>> import re >>> i = 'aa323aa232.02.03.23.99aa87..0.111111.mm' >>> re.findall(r'(?=(\d+(?:\.\d+)?))\d+(?:\.\d+(?!\.?\d))?', i) </code></pre> Output <pre class="prettyprint"><code>['323', '232.02', '02.03', '03.23', '23.99', '87', '0.111111'] </code></pre>

Overlapping regex

Tags:

python

regex

I want a regex that matches any set of digits, with one possible dot. If there is another dot and more digits after it, do an overlapping match with the previous digits, the dot, and the following digits.
example string = 'aa323aa232.02.03.23.99aa87..0.111111.mm'
desired results = [323, 232.02, 02.03, 03.23, 23.99, 87, 0.111111]

currently using:

Click to copy

import re
i = 'aa323aa232.02.03.23.99aa87..0.111111.mm'
matches = re.findall(r'(?=(\d+\.{0,1}\d+))', i)
print matches

output:

Click to copy

['323', '23', '232.02', '32.02', '2.02', '02.03', '2.03', '03.23', '3.23', '23.99', '3.99', '99', '87', '0.111111', '111111', '11111', '1111', '111', '11']

436

asked Jun 20 '14 21:06

user193661

1 Answers

This uses a lookahead assertion for capturing, and then another expression for gobbling characters following your rules:

Click to copy

>>> import re
>>> i = 'aa323aa232.02.03.23.99aa87..0.111111.mm'
>>> re.findall(r'(?=(\d+(?:\.\d+)?))\d+(?:\.\d+(?!\.?\d))?', i)

Output

Click to copy

['323', '232.02', '02.03', '03.23', '23.99', '87', '0.111111']

answered Oct 14 '22 12:10

Miller

Related questions
                            
                                Writing a csv temporary file using tempfile
                            
                                Determine if __getattr__ is method or attribute call
                            
                                Can not the computed centroid values to be plotted over the existing plot based on data
                            
                                Why \g<0> behaves differently than \0 in re.sub?
                            
                                Find two pairs of pairs that sum to the same value
                            
                                Python Invalid syntax in elif [closed]
                            
                                pybluez installation errors on Mac OS
                            
                                How can I embed a python interpreter frame in python using tkinter?
                            
                                How to handle in_data in Pyaudio callback mode?
                            
                                How to build a Django REST route that extracts multiple arguments from the URL?
                            
                                Using Mechanize (Python) to fill form
                            
                                ImportError: cannot import name add_newdocs
                            
                                Show current cursor position in Selenium
                            
                                Using Middleware to ignore duplicates in Scrapy
                            
                                SQLAlchemy: update from_select
                            
                                How to avoid axis values with 1e7 in pandas and matplotlib
                            
                                How to define a "callable" parameter in a Python docstring?
                            
                                Programmatically rotate monitor
                            
                                ranks within groupby in pandas
                            
                                May someone explain the following os.fork() example to me?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Overlapping regex

Tags:

python

regex

user193661

People also ask

1 Answers

Miller

Recent Activity

Donate For Us