I need to find, process and remove (one by one) any substrings that match a rather long regex: <pre class="prettyprint"><code># p is a compiled regex # s is a string while 1: m = p.match(s) if m is None: break process(m.group(0)) #do something with the matched pattern s = re.sub(m.group(0), '', s) #remove it from string s </code></pre> The code above is not good for 2 reasons: <ol> <li>It doesn't work if m.group(0) happens to contain any regex-special characters (like *, +, etc.).</li> <li>It feels like I'm duplicating the work: first I search the string for the regular expression, and then I have to kinda go look for it again to remove it.</li> </ol> What's a good way to do this?

The re.sub function can take a function as an argument so you can combine the replacement and processing steps if you wish: <pre class="prettyprint"><code># p is a compiled regex # s is a string def process_match(m): # Process the match here. return '' s = p.sub(process_match, s) </code></pre>

python regex match and replace

Tags:

python

regex

I need to find, process and remove (one by one) any substrings that match a rather long regex:

# p is a compiled regex
# s is a string  
while 1:
    m = p.match(s)
    if m is None:
        break
    process(m.group(0)) #do something with the matched pattern
    s = re.sub(m.group(0), '', s) #remove it from string s

The code above is not good for 2 reasons:

It doesn't work if m.group(0) happens to contain any regex-special characters (like *, +, etc.).
It feels like I'm duplicating the work: first I search the string for the regular expression, and then I have to kinda go look for it again to remove it.

What's a good way to do this?

277

asked Aug 22 '10 21:08

max

1 Answers

The re.sub function can take a function as an argument so you can combine the replacement and processing steps if you wish:

# p is a compiled regex
# s is a string  
def process_match(m):
    # Process the match here.
    return ''

s = p.sub(process_match, s)

144

answered Sep 17 '22 16:09

Mark Byers

Related questions
                            
                                Passing arguments (for argparse) with unittest discover
                            
                                sqlalchemy, using check constraints
                            
                                TensorBoard: How to plot histogram for gradients?
                            
                                How to smooth by interpolation when using pcolormesh?
                            
                                Is there a comprehensive table of Python's "magic constants"?
                            
                                Simplifying / optimizing a chain of for-loops
                            
                                Heroku - No web process running
                            
                                Search and replace placeholder text in PDF with Python
                            
                                Why does a newly created variable in Python have a ref-count of four?
                            
                                Recommended way to implement __eq__ and __hash__
                            
                                ModuleNotFoundError: No module named 'BaseHTTPServer'
                            
                                python a,b = b,a implementation? How is it different from C++ swap function?
                            
                                VSCode: The term 'python' is not recognized...but py works
                            
                                Python and Dart Integration in Flutter Mobile Application
                            
                                PyTorch: What's the difference between state_dict and parameters()?
                            
                                Use Python Pool with context manager or close and join
                            
                                pytorch RuntimeError: Expected object of scalar type Double but got scalar type Float
                            
                                Spark: Why does Python significantly outperform Scala in my use case?
                            
                                How do I reply to an email using the Python imaplib and include the original message?
                            
                                Simple multilingual CMS? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With