<p>Is there a way that I can find out how many matches of a regex are in a string in Python? For example, if I have the string <code>"It actually happened when it acted out of turn."</code></p> <p>I want to know how many times <code>"t a"</code> appears in the string. In that string, <code>"t a"</code> appears twice. I want my function to tell me it appeared twice. Is this possible?</p>

<pre class="prettyprint"><code>import re len(re.findall(pattern, string_to_search)) </code></pre>

Find out how many times a regex matches in a string in Python

2 Answers

import re len(re.findall(pattern, string_to_search))

118

answered Sep 21 '22 13:09

SilentGhost

The existing solutions based on findall are fine for non-overlapping matches (and no doubt optimal except maybe for HUGE number of matches), although alternatives such as sum(1 for m in re.finditer(thepattern, thestring)) (to avoid ever materializing the list when all you care about is the count) are also quite possible. Somewhat idiosyncratic would be using subn and ignoring the resulting string...:

def countnonoverlappingrematches(pattern, thestring):   return re.subn(pattern, '', thestring)[1]

the only real advantage of this latter idea would come if you only cared to count (say) up to 100 matches; then, re.subn(pattern, '', thestring, 100)[1] might be practical (returning 100 whether there are 100 matches, or 1000, or even larger numbers).

Counting overlapping matches requires you to write more code, because the built-in functions in question are all focused on NON-overlapping matches. There's also a problem of definition, e.g, with pattern being 'a+' and thestring being 'aa', would you consider this to be just one match, or three (the first a, the second one, both of them), or...?

Assuming for example that you want possibly-overlapping matches starting at distinct spots in the string (which then would give TWO matches for the example in the previous paragraph):

def countoverlappingdistinct(pattern, thestring):   total = 0   start = 0   there = re.compile(pattern)   while True:     mo = there.search(thestring, start)     if mo is None: return total     total += 1     start = 1 + mo.start()

Note that you do have to compile the pattern into a RE object in this case: function re.search does not accept a start argument (starting position for the search) the way method search does, so you'd have to be slicing thestring as you go -- definitely more effort than just having the next search start at the next possible distinct starting point, which is what I'm doing in this function.

answered Sep 18 '22 13:09

Alex Martelli

Related questions
                            
                                Why does this take so long to match? Is it a bug?
                            
                                Format strings vs concatenation
                            
                                How to declare a static attribute in Python?
                            
                                Scikit Learn - K-Means - Elbow - criterion
                            
                                Best way to choose a random file from a directory
                            
                                How to make new anaconda env from yml file
                            
                                How to position suptitle?
                            
                                How do I enumerate() over a list of tuples in Python?
                            
                                Efficiently finding the last line in a text file [duplicate]
                            
                                np arrays being immutable - "assignment destination is read-only"
                            
                                Is there an equivalent of Pythons range(12) in C#?
                            
                                Math operations from string [duplicate]
                            
                                How can I install the Beautiful Soup module on the Mac?
                            
                                Making Python's `assert` throw an exception that I choose
                            
                                Installing h5py on an Ubuntu server
                            
                                How to give delay between each requests in scrapy?
                            
                                File "/usr/bin/pip", line 9, in <module> from pip import main ImportError: cannot import name main
                            
                                Python Observer Pattern: Examples, Tips? [closed]
                            
                                Second y-axis label getting cut off
                            
                                How to convert bytes type to dictionary?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Find out how many times a regex matches in a string in Python

Tags:

python

regex

Dan

People also ask

2 Answers

SilentGhost

Alex Martelli

Recent Activity

Donate For Us