I wanted to count the number of times that a string like 'aa' appears in 'aaa' (or 'aaaa'). The most obvious code gives the wrong (or at least, not the intuitive) answer: <pre class="prettyprint"><code>'aaa'.count('aa') 1 # should be 2 'aaaa'.count('aa') 2 # should be 3 </code></pre> Does anyone have a simple way to fix this?

From <code>str.count()</code> documentation: <blockquote> Return the number of non-overlapping occurrences of substring sub in the range [start, end]. Optional arguments start and end are interpreted as in slice notation. </blockquote> So, no. You are getting the expected result. If you want to count number of overlapping matches, use <code>regex</code>: <pre class="prettyprint"><code>>>> import re >>> >>> len(re.findall(r'(a)(?=\1)', 'aaa')) 2 </code></pre> This finds all the occurrence of <code>a</code>, which is followed by <code>a</code>. The 2nd <code>a</code> wouldn't be captured, as we've used look-ahead, which is zero-width assertion.

Python: how to count overlapping occurrences of a substring [duplicate]

Tags:

python

I wanted to count the number of times that a string like 'aa' appears in 'aaa' (or 'aaaa').

The most obvious code gives the wrong (or at least, not the intuitive) answer:

'aaa'.count('aa')
1 # should be 2
'aaaa'.count('aa')
2 # should be 3

Does anyone have a simple way to fix this?

554

asked Oct 10 '13 17:10

nivk

2 Answers

From str.count() documentation:

Return the number of non-overlapping occurrences of substring sub in the range [start, end]. Optional arguments start and end are interpreted as in slice notation.

So, no. You are getting the expected result.

If you want to count number of overlapping matches, use regex:

>>> import re
>>> 
>>> len(re.findall(r'(a)(?=\1)', 'aaa'))
2

This finds all the occurrence of a, which is followed by a. The 2nd a wouldn't be captured, as we've used look-ahead, which is zero-width assertion.

116

answered Sep 25 '22 01:09

Rohit Jain

haystack = "aaaa"
needle   = "aa"

matches  = sum(haystack[i:i+len(needle)] == needle 
               for i in xrange(len(haystack)-len(needle)+1))

# for Python 3 use range instead of xrange

answered Sep 25 '22 01:09

kindall

Related questions
                            
                                Is there any way to shorten this Python generator expression?
                            
                                Extracting sub-string after the first space in Python
                            
                                Flask + mod_wsgi: client denied by server configuration
                            
                                what is the difference between [[],[]] and [[]] * 2
                            
                                How to check all the elements in a list that has a specific requirement?
                            
                                Splitting list of python dictionaries by repeating dictionary key values
                            
                                Why does Django's send_mail not work during testing?
                            
                                python pack() and grid() methods together
                            
                                Get indices for all elements in an array in numpy
                            
                                Regex Apostrophe how to match?
                            
                                How to pass sys.argv[n] into a function in Python
                            
                                how to create a date object in python representing a set number of days
                            
                                ptrepack sortby needs 'full' index
                            
                                Numpy warning:Casting Complex to real discards imaginary part
                            
                                Viewing a list of all python operators via the interpreter
                            
                                In Python, how to test whether a line is the last one?
                            
                                Connect to an already running instance of chrome using selenium in python
                            
                                Usecase of |= in python
                            
                                plot decision boundary matplotlib
                            
                                List comprehension and function returning multiple values

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With