Find the nth occurrence of substring in a string

People also ask

How do you find the index of the nth occurrence of a substring in a string?

To find the index of nth occurrence of a substring in a string you can use String. indexOf() function. A string, say str2 , can occur in another string, say str1 , n number of times. There could be a requirement in your Java application, that you have to find the position of the nth occurrence of str2 in str1 .

How do you find all the occurrences of a substring in a string?

Use the string. count() Function to Find All Occurrences of a Substring in a String in Python. The string. count() is an in-built function in Python that returns the quantity or number of occurrences of a substring in a given particular string.

How do you find the nth occurrence of a character in a string in SQL?

T-SQL's CHARINDEX() function is useful for parsing out characters within a string. However, it only returns the first occurrence of a character. Over at SQL Server Central, there is a function that Cade Bryant wrote that returns the Nth occurrence of a character.

Here's a more Pythonic version of the straightforward iterative solution:

def find_nth(haystack, needle, n):
    start = haystack.find(needle)
    while start >= 0 and n > 1:
        start = haystack.find(needle, start+len(needle))
        n -= 1
    return start

Example:

>>> find_nth("foofoofoofoo", "foofoo", 2)
6

If you want to find the nth overlapping occurrence of needle, you can increment by 1 instead of len(needle), like this:

def find_nth_overlapping(haystack, needle, n):
    start = haystack.find(needle)
    while start >= 0 and n > 1:
        start = haystack.find(needle, start+1)
        n -= 1
    return start

Example:

>>> find_nth_overlapping("foofoofoofoo", "foofoo", 2)
3

This is easier to read than Mark's version, and it doesn't require the extra memory of the splitting version or importing regular expression module. It also adheres to a few of the rules in the Zen of python, unlike the various re approaches:

Simple is better than complex.
Flat is better than nested.
Readability counts.

Mark's iterative approach would be the usual way, I think.

Here's an alternative with string-splitting, which can often be useful for finding-related processes:

def findnth(haystack, needle, n):
    parts= haystack.split(needle, n+1)
    if len(parts)<=n+1:
        return -1
    return len(haystack)-len(parts[-1])-len(needle)

And here's a quick (and somewhat dirty, in that you have to choose some chaff that can't match the needle) one-liner:

'foo bar bar bar'.replace('bar', 'XXX', 1).find('bar')

This will find the second occurrence of substring in string.

def find_2nd(string, substring):
   return string.find(substring, string.find(substring) + 1)

Edit: I haven't thought much about the performance, but a quick recursion can help with finding the nth occurrence:

def find_nth(string, substring, n):
   if (n == 1):
       return string.find(substring)
   else:
       return string.find(substring, find_nth(string, substring, n - 1) + 1)

Understanding that regex is not always the best solution, I'd probably use one here:

>>> import re
>>> s = "ababdfegtduab"
>>> [m.start() for m in re.finditer(r"ab",s)]
[0, 2, 11]
>>> [m.start() for m in re.finditer(r"ab",s)][2] #index 2 is third occurrence 
11

Related questions
                            
                                Iterating over a numpy array
                            
                                Group by & count function in sqlalchemy
                            
                                Pickle or json?
                            
                                Why is "if not someobj:" better than "if someobj == None:" in Python?
                            
                                Python executable not finding libpython shared library
                            
                                Adding headers to requests module
                            
                                How do I install opencv using pip?
                            
                                How to plot normal distribution?
                            
                                Multiprocessing : use tqdm to display a progress bar
                            
                                How do I perform HTML decoding/encoding using Python/Django?
                            
                                Combining two lists and removing duplicates, without removing duplicates in original list
                            
                                Where is virtualenvwrapper.sh after pip install?
                            
                                Create nice column output in python
                            
                                Regular expression to return text between parenthesis
                            
                                Shared-memory objects in multiprocessing
                            
                                Creating a dynamic choice field
                            
                                Is there a way to auto-adjust Excel column widths with pandas.ExcelWriter?
                            
                                subtract two times in python
                            
                                Insert an element at a specific index in a list and return the updated list
                            
                                Good ways to sort a queryset? - Django

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Find the nth occurrence of substring in a string

Tags:

python

string

substring

People also ask

Recent Activity

Donate For Us