Find the index of the end of a word in Python

Tags: python, string, find, indexing

Is there a nicer way to find the end index of a word in a string?

My current method is:

text = "fed up of seeing perfect fashion photographs"
word = "fashion"
wordEndIndex = text.index(word) + len(word) - 1

asked Dec 02 '15 17:12

Prag

3 Answers

It depends whether you really want to know the end index or not. Presumably you're actually more interested in the bits of the text after that? Are you then doing something like this?

>>> text[wordEndIndex:]
'n photographs'

If you really do need the index, then do what you've done, but wrap it inside a function that you can call for different texts and words so you don't have to repeat this code. Then it's simple and understandable, if you give the function a descriptive name.
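A minimal sketch of such a wrapper follows. The name find_word_end and the choice to return -1 when the word is missing (rather than raising, as text.index does) are assumptions, not something the question specifies:

```python
def find_word_end(text, word):
    """Return the index of the last character of the first occurrence
    of word in text, or -1 if word is empty or does not occur."""
    start = text.find(word)  # str.find returns -1 instead of raising
    if start == -1 or not word:
        return -1
    return start + len(word) - 1


text = "fed up of seeing perfect fashion photographs"
print(find_word_end(text, "fashion"))  # 31
```

Returning -1 mirrors str.find; raising ValueError to mirror str.index would be an equally reasonable design.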

On the other hand, if you're more interested in the bits of text, then don't even bother working out what the index is:

>>> text.split(word)
['fed up of seeing perfect ', ' photographs']

Of course this will get more complicated if the word can appear more than once in the text. In that case maybe you could define a different function to split on the first occurrence of the word and just give back the before and after components, without ever returning any numerical indexes.
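One way to sketch that helper is with str.partition, which splits only at the first occurrence. The name split_on_first, the None return for a missing word, and the sample sentence are illustrative assumptions:

```python
def split_on_first(text, word):
    """Split text around the first occurrence of word.

    Returns (before, after), or None if word does not occur.
    Note: str.partition raises ValueError if word is empty.
    """
    before, found, after = text.partition(word)
    if not found:  # partition returns an empty separator when nothing matched
        return None
    return before, after


sample = "fashion fades, but fashion photographs remain"
print(split_on_first(sample, "fashion"))
# ('', ' fades, but fashion photographs remain')
```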

answered Oct 17 '22 08:10

Constance

I cannot comment on whether this is a better way, but an alternative to what you have suggested would be to find the next space after that word and use that to get the index.

text = "fed up of seeing perfect fashion photographs"
word = "fashion"
temp = text.index(word)
wordEndIndex = temp + text[temp:].index(' ') - 1

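Your approach seems more natural, and is possibly faster too. Note that this variant raises ValueError when no space follows the word (for example, when it is the last word in the text), and if word matches only the start of a longer word, it returns the end of that longer word.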
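If the goal is to match only whole words, a regular expression with word boundaries is one possible alternative. This is a sketch under the assumption that word consists of word characters (letters, digits, underscore); the function name whole_word_end_index is hypothetical:

```python
import re


def whole_word_end_index(text, word):
    """Return the index of the last character of the first whole-word
    occurrence of word in text, or -1 if there is none."""
    # re.escape guards against regex metacharacters in word;
    # \b only behaves as a word boundary next to word characters.
    pattern = r"\b" + re.escape(word) + r"\b"
    match = re.search(pattern, text)
    return match.end() - 1 if match else -1


text = "fed up of seeing perfect fashion photographs"
print(whole_word_end_index(text, "fashion"))  # 31
print(whole_word_end_index(text, "fash"))     # -1 (only a prefix of a word)
```

Unlike the space-based search, this also works when the word is the last one in the text or is followed by punctuation.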

answered Oct 17 '22 07:10

Antimony

Just for fun, here's a first-principles version that finds the index of the last character of the word in a single pass:

def word_end_index(text, word):
    wi = wl = len(word)
    for ti, tc in enumerate(text):
        wi = wi - 1 if tc == word[-wi] else wl
        if not wi:
            return ti

    return -1

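I had some shorter versions, but they used slices, which copy substrings at every step and are rather inefficient. One caveat: on a mismatch the function restarts the match without re-examining the current character, so it can miss an occurrence that begins inside a failed partial match (for example, "fashion" in "ffashion" returns -1).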
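A single-pass search that copes with overlapping partial matches needs to remember how much of the word is still matched after a mismatch. The following is a sketch using the Knuth-Morris-Pratt technique, which is a swapped-in algorithm rather than a repair of the function above; the name word_end_index_kmp and the -1 result for an empty word are assumptions:

```python
def word_end_index_kmp(text, word):
    """Return the index of the last character of the first occurrence
    of word in text, or -1 if word is empty or does not occur."""
    if not word:
        return -1
    # failure[i] is the length of the longest proper prefix of
    # word[:i + 1] that is also a suffix of it.
    failure = [0] * len(word)
    k = 0
    for i in range(1, len(word)):
        while k and word[i] != word[k]:
            k = failure[k - 1]
        if word[i] == word[k]:
            k += 1
        failure[i] = k
    matched = 0  # number of characters of word currently matched
    for ti, tc in enumerate(text):
        # On a mismatch, fall back to the longest shorter partial match.
        while matched and tc != word[matched]:
            matched = failure[matched - 1]
        if tc == word[matched]:
            matched += 1
        if matched == len(word):
            return ti
    return -1


print(word_end_index_kmp("ffashion", "fashion"))  # 7
```

It still reads each character of text once and uses no slices, at the cost of a small table proportional to the length of word.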

answered Oct 17 '22 09:10

Robin Hilliard
