Search and replace with "whole word only" option [duplicate]

I have a script that runs over my text and searches and replaces all the sentences I write, based on a database.

The script:

with open('C:/Users/User/Desktop/Portuguesetranslator.txt') as f:
    for l in f:
        s = l.split('*')
        editor.replace(s[0],s[1])

And an example of the database:

Event*Evento*
result*resultado*

And so on...

The problem is that I need a "whole word only" option in that script, because the plain replace is causing problems.

For example, with Result and Event: after they are replaced with Resultado and Evento, running the script on the text one more time replaces the Resultado and Evento again.

The result after the second run looks like this: Resultadoado and Eventoo.

Note that this is not only about Event and Result; there are more than 1000 sentences that I have already set up for the search and replace.

I don't need a simple search and replace for two words, because I'm going to be editing the database over and over for different sentences.

asked Jul 18 '13 18:07

Renan Cidale

4 Answers

You want a regular expression. You can use the token \b to match a word boundary: i.e., \bresult\b would match only the exact word "result."

import re

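# assumes editor holds the text as an ordinary string
with open('C:/Users/User/Desktop/Portuguesetranslator.txt') as f:
    for l in f:
        s = l.split('*')
        # re.escape keeps any regex metacharacters in the search term literal
        editor = re.sub(r"\b%s\b" % re.escape(s[0]), s[1], editor)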
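
A self-contained variant of this loop, as a minimal sketch: it assumes the text is held in an ordinary Python string and that every database line has the form search*replacement*, as in the question. The names apply_database, text and lines are hypothetical. Running it twice shows that the second pass leaves the text unchanged.

```python
import re

def apply_database(text, lines):
    """Apply whole-word replacements taken from 'search*replacement*' lines."""
    for line in lines:
        parts = line.rstrip("\n").split("*")
        if len(parts) < 2 or not parts[0]:
            continue  # skip blank or malformed lines
        search, replacement = parts[0], parts[1]
        # re.escape keeps characters such as '.' or '?' in the search term literal
        pattern = r"\b" + re.escape(search) + r"\b"
        # a function replacement inserts the text as-is, without backslash processing
        text = re.sub(pattern, lambda m, r=replacement: r, text)
    return text

# In-memory stand-in for the database file (assumed format)
lines = ["Event*Evento*\n", "result*resultado*\n"]
once = apply_database("The Event had a result.", lines)
twice = apply_database(once, lines)
print(once)
print(twice)
```

Note that \b only behaves as a whole-word test when the search term starts and ends with a letter, digit or underscore.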

answered Sep 22 '22 23:09

kindall

Use re.sub:

import re

replacements = {'the':'a',
                'this':'that'}

def replace(match):
    # look up the replacement for whichever word matched
    return replacements[match.group(0)]

# notice that the 'this' in 'thistle' is not matched
print(re.sub('|'.join(r'\b%s\b' % re.escape(s) for s in replacements),
             replace, 'the cat has this thistle.'))

Prints

a cat has that thistle.

Notes:

All the strings to be replaced are joined into a single pattern so that the string needs to be looped over just once.
The source strings are passed to re.escape to avoid interpreting them as regular expressions.
The words are surrounded by r'\b' to make sure matches are for whole words only.
A replacement function is used so that any match can be replaced.
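
The notes above can be sketched for a larger mapping, such as one loaded from the question's database. This is a minimal sketch, not a definitive implementation: build_replacer and the sample mapping are hypothetical, and the mapping is assumed to be non-empty. Keys are sorted longest first so that a multi-word phrase is tried before a shorter key it begins with.

```python
import re

def build_replacer(mapping):
    """Return a function that replaces whole-word keys of mapping in one pass."""
    # Longest keys first, so "the result" is tried before "the" or "result"
    keys = sorted(mapping, key=len, reverse=True)
    pattern = re.compile("|".join(r"\b%s\b" % re.escape(k) for k in keys))
    # The matched text is exactly one of the keys, so it can be looked up directly
    return lambda text: pattern.sub(lambda m: mapping[m.group(0)], text)

mapping = {"Event": "Evento", "result": "resultado", "the result": "o resultado"}
translate = build_replacer(mapping)
out = translate("Event: the result is a result.")
print(out)             # Evento: o resultado is a resultado.
print(translate(out))  # unchanged on a second run
```

Compiling the pattern once keeps repeated runs cheap when the database holds a thousand or more entries.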

answered Sep 21 '22 23:09

Steven Rumbalski

Use re.sub instead of a normal string replace to replace only whole words. That way your script, even if it runs again, will not replace the already replaced words.

>>> import re
>>> editor = "This is result of the match"
>>> new_editor = re.sub(r"\bresult\b","resultado",editor)
>>> new_editor
'This is resultado of the match'
>>> newest_editor = re.sub(r"\bresult\b","resultado",new_editor)
>>> newest_editor
'This is resultado of the match'
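
For contrast, a short sketch that applies both approaches twice to the same sentence; the variable names are only illustrative. The plain substring replacement reproduces the doubled suffix from the question, while the whole-word pattern does not.

```python
import re

text = "This is result of the match"

# Plain substring replacement, applied twice:
# the second pass hits the "result" inside "resultado"
plain = text.replace("result", "resultado").replace("result", "resultado")

# Whole-word replacement, applied twice:
# the second pass finds nothing to change
whole = re.sub(r"\bresult\b", "resultado", text)
whole = re.sub(r"\bresult\b", "resultado", whole)

print(plain)  # This is resultadoado of the match
print(whole)  # This is resultado of the match
```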

answered Sep 23 '22 23:09

DhruvPathak

It is very simple: use re.sub instead of replace.

import re
replacements = {r'\bthe\b':'a',
                r'\bthis\b':'that'}

def replace_all(text, dic):
    # each key is already a whole-word regex pattern
    for i, j in dic.items():
        text = re.sub(i,j,text)
    return text

print(replace_all("the cat has this thistle.", replacements))

It will print

a cat has that thistle.

answered Sep 21 '22 23:09

Sudharsan
