Python regex multiple search

Tags:

regex

I need to search a string for multiple words.

import re

words = [{'word':'test1', 'case':False}, {'word':'test2', 'case':False}]

status = "test1 test2"

for w in words:
    if w['case']:
        r = re.compile("\s#?%s" % w['word'], re.IGNORECASE|re.MULTILINE)
    else:
        r = re.compile("\s#?%s" % w['word'], re.MULTILINE)
    if r.search(status):
        print "Found word %s" % w['word']

For some reason, this will only ever find "test2" and never "test1". Why is this?

I know I can use |-delimited alternation, but there could be hundreds of words, which is why I am using a for loop.

asked May 28 '11 18:05

2 Answers

There is no space before test1 in status, while your generated regular expressions require there to be a space.

You can modify the test to match either after a space or at the beginning of a line:

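for w in words:
    # Allow the word at the start of a line as well as after whitespace.
    # Raw strings keep the \s escape from being interpreted by Python itself.
    if w['case']:
        r = re.compile(r"(^|\s)#?%s" % w['word'], re.IGNORECASE|re.MULTILINE)
    else:
        r = re.compile(r"(^|\s)#?%s" % w['word'], re.MULTILINE)
    if r.search(status):
        print("Found word %s" % w['word'])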
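
A quick check, written as a sketch that reuses the question's status string, shows the difference between the two patterns. The names original and fixed are illustrative only and do not appear in the question.

```python
import re

status = "test1 test2"

# Pattern from the question: the word must be preceded by whitespace.
original = re.compile(r"\s#?test1", re.MULTILINE)
# Pattern from this answer: start of line or whitespace may precede the word.
fixed = re.compile(r"(^|\s)#?test1", re.MULTILINE)

original_hit = original.search(status)  # None, because nothing precedes "test1"
fixed_hit = fixed.search(status)        # matches at the start of the string

print(original_hit is None)   # True
print(fixed_hit.group(0))     # test1
```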

answered Oct 27 '22 15:10

As Martijn pointed out, there's no space before test1. Your code also matches words that merely begin with a search term: it would find test2blabla as an instance of test2, and I'm not sure that is what you want.

I suggest using the word-boundary anchor \b:

for w in words:
    if w['case']:
        r = re.compile(r"\b%s\b" % w['word'], re.IGNORECASE|re.MULTILINE)
    else:
        r = re.compile(r"\b%s\b" % w['word'], re.MULTILINE)
    if r.search(status):
        print("Found word %s" % w['word'])
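
To illustrate the effect, here is a small sketch built around a hypothetical helper, find_words, which is not part of the original code. It also passes each word through re.escape, an addition that only matters if a search term contains regex metacharacters. As in the question, a true 'case' value turns on re.IGNORECASE.

```python
import re

def find_words(words, status):
    """Return the search words that occur in status as whole words."""
    found = []
    for w in words:
        # re.MULTILINE is omitted here: it only affects ^ and $, not \b.
        flags = re.IGNORECASE if w['case'] else 0
        pattern = re.compile(r"\b%s\b" % re.escape(w['word']), flags)
        if pattern.search(status):
            found.append(w['word'])
    return found

words = [{'word': 'test1', 'case': False}, {'word': 'test2', 'case': False}]

print(find_words(words, "test1 test2"))        # ['test1', 'test2']
print(find_words(words, "test1 test2blabla"))  # ['test1']
```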

EDIT:

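I should've pointed out that if you really want to allow only the (whitespace)word or (whitespace)#word format, you cannot use \b. It matches at any transition between a word character and a non-word character, so it would also accept input such as -test1 or .test1.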
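
If that strict format is what you need, one possible approach (an alternative, not taken from either answer) is to use lookarounds: (?<!\S) asserts that the match starts at the beginning of the string or right after whitespace, and (?!\w) rejects longer words. The helper name strict_pattern and its ignore_case parameter are hypothetical.

```python
import re

def strict_pattern(word, ignore_case=False):
    """Match word, optionally prefixed by #, only after whitespace or at the start."""
    flags = re.IGNORECASE if ignore_case else 0
    # (?<!\S): no non-whitespace character directly before the match.
    # (?!\w):  no word character directly after the match.
    return re.compile(r"(?<!\S)#?%s(?!\w)" % re.escape(word), flags)

p = strict_pattern("test2")

print(bool(p.search("test1 test2")))        # True
print(bool(p.search("test1 #test2")))       # True
print(bool(p.search("test1 test2blabla")))  # False
print(bool(p.search("test1 -test2")))       # False
```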

answered Oct 27 '22 16:10

Norbert P.


Hanpan


Martijn Pieters
