Python: How to use RegEx in an if statement?

`if re.search(r'pattern', string):`

Simple if-regex example:

if re.search(r'ing\b', "seeking a great perhaps"):     # any words end with ing?
    print("yes")

Complex if-regex example (pattern check, extract a substring, case insensitive):

Click to copy

match_object = re.search(r'^OUGHT (.*) BE$', "ought to be", flags=re.IGNORECASE)
if match_object:
    assert "to" == match_object.group(1)     # what's between ought and be?

Notes:

Use re.search() not re.match. Match restricts to the start of strings, a confusing convention if you ask me. If you do want a string-starting match, use caret or \A instead, re.search(r'^...', ...)
Use raw string syntax r'pattern' for the first parameter. Otherwise you would need to double up backslashes, as in re.search('ing\\b', ...)
In these examples, '\\b' or r'\b' is a special sequence meaning word-boundary for regex purposes. Not to be confused with '\b' or '\x08' backspace.
re.search() returns None if it doesn't find anything, which is always falsy.
re.search() returns a Match object if it finds anything, which is always truthy.
a group is what matched inside parentheses
group numbering starts at 1
Specs
Tutorial

The REPL makes it easy to learn APIs. Just run python, create an object and then ask for help:

Click to copy

$ python
>>> import re
>>> help(re.compile(r''))

at the command line shows, among other things:

search(...)

search(string[, pos[, endpos]]) --> match object or None. Scan through string looking for a match, and return a corresponding MatchObject instance. Return None if no position in the string matches.

so you can do

Click to copy

regex = re.compile(regex_txt, re.IGNORECASE)

match = regex.search(content)  # From your file reading code.
if match is not None:
  # use match

Incidentally,

Click to copy

regex_txt = "facebook.com"

has a . which matches any character, so re.compile("facebook.com").search("facebookkcom") is not None is true because . matches any character. Maybe

Click to copy

regex_txt = r"(?i)facebook\.com"

The \. matches a literal "." character instead of treating . as a special regular expression operator.

The r"..." bit means that the regular expression compiler gets the escape in \. instead of the python parser interpreting it.

The (?i) makes the regex case-insensitive like re.IGNORECASE but self-contained.

First you compile the regex, then you have to use it with match, find, or some other method to actually run it against some input.

Click to copy

import os
import re
import shutil

def test():
    os.chdir("C:/Users/David/Desktop/Test/MyFiles")
    files = os.listdir(".")
    os.mkdir("C:/Users/David/Desktop/Test/MyFiles2")
    pattern = re.compile(regex_txt, re.IGNORECASE)
    for x in (files):
        with open((x), 'r') as input_file:
            for line in input_file:
                if pattern.search(line):
                    shutil.copy(x, "C:/Users/David/Desktop/Test/MyFiles2")
                    break

Related questions
                            
                                Python how to exit main function [duplicate]
                            
                                gradient descent using python and numpy
                            
                                How to use openCV's connected components with stats in python?
                            
                                how to get request object in django unit testing?
                            
                                Drop all data in a pandas dataframe
                            
                                catching SQLAlchemy exceptions
                            
                                How to launch python Idle from a virtual environment (virtualenv)
                            
                                How do I properly set the Datetimeindex for a Pandas datetime object in a dataframe?
                            
                                How do I check if keras is using gpu version of tensorflow?
                            
                                Getting the date of 7 days ago from current date in python [closed]
                            
                                AttributeError: Module Pip has no attribute 'main'
                            
                                How can I start ipython running a script?
                            
                                pandas read_csv index_col=None not working with delimiters at the end of each line
                            
                                How do I remove the background from this kind of image?
                            
                                Python: read all text file lines in loop
                            
                                no module named urllib.parse (How should I install it?)
                            
                                Testing if a pandas DataFrame exists
                            
                                Django - How to do tuple unpacking in a template 'for' loop
                            
                                How to find out if Python is compiled with UCS-2 or UCS-4?
                            
                                How can I install the Python library 'gevent' on Mac OS X Lion

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python: How to use RegEx in an if statement?

Tags:

python

regex

People also ask

`if re.search(r'pattern', string):`

`search(...)`

Recent Activity

Donate For Us