Using a variable within a regular expression in Pandas str.contains()

Tags:

I'm attempting to select rows from a dataframe using the pandas str.contains() function with a regular expression that contains a variable as shown below.

df = pd.DataFrame(["A test Case","Another Testing Case"], columns=list("A"))
variable = "test"
df[df["A"].str.contains(r'\b' + variable + '\b', regex=True, case=False)] #Returns nothing

While the above returns nothing, the following returns the appropriate row as expected

df[df["A"].str.contains(r'\btest\b', regex=True, case=False)] #Returns values as expected

Any help would be appreciated.

285

asked Dec 04 '18 22:12

neanderslob

1 Answers

Both word boundary characters must be inside raw strings. Why not use some sort of string formatting instead? String concatenation as a rule is generally discouraged.

df[df["A"].str.contains(fr'\b{variable}\b', regex=True, case=False)] 
# Or, 
# df[df["A"].str.contains(r'\b{}\b'.format(variable), regex=True, case=False)] 

             A
0  A test Case

186

answered Oct 29 '22 21:10

cs95

Related questions
                            
                                Difference between get and dunder getitem [duplicate]
                            
                                from torch._C import * ImportError: DLL load failed: The specified module could not be found
                            
                                How to create new column in Pandas with condition to repeat by a value of another column?
                            
                                How to override the html default "Please fill out this field" when validation fails in Flask?
                            
                                pip install fail with SSL certificate verify failed (_ssl.c:833)
                            
                                Python script stops running when screen turns off
                            
                                Can't install Tensorflow Mac
                            
                                Sklearn Chi2 For Feature Selection
                            
                                Class weights for balancing data in TensorFlow Object Detection API
                            
                                Writing a Large JSON Array To File
                            
                                Using Geopandas, how do I select all points not within a polygon?
                            
                                How to use PyInstaller from script, not terminal?
                            
                                How to capitalize first letter in strings that may contain numbers
                            
                                How to get slope from timeseries data in pandas?
                            
                                Legend with vertical line in matplotlib
                            
                                Installed pytest but running `pytest` in bash returns `not found`
                            
                                How can I select specific fields in django rest framework? [duplicate]
                            
                                MultiThreading in AWS lambda using Python3
                            
                                Compiling cython with gcc: No such file or directory from #include "ios"
                            
                                Is it possible to use spacy with already tokenized input?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Using a variable within a regular expression in Pandas str.contains()

Tags:

python

regex

contains

pandas

neanderslob

People also ask

1 Answers

cs95

Recent Activity

Donate For Us