df=pd.DataFrame({"A":["one","two","three"],"B":["fopur","give","six"]})
when I do,
df.B.str.contains("six").any()
out[2]=True
when I do,
df.B.str.contains("six)").any()
I am getting the below error,
C:\ProgramData\Anaconda3\lib\sre_parse.py in parse(str, flags, pattern)
868 if source.next is not None:
869 assert source.next == ")"
--> 870 raise source.error("unbalanced parenthesis")
871
872 if flags & SRE_FLAG_DEBUG:
error: unbalanced parenthesis at position 3
Please help!
You can set regex=False
in in pandas.Series.str.contains
:
df.B.str.contains("six)", regex=False).any()
If you want to match irrespective of case,
df.B.str.contains("Six)", case=False, regex=False).any()
out[]: True
https://pandas.pydata.org/pandas-docs/stable/generated/pandas.Series.str.contains.html
Info:
Parenthesis are special characters in regular expressions that need to be "escaped", see for example here or here.
You need escape )
by \
because special regex character:
df.B.str.contains("six\)").any()
More general:
import re
df.B.str.contains(re.escape("six)")).any()
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With