regex: "(^|)" vs "(|^)"

Tags:

r

I have a very special question concerning regular expressions in R:

grepl("(|^)over","stackoverflow")
# [1] TRUE

grepl("(^|)over","stackoverflow")
# [1] FALSE

grepl("(^|x|)over","stackoverflow")
# [1] FALSE

grepl("(x|^|)over","stackoverflow")
# [1] FALSE

grepl("(x||^)over","stackoverflow")
# [1] TRUE

Why do not all those expressions evaluate to TRUE?

604

asked Mar 09 '16 23:03

1 Answers

POSIX regular expressions actually should make all those True. It appears that R uses a slightly modified version of Ville Laurikari's TRE library that doesn't quite follow the standard. I'd follow @rawr's recommendations and use perl = TRUE for more compliant regular expressions.

See also: When both halves of an OR regex group match, is it defined which will be chosen?

102

answered Oct 22 '22 22:10

Allen Luce

Related questions
                            
                                How do you reject a string if preceded by another string using standard POSIX regex?
                            
                                Repeatable, complex regular expression, with dot '.' delimited separators
                            
                                looping through scan and replacing matches individually
                            
                                awk FPAT variable: Working
                            
                                Detect and alter strings in PDFs
                            
                                Regular expression to match only if there are N unique characters
                            
                                Exclude strings of pattern "abba"
                            
                                preg_match :print: class matches tab character
                            
                                Regex match non-greedy on one optional string and greedy on another
                            
                                split line via regex in javascript?
                            
                                Remove variable wrapped in function from model formula in R
                            
                                Use Pandas string method 'contains' on a Series containing lists of strings
                            
                                is_date() is malfunctioning
                            
                                Confused with the locale settings in R
                            
                                Parsing scutil output with perl
                            
                                Perl split function - use repeating characters as delimiter
                            
                                Using regex to replace parameters in a string
                            
                                Regular Expression for Separating Paths [duplicate]
                            
                                Parse string with whitespace and quotation mark (with quotation mark retained)
                            
                                How to parse JSON-XML hybrid file in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

regex: "(^|)" vs "(|^)"

Tags:

regex

r

Daniel Gerigk

People also ask

1 Answers

Allen Luce

Recent Activity

Donate For Us