How to make a portable regex?

1 Answer

There is no standard, but if maximum portability is your goal you should stick to the features supported by JavaScript regexes. All of the other major flavors support everything JS does, with only minor variations here and there. For example, some only support the POSIX character-class notation ([:alpha:]), while others use the Unicode syntax (\p{Alpha}).
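As a rough illustration of sidestepping the character-class notation problem, one option is to spell out the range explicitly instead of using either [:alpha:] or \p{Alpha}. The sketch below uses Python and assumes ASCII letters are all you need; the name ASCII_LETTERS and the sample string are made up for the example.

```python
import re

# An explicit range is understood by essentially every Perl-derived flavor,
# unlike [[:alpha:]] or \p{Alpha}. Assumption: only ASCII letters matter.
ASCII_LETTERS = re.compile(r"[A-Za-z]+")

# Accented letters are NOT covered by the explicit range, so they split words.
words = ASCII_LETTERS.findall("abc 123 déjà")
print(words)
```

The trade-off is plain: the explicit range is portable but drops non-ASCII letters, which is exactly what the Unicode property syntax exists to handle.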

Probably the most troublesome variations are those that affect the dot (.) and the anchors (^ and $). For example, JavaScript had no DOTALL (or "single-line") mode until ES2018 added the s flag, so to match anything including a newline in older engines you have to use a hack like [\s\S]. Meanwhile, Ruby has a DOTALL mode but calls it multiline mode; what everyone else calls "multiline" (^ and $ as line anchors) is how Ruby always works.
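Here is a small sketch of the [\s\S] trick and of the anchor modes, written in Python only because it is easy to run; the sample text and variable names are invented for the example, and the flag names shown are Python's own, not something other flavors share.

```python
import re

text = "first\nsecond"

# By default the dot stops at a linefeed, so this cannot span both lines.
no_match = re.search(r"first.second", text)

# [\s\S] matches any character at all and needs no mode flag.
portable = re.search(r"first[\s\S]second", text)

# Python's DOTALL flag does the same job, but the flag's name and
# spelling differ from flavor to flavor.
flagged = re.search(r"first.second", text, re.DOTALL)

# Without MULTILINE, ^ matches only at the very start of the string.
default_anchor = re.findall(r"^\w+", text)

# With MULTILINE, ^ also matches after each linefeed.
line_anchor = re.findall(r"^\w+", text, re.MULTILINE)

print(no_match, portable.group(0) == text, default_anchor, line_anchor)
```

Since [\s\S] needs no flag, the same pattern text can usually be carried to another flavor unchanged, which is the point of the hack.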

Be aware, too, of exactly what the dot doesn't match (in the default mode). Traditionally that was just the linefeed (\n), but more and more flavors are adopting (or at least approximating) the Unicode guidelines concerning line separators. For example, in Java the dot doesn't match any of [\r\n\u0085\u2028\u2029], while ^ and $ treat \r\n as a single separator and won't match between the two characters.
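To see how much this varies, the following sketch checks what Python's dot matches by default and contrasts it with an explicit negated class built from the Java list quoted above. The names candidates and NOT_LINE_BREAK are hypothetical, chosen just for this example.

```python
import re

# The line-separator characters from the Java list above.
candidates = {
    "LF": "\n",
    "CR": "\r",
    "NEL": "\u0085",
    "LS": "\u2028",
    "PS": "\u2029",
}

# Python's dot (on str patterns, default mode) excludes only the linefeed.
dot_matches = {
    name: re.fullmatch(r".", ch) is not None
    for name, ch in candidates.items()
}

# Spelling the exclusions out makes the behavior independent of the
# flavor's own idea of what a line separator is.
NOT_LINE_BREAK = r"[^\r\n\u0085\u2028\u2029]"
explicit_matches = {
    name: re.fullmatch(NOT_LINE_BREAK, ch) is not None
    for name, ch in candidates.items()
}

print(dot_matches)
print(explicit_matches)
```

If the exact set of excluded characters matters to you, the explicit class is the safer choice, assuming the target flavor accepts \uXXXX escapes inside a character class.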

Note that I'm only talking about Perl-derived flavors, like Python, Ruby, PHP, JavaScript, etc. It wouldn't make sense to include GNU- or POSIX-based flavors like grep, awk, and MySQL; they tend to have fewer features, but that's not what you would choose them for anyway.

I'm also not including the XML Schema flavor; it's much more limited than JavaScript, but it's a specialized application. For example, it doesn't support the anchors (^, $, \A, \Z, etc.) because matches are always anchored at both ends.
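The "always anchored at both ends" behavior can be approximated in Python with re.fullmatch, which is used here only as an analogy and is not the XML Schema engine itself. The pattern and sample strings are invented for the example.

```python
import re

pattern = r"[0-9]{3}"

# An unanchored search finds the digits anywhere inside the string.
found_inside = re.search(pattern, "ab123cd") is not None

# fullmatch behaves as if the pattern were wrapped in \A...\Z,
# roughly how an XML Schema pattern is applied to a whole value.
whole_rejected = re.fullmatch(pattern, "ab123cd") is None
whole_accepted = re.fullmatch(pattern, "123") is not None

print(found_inside, whole_rejected, whole_accepted)
```

A pattern written for an implicitly anchored flavor therefore needs explicit anchors (or a full-match call) when moved to a flavor that searches by default.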

answered Oct 10 '22 00:10

Alan Moore


Tags:

regex

portability

dugres
