Lots of ready-to-use character classes are available in Perl regular expressions, such as <code>\d</code> or <code>\S</code>, or new-fangled Unicode grokkers such as <code>\p{P}</code>, which matches punctuation characters. Now let's say I'd like to match all punctuation characters <code>\p{P}</code> (quite a number of them, and not something you want to type in by hand) - all but one, all but the good old komma (or comma, <code>,</code>). Is there a way to specify this requirement short of expanding the handy character class and taking away the komma by hand?

<pre class="prettyprint"><code>$ unichars -au '\p{P}' | wc -l 598 </code></pre> Double negation: <pre class="prettyprint"><code>/[^\P{P},]/ $ unichars -au '[^\P{P},]' | wc -l 597 </code></pre> "And" through lookahead/lookbehind: <pre class="prettyprint"><code>/\p{P}(?<!,)/ $ unichars -au '\p{P}(?<!,)' | wc -l 597 </code></pre> <code>unichars</code>

Use ready-made character class and restrict it further

1 Answers

$ unichars -au '\p{P}' | wc -l
598

Double negation:

/[^\P{P},]/

$ unichars -au '[^\P{P},]' | wc -l
597

"And" through lookahead/lookbehind:

/\p{P}(?<!,)/

$ unichars -au '\p{P}(?<!,)' | wc -l
597

unichars

172

answered Nov 12 '22 16:11

ikegami

Related questions
                            
                                Easiest way to convert "a/b/c" to ["a/b/c", "a/b", "a"]
                            
                                How to replace by regular expression to lowercase in python
                            
                                Regex for Username?
                            
                                What's the Regular Expression that matches culture names?
                            
                                Regex to replace invalid characters
                            
                                PHP multiple new lines
                            
                                Unexpected RegexStringValidator failure in ConfigurationProperty in custom ConfigurationElement
                            
                                Replace String.Replace with Regex.Replace
                            
                                What is the regex expression for CDATA
                            
                                How do I check if a named group exists in a MatchData object?
                            
                                how to make regexp not hungry with quotes?
                            
                                Replace text inside of square brackets
                            
                                How to Use AND in Ruby Regex
                            
                                Powershell Parsing Help - How to output a list of folder names into a text file
                            
                                JSLint Regexp Violation Quandry
                            
                                How to search text surrounded by double-quotes with RegEx?
                            
                                Javascript Regex with .test()
                            
                                Ant replaceregexp task - Match and replace HTML comments block
                            
                                C# Regex, Unrecognized escape sequence
                            
                                Regular expression for upper case letter only in JavaScript

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Use ready-made character class and restrict it further

Tags:

regex

unicode

perl

Lumi

People also ask

1 Answers

ikegami

Recent Activity

Donate For Us