I'm attempting to match the last character in a WORD. A WORD is a sequence of non-whitespace characters '[^\n\r\t\f ]', or an empty line matching ^$. The expression I made to do this is: "[^ \n\t\r\f]$?:[ \$\n\t\r\f]$" The regex matches a non-whitespace character that follows a whitespace character or the end of the line. But I don't know how to stop it from excluding the following whitespace character from the result and why it doesn't seem to capture a character preceding the end of the line. Using the string "Hi World!", I would expect: the "i" and "!" to be captured. Instead I get: "i ". What steps can I take to solve this problem?

"Word" that is a sequence of non-whitespace characters scenario Note that a non-capturing group <code>(?:...)</code> in <code>[^ \n\t\r\f](?:[ \$\n\t\r\f])</code> still matches (consumes) the whitespace char (thus, it becomes a part of the match) and it does not match at the end of the string as the <code>$</code> symbol is not a string end anchor inside a character class, it is parsed as a literal <code>$</code> symbol. You may use <pre class="prettyprint"><code>\S(?!\S) </code></pre> See the regex demo The <code>\S</code> matches a non-whitespace char that is not followed with a non-whitespace char (due to the <code>(?!\S)</code> negative lookahead). General "word" case If a word consists of just letters, digits and underscores, that is, if it is matched with <code>\w+</code>, you may simply use <pre class="prettyprint"><code>\w\b </code></pre> Here, <code>\w</code> matches a "word" char, and the word boundary asserts there is no word char right after. See another regex demo.

regex last character of a WORD

1 Answers

"Word" that is a sequence of non-whitespace characters scenario

Note that a non-capturing group (?:...) in [^ \n\t\r\f](?:[ \$\n\t\r\f]) still matches (consumes) the whitespace char (thus, it becomes a part of the match) and it does not match at the end of the string as the $ symbol is not a string end anchor inside a character class, it is parsed as a literal $ symbol.

You may use

\S(?!\S)

See the regex demo

The \S matches a non-whitespace char that is not followed with a non-whitespace char (due to the (?!\S) negative lookahead).

General "word" case

If a word consists of just letters, digits and underscores, that is, if it is matched with \w+, you may simply use

\w\b

Here, \w matches a "word" char, and the word boundary asserts there is no word char right after.

See another regex demo.

answered Sep 20 '22 23:09

Wiktor Stribiżew

Related questions
                            
                                Regex, find pattern only in middle of string
                            
                                Grails g:tags does not support the doller($) sign in regex for pattern ? why?
                            
                                Case-insensitive hash-keys in Regexp::Grammars
                            
                                Numeric value directly after backreference [duplicate]
                            
                                Java match whole word in String
                            
                                R tm substitute words in Corpus using gsub
                            
                                How to replace curly braces and its contents in a string
                            
                                Regex returning complete line instead of match
                            
                                Java String replaceAll regex to remove everything except digits, dots and spaces
                            
                                IIS ReWrite Rule to remove query string when contains specific specific query string
                            
                                Match same number of repetitions of character as repetitions of captured group
                            
                                remove single character in string
                            
                                Powershell wildcard / regex replace
                            
                                Regex in Java: match groups until first symbol occurrence
                            
                                Regex get the text after the match which must be the last occurrence
                            
                                Regex match 4 bytes unicode characters
                            
                                How do I capture match-groups of alternation of a regular expression with split?
                            
                                replace multiple spaces by non breaking spaces
                            
                                JAVA - replaceAll in a regex with $1
                            
                                Swift 3 replacingOccurrences regex

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

regex last character of a WORD

Tags:

string

regex

Aquaactress

People also ask

1 Answers

Wiktor Stribiżew

Recent Activity

Donate For Us