remove single character in string

Question

Looking for a regex that will remove single characters from a string, with a few conditions. One regex will remove all single characters in a string and the other regex will only remove single characters in between the first and last character. See samples below.

Remove all single characters:

Before

names <- c("John C. Smith", "Chris T. Anderson", "Mary H. Jane",
           "J. J. Smith", "J. Thomas")

After:

"John Smith", "Chris Anderson", "Mary Jane", "Smith", "Thomas"

Removes single characters, excludes the first and last characters

Before

names <- c("John C. Smith", "Chris T. Anderson", "Mary H. Jane",
           "J. J. Smith", "J. Thomas")

After:

"John Smith", "Chris Anderson", "Mary Jane", "J. J. Smith", "J. Thomas"

G5W · Accepted Answer

Edited because I Missed part of the question

gsub can delete a pattern from your data. Here, we remove single characters that have multiple character strings both before and after.

gsub("(\w\w)\W+\w\W+(\w\w)", "\1 \2", names)
[1] "John Smith"     "Chris Anderson" "Mary Jane"   "J. J. Smith" "J. Thomas"

To get rid of all of them.

gsub("\W*\b\w\b\W*", " ", names)
[1] "John Smith"     "Chris Anderson" "Mary Jane"      "  Smith"        " Thomas"

akrun · Answer

Here is another option

gsub("\b[A-Z][[:punct:]]\s*", "", names)
#[1] "John Smith"     "Chris Anderson" "Mary Jane"      "Smith"         
#[5] "Thomas"

Or for the second case

sub("(\w+)\s+([A-Z][[:punct:]]\s*){1,}", "\1 ", names)
#[1] "John Smith"     "Chris Anderson" "Mary Jane"      "J. J. Smith"   
#[5] "J. Thomas"

remove single character in string

Tags:

regex

r

Remove all single characters:

Removes single characters, excludes the first and last characters

DCRubyHound

2 Answers

G5W

akrun

Recent Activity

Donate For Us

remove single character in string

Tags:

regex

r

Remove all single characters:

Removes single characters, excludes the first and last characters

DCRubyHound

2 Answers

G5W

akrun

Related questions

Recent Activity

Donate For Us