Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

how to remove words of specific length in a string in R?

I want to remove words of length less than 3 in a string. for example my input is

str<- c("hello RP have a nice day")

I want my output to be

str<- c("hello have nice day")

Please help

like image 790
areddy Avatar asked Oct 20 '15 01:10

areddy


1 Answers

Try this:

gsub('\\b\\w{1,2}\\b','',str)
[1] "hello  have  nice day"

EDIT \b is word boundary. If need to drop extra space,change it as:

gsub('\\b\\w{1,2}\\s','',str)

Or

gsub('(?<=\\s)(\\w{1,2}\\s)','',str,perl=T)
like image 101
Shenglin Chen Avatar answered Sep 23 '22 18:09

Shenglin Chen