Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

Split vector of strings and paste subset of resulting elements into a new vector

Tags:

string

split

r

vector

Define

z<- as.character(c("1_xx xx xxx_xxxx_12_sep.xls","2_xx xx xxx_xxxx_15_aug.xls"))

such that

> z
[1] "1_xx xx xxx_xxxx_12_sep.xls" "2_xx xx xxx_xxxx_15_aug.xls"

I want to create a vector w such that

> w
[1] "1_12_sep" "2_15_aug"

That is, split each element of z by _ and then join elements 1,4,5, with the .xls removed from the latter.

I can manage the split part, but not sure what function to provide, e.g something like"

w <- as.character(lapply(strsplit(z,"_"), function(x) ???))

like image

328

asked Jun 17 '11 21:06

Fred

Video Answer

2 Answers

You can do this using a combination of strsplit, substr and lapply:

y <- strsplit(z,"_",fixed=TRUE)
lapply(y,FUN=function(x){paste(x[1],x[4],substr(x[5],1,3),sep="_")})

like image

70

answered Oct 21 '22 01:10

joran

Using a bit of magic in the stringr package: I separately extract the left and right date fields, combine them, and finally remove the .xls at the end.

library(stringr)
l <- str_extract(z, "\\d+_")
r <- str_extract(z, "\\d+_\\w*\\.xls")
gsub(".xls", "", paste(l, r, sep=""))

[1] "1_12_sep" "2_15_aug"

str_extract is a wrapper around some of the base R functions which I find easier to use.

Edit Here is a short explanation of what the regex does:

\\d+ looks for one or more digits. It is escaped to distinguish from a normal character d.
\\w* looks for zero or more alphanumeric characters (word). Again, it's escaped.
\\. looks for a decimal point. This needs to be escaped because otherwise the decimal point means any single character.

In theory the regex should be quite flexible. It should find single or double characters for your dates.

like image

41

answered Oct 21 '22 02:10

Andrie

Sign in to Comment

Related questions
                            
                                Extract characters at a set position
                            
                                Running a linear model in R with spreadsheet data
                            
                                extracting first value from a list
                            
                                Semi-transparency in RStudio
                            
                                I want to select the greater of the two values from two columns in R [duplicate]
                            
                                Rcpp warning: "directory not found for option '-L/usr/local/Cellar/gfortran/4.8.2/gfortran'"
                            
                                r - Iterating with 2 variables in for
                            
                                Get selected rows of Rhandsontable
                            
                                Displaying radio button in elements in Shiny in a horizontal order instead of default vertical view
                            
                                several substitutions in one line R
                            
                                replace "." with space using gsub() in R?
                            
                                Add hline with population median for each facet
                            
                                How to find points by linear interpolation
                            
                                Accessing element of a split string in R
                            
                                Setting individual y axis limits with facet wrap NOT with scales free_y
                            
                                Add variable to nested list
                            
                                How to run a package's testthat tests
                            
                                geom_density_ridges requires the following missing aesthetics: y
                            
                                Extract colnames from a nested list of data.frames
                            
                                Using R with Apache & PHP [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With