Split columns by number in a dataframe

Tags:

I'm trying to separate a column in a rather untidy dataframe.

section
View 500
V458
453

And I want to create a new column from this. With the preferred output like below.

section  section numbers  
View     500
V        458
         453

I've been trying to research it but I'm having a time with it. I can separate them in the case of the first row, because I can use regex like this.

df_split <- separate(df, col = section, into = c("section", "section_number"), sep = " +[1-9]")

But I can't seem to find a way to use an "or" type statement. If anyone has any input that would be wonderful.

956

asked Dec 23 '16 20:12

sevpants

2 Answers

Using a simple gsub would be a choice for me:

section <- c('View 500', 'V458', '453')

cbind(section = trimws(gsub('[0-9]', '', section)), 
      section_numbers = trimws(gsub('[a-zA-Z]', '', section)))

I use trimws to just remove any unwanted white spaces.

Output:

    section section_numbers
[1,] "View"  "500"          
[2,] "V"     "458"          
[3,] ""      "453"

180

answered Sep 28 '22 09:09

LyzandeR

You can use tidyr for this:

tidyr::extract(df,section, c("section", "section number"), 
               regex="([[:alpha:]]*)[[:space:]]*([[:digit:]]*)")
  section section number
1    View            500
2       V            458
3                    453

answered Sep 28 '22 07:09

HubertL

Related questions
                            
                                Meaning of %o% in R
                            
                                Is S-PLUS dead? [closed]
                            
                                Display SpatialPolygonsDataFrame on leaflet map with R
                            
                                R as.POSIXct() dropping hours minutes and seconds
                            
                                Evaluate a Chunk based on the output format of knitr
                            
                                Proper way to have two functions access a single function's environment?
                            
                                Deleting rows in R conditionally
                            
                                conditional calculations in data frame
                            
                                Use put two value columns in spread() function in R [duplicate]
                            
                                correlation matrix of a bunch of categorical variables in R
                            
                                Whether to write in "ui.R + server.R" or "app.R"
                            
                                R - Call a function from function name that is stored in a variable?
                            
                                Get ObjectID in mongolite R library
                            
                                xyplot time series with positive values in green, negative in red, in R
                            
                                Count number of unique rows based on two columns, by group
                            
                                Divide all columns by the value from the 2nd column - apply for all rows
                            
                                How can I plot igraph community with defined colors?
                            
                                Incomplete list into dataframe
                            
                                Moving x or y axis together with tick labels to the middle of a single ggplot (no facets)
                            
                                How does createDataPartition function from caret package split data?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Split columns by number in a dataframe

Tags:

regex

split

dataframe

r

sevpants

People also ask

2 Answers

LyzandeR

HubertL

Recent Activity

Donate For Us