Split word in column in R

Tags:

r

I have a data frame with multiple columns in R. I want to split the "age" column into two columns, each holding one character.

         fas       value age colony
   1:  C12:0 0.002221915  LO   7_13
   2:  C13:0 0.000770179  LO   7_13
   3:  C14:0 0.004525352  LO   7_13
   4:  C15:0 0.000738928  LO   7_13
   5: C16:1a 0.002964627  LO   7_13

Output:

         fas       value size age colony
   1:  C12:0 0.002221915    L   O   7_13
   2:  C13:0 0.000770179    L   O   7_13
   3:  C14:0 0.004525352    L   O   7_13
   4:  C15:0 0.000738928    L   O   7_13
   5: C16:1a 0.002964627    L   O   7_13

I tried:

data_frame<-str_split_fixed(df$age, "", 2)

asked Feb 21 '21 16:02

Luker354

2 Answers

With base R:

df$size <- substr(df$age,1,1)
df$age  <- substr(df$age,2,2)

And to get the result in the column order you specified:

df[,c("fas","value","size","age","colony")]
     fas       value size age colony
1  C12:0 0.002221915    L   O   7_13
2  C13:0 0.000770179    L   O   7_13
3  C14:0 0.004525352    L   O   7_13
4  C15:0 0.000738928    L   O   7_13
5 C16:1a 0.002964627    L   O   7_13
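
For anyone who wants to run this end to end, here is a minimal self-contained sketch of the substr approach. It assumes the data is a plain data.frame named df; the construction of df is an illustration rebuilt from the sample rows in the question, not the asker's actual object.

```r
# Rebuild the sample data from the question (assumed to be a plain data.frame)
df <- data.frame(
  fas    = c("C12:0", "C13:0", "C14:0", "C15:0", "C16:1a"),
  value  = c(0.002221915, 0.000770179, 0.004525352, 0.000738928, 0.002964627),
  age    = "LO",    # recycled to all five rows
  colony = "7_13",  # recycled to all five rows
  stringsAsFactors = FALSE
)

# Create size first, while age still holds both characters
df$size <- substr(df$age, 1, 1)
# Then overwrite age with its second character
df$age  <- substr(df$age, 2, 2)

# Reorder the columns to the layout requested in the question
df <- df[, c("fas", "value", "size", "age", "colony")]
```

The order of the two assignments matters: if age were overwritten first, it would hold a single character and the second substr call would have nothing left to extract.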

answered Oct 24 '22 01:10

Waldi

You can use sub and backreferences:

# compute size first, while age still holds both characters
df$size <- sub("(^\\w)(\\w$)", "\\1", df$age)
df$age <- sub("(^\\w)(\\w$)", "\\2", df$age)
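
As a rough sketch of how the backreferences behave, the example below works on a standalone character vector rather than on df. The vector age_raw and the value "SY" are hypothetical stand-ins for illustration. Both substitutions read from the untouched original, so neither depends on the other having run.

```r
# Hypothetical sample values standing in for df$age
age_raw <- c("LO", "LO", "SY")

# Group 1 captures the first character, group 2 the second
size <- sub("^(\\w)(\\w)$", "\\1", age_raw)  # keep the first character
age  <- sub("^(\\w)(\\w)$", "\\2", age_raw)  # keep the second character

# sub() returns its input unchanged when the pattern does not match,
# so a one-character string passes through as-is
unmatched <- sub("^(\\w)(\\w)$", "\\2", "L")
```

Because an unmatched string comes back unchanged, values that are not exactly two word characters are silently left alone, which may be worth checking for on real data.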

answered Oct 24 '22 03:10

Chris Ruehlemann
