Convert a properly formatted string to data frame

Tags:

r

I have

x<-"1, A | 2, B | 10, C "

x is always formatted this way: | denotes a new row, the first value in each row is variable1, and the second value is variable2.

I would like to convert it to a data.frame

  variable1 variable2
1         1         A
2         2         B
3        10         C

I haven't found any package that can treat | as a row separator.

How can I convert it to data.frame?


asked Dec 25 '21 20:12

ECII


2 Answers

We can use read.table from base R to read the string into two columns after replacing each | with a newline (\n):

read.table(text = gsub("|", "\n", x, fixed = TRUE), sep=",", 
    header = FALSE, col.names = c("variable1", "variable2"), strip.white = TRUE )

Output:

  variable1 variable2
1         1         A
2         2         B
3        10         C
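The read.table approach can be wrapped in a small function to make the two steps explicit: first turn each | into a line break, then parse the result as comma-separated text. This is only a sketch. The name parse_pipe_rows and the col_names argument are hypothetical and not part of the original answer, and stringsAsFactors = FALSE is added on the assumption that variable2 should stay character on older R versions as well.

```r
# Hypothetical helper (not from the original answer): turns a string such as
# "1, A | 2, B" into a two-column data frame.
parse_pipe_rows <- function(x, col_names = c("variable1", "variable2")) {
  # Replace every literal "|" with a newline so each record sits on its own line.
  lines <- gsub("|", "\n", x, fixed = TRUE)
  # Parse the lines as comma-separated fields, trimming surrounding whitespace.
  read.table(text = lines, sep = ",", header = FALSE,
             col.names = col_names, strip.white = TRUE,
             stringsAsFactors = FALSE)
}

df <- parse_pipe_rows("1, A | 2, B | 10, C ")
str(df)
```

Because read.table guesses column types, variable1 should come back as an integer column and variable2 as character for input like the one in the question.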

Or use fread from data.table:

library(data.table)
fread(gsub("|", "\n", x, fixed = TRUE), col.names = c("variable1", "variable2"))
   variable1 variable2
1:         1         A
2:         2         B
3:        10         C

Or with the tidyverse: use separate_rows to split the string into one row per record, then separate to create the two columns.

library(tidyr)
library(dplyr)
tibble(x = trimws(x)) %>% 
  separate_rows(x, sep = "\\s*\\|\\s*") %>%
  separate(x, into = c("variable1", "variable2"), sep=",\\s+", convert = TRUE)
# A tibble: 3 × 2
  variable1 variable2
      <int> <chr>    
1         1 A        
2         2 B        
3        10 C

answered Oct 20 '22 09:10

akrun

Here's a way using scan().

x <- "1, A | 2, B | 10, C "

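# scan() splits the string on '|' and trims whitespace;
# strsplit() then splits each record on ', ' before the rows are bound together.
do.call(rbind.data.frame,
        strsplit(scan(text=x, what="A", sep='|', quiet=TRUE, strip.white=TRUE), ', ')) |>
  setNames(c('variable1', 'variable2'))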
#   variable1 variable2
# 1         1         A
# 2         2         B
# 3        10         C

Note: R version 4.1.2 (2021-11-01).
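If variable1 should end up as an integer rather than text, one option is to run the split fields through type.convert(). The sketch below is a variation on the idea above rather than the answer's own code: it uses plain strsplit() instead of scan(), and the intermediate names rows, fields and m are purely illustrative.

```r
x <- "1, A | 2, B | 10, C "

# Split into records on the literal "|" and trim surrounding whitespace.
rows <- trimws(strsplit(x, "|", fixed = TRUE)[[1]])

# Split each record into its two fields on a comma plus optional whitespace.
fields <- strsplit(rows, ",\\s*")

# Stack the fields into a character matrix, one record per row.
m <- do.call(rbind, fields)

df <- data.frame(variable1 = m[, 1], variable2 = m[, 2],
                 stringsAsFactors = FALSE)

# Let R pick a suitable type for each column; as.is = TRUE keeps strings as character.
df <- type.convert(df, as.is = TRUE)
str(df)
```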

answered Oct 20 '22 10:10

jay.sf
