Questions Linux Laravel Mysql Ubuntu Git Menu

HTML CSS JAVASCRIPT SQL PYTHON PHP BOOTSTRAP JAVA JQUERY R React Kotlin

Reorganizing data from 3 rows to 1

Tags:

r

csv

reshape2

I need to reorganize data from a csv file that contains mostly repeating data. I have the data imported into R in a dataframe but I am having trouble with the following:

ID   Language  Author   Keyword
12   eng       Rob      COLOR=Red
12   eng       Rob      SIZE=Large
12   eng       Rob      DD=1
15   eng       John     COLOR=Red
15   eng       John     SIZE=Medium
15   eng       John     DD=2

What I need to do is transform this into a row with each keyword in a separate column

ID   Language  Author  COLOR  SIZE      DD
12   eng       Rob     Red    Large     1

Any ideas?

like image

836

asked Feb 22 '13 19:02

Ray

People also ask

How do I convert multiple rows to one column?

Click in a cell, or select multiple cells that you want to split. Under Table Tools, on the Layout tab, in the Merge group, click Split Cells. Enter the number of columns or rows that you want to split the selected cells into.

1 Answers

Using the reshape2 package this is straightforward:

With tt defined as in Gary's answer

library("reshape2")

tt <- cbind(tt, colsplit(tt$Keyword, "=", c("Name", "Value")))
tt_new <- dcast(tt, ID + Language + Author ~ Name, value.var="Value")

which gives

> tt_new
  ID Language Author COLOR DD   SIZE
1 12      eng    Rob   Red  1  Large
2 15      eng   John   Red  2 Medium

like image

183

answered Oct 22 '22 14:10

Brian Diggs

Sign in to Comment

Related questions
                            
                                Significance testing in R, determining if the proportion in one column is significantly different from the other column within the single variable
                            
                                big.matrix as data.frame in R
                            
                                How to incorporate updated line colours into legend of a plot in R using lattice?
                            
                                R: Unused argument "label" in hclust
                            
                                Subtracting Two Columns Consisting of Both Date and Time in R
                            
                                Show two symbols for each legend label
                            
                                R- Create a single date from multiple columns
                            
                                R identifying a row prior to a change in sign
                            
                                Selecting rows in data.frame based on character strings
                            
                                How to control font size in png?
                            
                                colMeans function in R and running into problems with columns of size 1
                            
                                View large data set on the R console
                            
                                R extract time components from semi-standard strings
                            
                                How do I get my R buffer in emacs to occupy more horizontal space?
                            
                                reshape dataframe based on a string split in one column in R
                            
                                Selectively Modify Indices
                            
                                Removing NA columns in xts
                            
                                How to get something like Matplotlib's symlog scale in ggplot or lattice?
                            
                                Simple way to delete dataframe rows robust to instances where no rows match deletion criteria
                            
                                Moving average with varying time window in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With