I'm having trouble stacking columns in a data.frame into one column. Now my data looks something like this: <pre class="prettyprint"><code>id time black white red a 1 b1 w1 r1 a 2 b2 w2 r2 a 3 b3 w3 r3 b 1 b4 w4 r4 b 2 b5 w5 r5 b 3 b6 w6 r6 </code></pre> I'm trying to transform the data.frame so that it looks like this: <pre class="prettyprint"><code>id time colour a 1 b1 a 2 b2 a 3 b3 b 1 b4 b 2 b5 b 3 b6 a 1 w1 a 2 w2 a 3 w3 b 1 w4 b 2 w5 b 3 w6 a 1 r1 a 2 r2 . . . . . . . . . </code></pre> I'm guessing that this problem requires using the reshape package, but I'm not exactly sure how to use it to stack multiple columns under one column. Can anyone provide help on this?

Here's melt from reshape: <pre class="prettyprint"><code>library(reshape) melt(x, id.vars=c('id', 'time'),var='color') </code></pre> <hr> And using <code>reshape2</code> (an up-to-date, faster version of <code>reshape</code>) the syntax is almost identical. The help files have useful examples (see <code>?melt</code> and the link to <code>melt.data.frame</code>). In your case, something like the following will work (assuming your data.frame is called <code>DF</code>) <pre class="prettyprint"><code>library(reshape2) melt(DF, id.var = c('id','time'), variable.name = 'colour') </code></pre>

stacking columns in data.frame into one column in R

Tags:

dataframe

r

reshape

I'm having trouble stacking columns in a data.frame into one column. Now my data looks something like this:

id   time    black   white   red 
a     1       b1      w1     r1
a     2       b2      w2     r2
a     3       b3      w3     r3
b     1       b4      w4     r4
b     2       b5      w5     r5
b     3       b6      w6     r6

I'm trying to transform the data.frame so that it looks like this:

id   time  colour 
a     1     b1
a     2     b2
a     3     b3
b     1     b4
b     2     b5
b     3     b6
a     1     w1
a     2     w2
a     3     w3
b     1     w4
b     2     w5
b     3     w6
a     1     r1
a     2     r2
.     .     .
.     .     .
.     .     .

I'm guessing that this problem requires using the reshape package, but I'm not exactly sure how to use it to stack multiple columns under one column. Can anyone provide help on this?

671

asked Nov 28 '12 03:11

econlearner

2 Answers

Since you mention "stacking" in your title, you can also look at the stack function in base R:

cbind(mydf[1:2], stack(mydf[3:5]))
#    id time values   ind
# 1   a    1     b1 black
# 2   a    2     b2 black
# 3   a    3     b3 black
# 4   b    1     b4 black
# 5   b    2     b5 black
# 6   b    3     b6 black
# 7   a    1     w1 white
# 8   a    2     w2 white
# 9   a    3     w3 white
# 10  b    1     w4 white
# 11  b    2     w5 white
# 12  b    3     w6 white
# 13  a    1     r1   red
# 14  a    2     r2   red
# 15  a    3     r3   red
# 16  b    1     r4   red
# 17  b    2     r5   red
# 18  b    3     r6   red

If the values in the "black", "white", and "red" columns are factors, you'll need to convert them to character values first.

cbind(mydf[1:2], stack(lapply(mydf[3:5], as.character)))

170

answered Oct 23 '22 11:10

A5C1D2H2I1M1N2O1R2T1

Here's melt from reshape:

library(reshape)
melt(x, id.vars=c('id', 'time'),var='color')

And using reshape2 (an up-to-date, faster version of reshape) the syntax is almost identical.

The help files have useful examples (see ?melt and the link to melt.data.frame).

In your case, something like the following will work (assuming your data.frame is called DF)

library(reshape2)
melt(DF, id.var = c('id','time'), variable.name = 'colour')

answered Oct 23 '22 10:10

Matthew Lundberg

Related questions
                            
                                How to compute rolling covariance more efficiently
                            
                                Inserting a new row to data frame for each group id
                            
                                rbind dataframes across nested lists
                            
                                DT package not working with blogdown using hugo-future-imperfect theme
                            
                                comprehensive way to check for functions that use the random number generator in an R script?
                            
                                correlation of one variable to all the other in R
                            
                                Adjusting the width of the datatable using DT in R
                            
                                str_replace A1-A9 to A01-A09 and so on
                            
                                Adding convex hull to ggplot map
                            
                                Python equivalent of R list()
                            
                                R ggplot2 ggrepel - label a subset of points while being aware of all points
                            
                                Garbage collection of seemingly PROTECTed pairlist
                            
                                Strings as variable references in an R formula
                            
                                guidelines for testing a statistical function in R?
                            
                                Can anyone help me write a R data frame as a SAS data set?
                            
                                Is it possible/advisable to skip roxygen in favor of roxygen2? [closed]
                            
                                geom_smooth() - and scaling the y axis, losing data from smoothing
                            
                                How to do one-way ANOVA in R with unequal sample sizes?
                            
                                Installation directory of R and the usage of .libPath()
                            
                                How to save glm result without data or only with coeffients for prediction?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With