How can I 'unpivot' a table? What is the proper technical term for this? UPDATE: The term is called melt I have a data frame for countries and data for each year <pre class="prettyprint"><code>Country 2001 2002 2003 Nigeria 1 2 3 UK 2 NA 1 </code></pre> And I want to have something like <pre class="prettyprint"><code>Country Year Value Nigeria 2001 1 Nigeria 2002 2 Nigeria 2003 3 UK 2001 2 UK 2002 NA UK 2003 1 </code></pre>

I still can't believe I beat Andrie with an answer. :) <pre class="prettyprint"><code>> library(reshape) > my.df <- read.table(text = "Country 2001 2002 2003 + Nigeria 1 2 3 + UK 2 NA 1", header = TRUE) > my.result <- melt(my.df, id = c("Country")) > my.result[order(my.result$Country),] Country variable value 1 Nigeria X2001 1 3 Nigeria X2002 2 5 Nigeria X2003 3 2 UK X2001 2 4 UK X2002 NA 6 UK X2003 1 </code></pre>

how to pivot/unpivot (cast/melt) data frame? [duplicate]

Tags:

r

reshape

pivot-table

reshape2

How can I 'unpivot' a table? What is the proper technical term for this?

UPDATE: The term is called melt

I have a data frame for countries and data for each year

Click to copy

Country     2001    2002    2003
Nigeria     1       2       3
UK          2       NA       1

And I want to have something like

Click to copy

Country    Year    Value
Nigeria    2001    1
Nigeria    2002    2
Nigeria    2003    3
UK         2001    2
UK         2002    NA
UK         2003    1

311

asked Nov 02 '11 12:11

pedrosaurio

3 Answers

I still can't believe I beat Andrie with an answer. :)

Click to copy

> library(reshape) > my.df <- read.table(text = "Country     2001    2002    2003    + Nigeria     1       2       3    + UK          2       NA       1", header = TRUE) > my.result <- melt(my.df, id = c("Country")) > my.result[order(my.result$Country),]      Country variable value    1 Nigeria    X2001     1    3 Nigeria    X2002     2    5 Nigeria    X2003     3    2      UK    X2001     2    4      UK    X2002    NA    6      UK    X2003     1

164

answered Sep 24 '22 18:09

Roman Luštrik

The base R reshape approach for this problem is pretty ugly, particularly since the names aren't in a form that reshape likes. It would be something like the following, where the first setNames line modifies the column names into something that reshape can make use of.

Click to copy

reshape(   setNames(mydf, c("Country", paste0("val.", c(2001, 2002, 2003)))),    direction = "long", idvar = "Country", varying = 2:ncol(mydf),    sep = ".", new.row.names = seq_len(prod(dim(mydf[-1]))))

A better alternative in base R is to use stack, like this:

Click to copy

cbind(mydf[1], stack(mydf[-1])) #   Country values  ind # 1 Nigeria      1 2001 # 2      UK      2 2001 # 3 Nigeria      2 2002 # 4      UK     NA 2002 # 5 Nigeria      3 2003 # 6      UK      1 2003

There are also new tools for reshaping data now available, like the "tidyr" package, which gives us gather. Of course, the tidyr:::gather_.data.frame method just calls reshape2::melt, so this part of my answer doesn't necessarily add much except introduce the newer syntax that you might be encountering in the Hadleyverse.

Click to copy

library(tidyr) gather(mydf, year, value, `2001`:`2003`) ## Note the backticks #   Country year value # 1 Nigeria 2001     1 # 2      UK 2001     2 # 3 Nigeria 2002     2 # 4      UK 2002    NA # 5 Nigeria 2003     3 # 6      UK 2003     1

All three options here would need reordering of rows if you want the row order you showed in your question.

A fourth option would be to use merged.stack from my "splitstackshape" package. Like base R's reshape, you'll need to modify the column names to something that includes a "variable" and "time" indicator.

Click to copy

library(splitstackshape) merged.stack(   setNames(mydf, c("Country", paste0("V.", 2001:2003))),   var.stubs = "V", sep = ".") #    Country .time_1  V # 1: Nigeria    2001  1 # 2: Nigeria    2002  2 # 3: Nigeria    2003  3 # 4:      UK    2001  2 # 5:      UK    2002 NA # 6:      UK    2003  1

Sample data

Click to copy

 mydf <- structure(list(Country = c("Nigeria", "UK"), `2001` = 1:2, `2002` = c(2L,       NA), `2003` = c(3L, 1L)), .Names = c("Country", "2001", "2002",                     "2003"), row.names = 1:2, class = "data.frame")

answered Sep 21 '22 18:09

A5C1D2H2I1M1N2O1R2T1

You can use the melt command from the reshape package. See here: http://www.statmethods.net/management/reshape.html

Probably something like melt(myframe, id=c('Country'))

answered Sep 25 '22 18:09

nicolaskruchten

Related questions
                            
                                order while splitting (eg. TA should be split to two column "A" in first "T" second) in r
                            
                                How to create a stacked bar chart from summarized data in ggplot2
                            
                                Count item pairs linked by column value
                            
                                How to name sections on x axis that are separated by vertical lines in an R plot (package ggplot2)?
                            
                                Oauth authentification to Fitbit using httr
                            
                                Parsing Deeply Nested JSON Structures in R Using RJSONIO
                            
                                Adding Different Percentiles in boxplots in R
                            
                                stat_bin2d with fill based on success rate
                            
                                Reading URL in R and RStudio
                            
                                In R, how can I test if two factors are equivalent?
                            
                                Exception handling and stack unwinding in R
                            
                                Consistent graph size in R using ggplot2 (legend and axis change the size)
                            
                                Find a word before one of two possible separators
                            
                                How to create dummy variables?
                            
                                Why does approx return a list rather than a data frame or array?
                            
                                In R, is there a way to color plot points on a gradient based on a range of numbers?
                            
                                Draw a function in ggplot2 with more than x as parameter
                            
                                Convert ggplot object to plotly in shiny application
                            
                                Convert a month abbreviation to a numeric month, in R
                            
                                Interpolate NA values in a data frame with na.approx

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how to pivot/unpivot (cast/melt) data frame? [duplicate]

Tags:

r

reshape

pivot-table

reshape2

pedrosaurio

People also ask

3 Answers

Roman Luštrik

Sample data

A5C1D2H2I1M1N2O1R2T1

nicolaskruchten

Recent Activity

Donate For Us