Is it possible to use spread on multiple columns in tidyr similar to dcast? [duplicate]

Tags: r, tidyr, reshape2

I have the following dummy data:

    library(dplyr)
    library(tidyr)
    library(reshape2)

    dt <- expand.grid(Year = 1990:2014,
                      Product = LETTERS[1:8],
                      Country = paste0(LETTERS, "I")) %>%
      select(Product, Country, Year)
    dt$value <- rnorm(nrow(dt))

I pick two product-country combinations:

    sdt <- dt %>%
      filter((Product == "A" & Country == "AI") |
             (Product == "B" & Country == "EI"))

and I want to see the values side by side for each combination. I can do this with dcast:

    sdt %>% dcast(Year ~ Product + Country)

Is it possible to do this with spread from the package tidyr?

307

asked Jul 24 '14 09:07

mpiktas

2 Answers

One option is to create a new column, 'Prod_Count', by joining the 'Product' and 'Country' columns with paste, drop the two original columns with select, and reshape from 'long' to 'wide' with spread from tidyr.

    library(dplyr)
    library(tidyr)

    sdt %>%
      mutate(Prod_Count = paste(Product, Country, sep = "_")) %>%
      select(-Product, -Country) %>%
      spread(Prod_Count, value) %>%
      head(2)
    #  Year      A_AI       B_EI
    #1 1990 0.7878674  0.2486044
    #2 1991 0.2343285 -1.1694878

Or we can avoid a couple of steps by using unite from tidyr (from @beetroot's comment) and reshape as before.

    sdt %>%
      unite(Prod_Count, Product, Country) %>%
      spread(Prod_Count, value) %>%
      head(2)
    #   Year      A_AI       B_EI
    # 1 1990 0.7878674  0.2486044
    # 2 1991 0.2343285 -1.1694878
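The unite step works without mutate and select because unite() pastes the columns with "_" by default and drops the input columns. As a minimal, self-contained sketch of the same idea, the block below rebuilds the dummy data and passes sep explicitly. The seed and the "." separator are assumptions added for illustration only; they are not part of the original question or answer.

```r
library(dplyr)
library(tidyr)

set.seed(1)  # assumption: fixed seed, only so the sketch is reproducible

# Rebuild the dummy data from the question
dt <- expand.grid(Year = 1990:2014,
                  Product = LETTERS[1:8],
                  Country = paste0(LETTERS, "I"))
dt$value <- rnorm(nrow(dt))

# Keep the two product-country combinations of interest
sdt <- dt %>%
  filter((Product == "A" & Country == "AI") |
         (Product == "B" & Country == "EI"))

# unite() joins Product and Country into one key column and removes the
# originals (remove = TRUE is the default); sep = "." is an illustrative choice
wide <- sdt %>%
  unite(Prod_Count, Product, Country, sep = ".") %>%
  spread(Prod_Count, value)

names(wide)
```

With this separator the wide table should have one row per Year and the columns Year, A.AI and B.EI.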

109

answered Sep 17 '22 12:09

akrun

With the new function pivot_wider() introduced in tidyr version 1.0.0, this can be accomplished with one function call.

pivot_wider() (counterpart: pivot_longer()) works similarly to spread(), but it offers additional functionality, such as using multiple key/name columns (and/or multiple value columns). To this end, the argument names_from, which indicates the column(s) from which the names of the new variables are taken, may take more than one column name (here Product and Country).

    library("tidyr")

    sdt %>%
        pivot_wider(id_cols = Year,
                    names_from = c(Product, Country)) %>%
        head(2)
    #> # A tibble: 2 x 3
    #>     Year   A_AI    B_EI
    #>    <int>  <dbl>   <dbl>
    #>  1  1990 -2.08  -0.113
    #>  2  1991 -1.02  -0.0546

See also: https://tidyr.tidyverse.org/articles/pivot.html
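The "multiple value columns" case mentioned above can be sketched in the same way. The column value2 below is hypothetical and exists only for this illustration, as does the seed; the original data has a single value column.

```r
library(dplyr)
library(tidyr)

set.seed(1)  # assumption: fixed seed, only so the sketch is reproducible

# Rebuild the dummy data and the two-combination subset from the question
dt <- expand.grid(Year = 1990:2014,
                  Product = LETTERS[1:8],
                  Country = paste0(LETTERS, "I"))
dt$value <- rnorm(nrow(dt))

sdt <- dt %>%
  filter((Product == "A" & Country == "AI") |
         (Product == "B" & Country == "EI"))

# Hypothetical second measure, added only to show several value columns
sdt$value2 <- sdt$value * 100

# names_from takes both key columns; values_from takes both value columns
wide2 <- sdt %>%
  pivot_wider(id_cols = Year,
              names_from = c(Product, Country),
              values_from = c(value, value2))

names(wide2)
```

When several value columns are supplied, pivot_wider() prefixes each new column name with the name of the value column, so the result should contain columns such as value_A_AI and value2_B_EI alongside Year.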

answered Sep 18 '22 12:09

hplieninger
