I would really like pivot_wider to create a column with NAs if the level of a factor exists but never appears in the data when it's used as a names_from argument. For example, the first line gives me a two column tibble, but I'd really like the three column tibble below. <pre class="prettyprint"><code>tibble(Person=c("Sarah", "Jackson", "Jackson"), Rank=c(1,1,2), FavoriteAnimal=factor(c("Dog", "Dog", "Cat")))%>% group_by(Person)%>%arrange(Rank)%>%slice(1)%>% pivot_wider(names_from = FavoriteAnimal, values_from=Rank) </code></pre> <code>tibble(Person=c("Jackson", "Sarah"), Dog=c(1,1), Cat=c(NA,NA))</code> How can I get my column of NAs for levels not appearing in my dataset?

Alternatively, you can first add the missing levels and then do the transformation: <pre class="prettyprint"><code>tibble(Person = c("Sarah", "Jackson", "Jackson"), Rank = c(1, 1, 2), FavoriteAnimal = factor(c("Dog", "Dog", "Cat"))) %>% group_by(Person) %>% arrange(Rank) %>% slice(1) %>% complete(FavoriteAnimal = FavoriteAnimal) %>% pivot_wider(names_from = FavoriteAnimal, values_from = Rank) Person Cat Dog <chr> <dbl> <dbl> 1 Jackson NA 1 2 Sarah NA 1 </code></pre>

You can do it with <code>tidyr::spread</code> - <code>spread(key = FavoriteAnimal, value = Rank, drop = FALSE)</code> gives you what you want. Unfortunately the <code>drop</code> argument seems to have been lost in the transition from <code>spread</code> to <code>pivot_wider</code>.

How can I keep pivot_wider() from dropping factor levels in names?

Tags:

r

tidyr

I would really like pivot_wider to create a column with NAs if the level of a factor exists but never appears in the data when it's used as a names_from argument. For example, the first line gives me a two column tibble, but I'd really like the three column tibble below.

tibble(Person=c("Sarah", "Jackson", "Jackson"), Rank=c(1,1,2), 
       FavoriteAnimal=factor(c("Dog", "Dog", "Cat")))%>%
    group_by(Person)%>%arrange(Rank)%>%slice(1)%>%
    pivot_wider(names_from = FavoriteAnimal, values_from=Rank)

tibble(Person=c("Jackson", "Sarah"), Dog=c(1,1), Cat=c(NA,NA))

How can I get my column of NAs for levels not appearing in my dataset?

710

asked Nov 19 '19 15:11

jntrcs

2 Answers

Alternatively, you can first add the missing levels and then do the transformation:

tibble(Person = c("Sarah", "Jackson", "Jackson"), 
       Rank = c(1, 1, 2), 
       FavoriteAnimal = factor(c("Dog", "Dog", "Cat"))) %>%
 group_by(Person) %>%
 arrange(Rank) %>% 
 slice(1) %>%
 complete(FavoriteAnimal = FavoriteAnimal) %>%
 pivot_wider(names_from = FavoriteAnimal, values_from = Rank)

  Person    Cat   Dog
  <chr>   <dbl> <dbl>
1 Jackson    NA     1
2 Sarah      NA     1

153

answered Nov 06 '22 13:11

tmfmnk

You can do it with tidyr::spread - spread(key = FavoriteAnimal, value = Rank, drop = FALSE) gives you what you want.

Unfortunately the drop argument seems to have been lost in the transition from spread to pivot_wider.

answered Nov 06 '22 12:11

Andrew Gustar

Related questions
                            
                                Cumulative variance explained for NMDS in R
                            
                                Shade background of a ggplot chart using geom_rect with categorical variables
                            
                                R- how to conditionally remove first row of group_by
                            
                                Add directlabels to geom_smooth rather than geom_line
                            
                                What is the difference between paste/paste0 and str_c?
                            
                                How to format a difftime object to a string with HH:MM:SS
                            
                                ggplot aes_string doesn't work with spaces
                            
                                rmarkdown & kable/kableextra: Printing % symbol in Table when using escape = F
                            
                                select non-missing variables in a purrr loop
                            
                                Add space above y-axis without expand()
                            
                                Aligning axes of R plots on one side of a grid together
                            
                                Using case_when() to assign two new columns, instead of one
                            
                                R shiny datatable pagination and show all rows as options
                            
                                How to use rlang operators in a package?
                            
                                What is the equivalent of "everything()" operator in "data.table"? [duplicate]
                            
                                How to change the colour of the printed output in base R?
                            
                                How to code elementary symmetric polynomials in R
                            
                                Making variables immutable in R
                            
                                Add a MS Word Comment via Rmarkdown
                            
                                ggplot2 change fill for color legend when fill also used in aesthetic

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With