I've got a little problem using dplyr <code>group_by</code> function. After doing this : <pre class="prettyprint"><code>datasetALL %>% group_by(YEAR,Region) %>% summarise(count_number = n()) </code></pre> here is the result : <pre class="prettyprint"><code>YEAR Region count_number <int> <int> <int> 1 1946 1 2 2 1946 2 3 3 1946 3 1 4 1946 5 1 5 1947 3 1 6 1947 4 1 </code></pre> I would like something like : <pre class="prettyprint"><code>YEAR Region count_number <int> <int> <int> 1 1946 1 2 2 1946 2 3 3 1946 3 1 4 1946 5 1 5 1946 4 0 #order is no important 6 1947 1 0 7 1947 2 0 8 1947 3 1 9 1947 4 1 10 1947 5 0 </code></pre> I try to use <code>complete()</code> from tidyr package, but it's not succeeding...

Using <code>complete</code> from the tidyr package should work. You can find documentation about it here. What probably happened is that you did not remove the grouping. Then complete tries to add each of the combinations of <code>YEAR</code> and <code>Region</code> within each group. But all these combinations are already in the grouping. Thus first remove the grouping and then do the complete. <pre class="prettyprint"><code>datasetALL %>% group_by(YEAR,Region) %>% summarise(count_number = n()) %>% ungroup() %>% complete(Year, Region, fill = list(count_number = 1)) </code></pre>

It has been already mentioned, but you can solve this problem in its entirety by using <code>tidyr</code> and the parameter <code>nesting</code> in it: <pre class="prettyprint"><code>complete(df, YEAR, nesting(Region), fill = list(count_number = 0)) YEAR Region count_number <int> <int> <dbl> 1 1946 1 2 2 1946 2 3 3 1946 3 1 4 1946 4 0 5 1946 5 1 6 1947 1 0 7 1947 2 0 8 1947 3 1 9 1947 4 1 10 1947 5 0 </code></pre>

Complete column with group_by and complete

Tags:

r

dplyr

tidyr

I've got a little problem using dplyr group_by function. After doing this :

datasetALL %>% group_by(YEAR,Region) %>% summarise(count_number = n())

here is the result :

YEAR Region count_number
<int>  <int>        <int>
1   1946      1            2
2   1946      2            3
3   1946      3            1
4   1946      5            1
5   1947      3            1
6   1947      4            1

I would like something like :

YEAR Region count_number
<int>  <int>        <int>
1   1946      1            2
2   1946      2            3
3   1946      3            1
4   1946      5            1
5   1946      4            0 #order is no important
6   1947      1            0
7   1947      2            0
8   1947      3            1
9   1947      4            1
10  1947      5            0

I try to use complete() from tidyr package, but it's not succeeding...

582

asked Apr 19 '17 16:04

Ben

Video Answer

2 Answers

Using complete from the tidyr package should work. You can find documentation about it here.

What probably happened is that you did not remove the grouping. Then complete tries to add each of the combinations of YEAR and Region within each group. But all these combinations are already in the grouping. Thus first remove the grouping and then do the complete.

datasetALL %>% 
    group_by(YEAR,Region) %>% 
    summarise(count_number = n()) %>%
    ungroup() %>%
    complete(Year, Region, fill = list(count_number = 1))

127

answered Sep 30 '22 08:09

Pieter

It has been already mentioned, but you can solve this problem in its entirety by using tidyr and the parameter nesting in it:

complete(df, YEAR, nesting(Region), fill = list(count_number = 0))

    YEAR Region count_number
   <int>  <int>        <dbl>
 1  1946      1            2
 2  1946      2            3
 3  1946      3            1
 4  1946      4            0
 5  1946      5            1
 6  1947      1            0
 7  1947      2            0
 8  1947      3            1
 9  1947      4            1
10  1947      5            0

answered Sep 30 '22 10:09

tmfmnk

Related questions
                            
                                r search along a vector and calculate the mean
                            
                                Proper R Markdown Code Organization
                            
                                Test if column name contains string in R
                            
                                Removing one tableGrob when applied to a box plot with a facet_wrap
                            
                                How to delete everything after nth delimiter in R?
                            
                                How can I import SAS format files into R?
                            
                                Dynamically sorting columns in dplyr via passing ordered vector with column names to select
                            
                                Plot 2 tmap objects side-by-side
                            
                                Is there a function to recognize a word?
                            
                                How to combine two rows in R?
                            
                                Why is standard R median function so much slower than a simple C++ alternative?
                            
                                Aggregate data.frame for each day
                            
                                Faster way to unlist a list of large matrices?
                            
                                How to get the table counts for unique values in column
                            
                                Extract pattern from string in R without distinguishing between upper and lower case letters
                            
                                Shift geom_bar right (not center-aligned)
                            
                                Preserve order of input variables and factor levels in summary table, using dplyr tidyr
                            
                                Get value of last non-NA row per column in data.table
                            
                                filter or subset list by partial object name in R
                            
                                How to extract the "domain" from an email address

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With