How to find the top N values per group, in descending order, in dplyr

Tags:

r

I have the following data frame in R:

  Service     Codes
   ABS         RT
   ABS         RT
   ABS         TY
   ABS         DR
   ABS         DR
   ABS         DR
   ABS         DR
   DEF         RT
   DEF         RT
   DEF         TY
   DEF         DR
   DEF         DR
   DEF         DR
   DEF         DR
   DEF         TY
   DEF         SE
   DEF         SE

What I want is the count of each code within each service, sorted in descending order, keeping the top 3 codes per service:

  Service     Codes    Count
   ABS         DR        4
   ABS         RT        2
   ABS         TY        1
   DEF         DR        4
   DEF         RT        2
   DEF         TY        2

I am doing the following in R:

df %>%
  group_by(Service, Codes) %>%
  summarise(Count = n()) %>%
  top_n(n = 3, wt = Count) %>%
  arrange(desc(Count)) %>%
  as.data.frame()

But it does not give me the intended result.

202

asked Jul 28 '17 05:07

Neil

1 Answer

We can try count/arrange/slice:

df1 %>% 
   count(Service, Codes) %>%
   arrange(desc(n)) %>% 
   group_by(Service) %>% 
   slice(seq_len(3))
# A tibble: 6 x 3
# Groups:   Service [2]
#  Service Codes     n
#    <chr> <chr> <int>
#1     ABS    DR     4
#2     ABS    RT     2
#3     ABS    TY     1
#4     DEF    DR     4
#5     DEF    RT     2
#6     DEF    SE     2
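
As a hedged alternative, on dplyr 1.0.0 or later the same idea might be written with slice_max(). This is only a sketch: the rep()-based construction of df1 and the names Count and top3 are assumptions made for illustration, not part of the original post. Note that 'DEF' has three codes tied at 2 (RT, SE, TY), so with_ties = FALSE caps the result at three rows per service, and which of the tied codes survive depends on row order.

```r
library(dplyr)

# Rebuild the sample data from the question (illustrative construction).
df1 <- data.frame(
  Service = rep(c("ABS", "DEF"), times = c(7, 10)),
  Codes = c("RT", "RT", "TY", "DR", "DR", "DR", "DR",
            "RT", "RT", "TY", "DR", "DR", "DR", "DR", "TY", "SE", "SE"),
  stringsAsFactors = FALSE
)

# Count each Service/Codes pair, then keep the 3 largest counts per Service.
# with_ties = FALSE returns at most 3 rows per group even when counts tie.
top3 <- df1 %>%
  count(Service, Codes, name = "Count") %>%
  group_by(Service) %>%
  slice_max(Count, n = 3, with_ties = FALSE) %>%
  ungroup()

print(top3)
```

If the tied rows should all be kept instead, with_ties = TRUE (the default) can be used, at the cost of possibly returning more than three rows for a service.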

In the OP's code, we need to arrange by 'Service' too. As @Marius said in the comments, top_n will return more rows than requested if there are ties. One option is to do a second grouping by 'Service' and slice (as shown above); alternatively, after the grouping, we can filter:

df1 %>%
  group_by(Service, Codes) %>%
  summarise(Count = n()) %>%
  top_n(n = 3, wt = Count) %>%
  arrange(Service, desc(Count)) %>%
  group_by(Service) %>%
  filter(row_number() <= 3)
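
To make the tie behaviour concrete, here is a small sketch that starts from already-summarised counts. The data frame counts and the names kept_ties and trimmed are hypothetical and typed in by hand from the question's data. It compares the number of rows returned by top_n alone with the number left after the extra filter step.

```r
library(dplyr)

# Pre-computed counts per Service/Codes pair (entered by hand for illustration).
counts <- data.frame(
  Service = c("ABS", "ABS", "ABS", "DEF", "DEF", "DEF", "DEF"),
  Codes   = c("DR", "RT", "TY", "DR", "RT", "SE", "TY"),
  Count   = c(4L, 2L, 1L, 4L, 2L, 2L, 2L),
  stringsAsFactors = FALSE
)

# top_n keeps every row tied at the cut-off, so 'DEF' returns 4 rows, not 3.
kept_ties <- counts %>%
  group_by(Service) %>%
  top_n(n = 3, wt = Count) %>%
  ungroup()

# Sorting within Service and filtering on row_number() trims each group to 3.
trimmed <- counts %>%
  group_by(Service) %>%
  top_n(n = 3, wt = Count) %>%
  arrange(Service, desc(Count)) %>%
  filter(row_number() <= 3) %>%
  ungroup()

print(nrow(kept_ties))
print(nrow(trimmed))
```

Which of the tied 'DEF' codes is dropped by the filter depends on the row order after arrange, so a further sort key (for example Codes) could be added if a deterministic choice matters.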

126

answered Oct 14 '22 06:10

akrun
