Calculate ranks for each group

Q: What is the formula for ranking in Excel?

=RANK(B2,$B$2:$B$11) For ascending order, type a 1, or any other number except zero. If you were comparing golf scores, you could type a 1, to rank in ascending order.

Tags:

r

ranking

plyr

I have a df with types and values. I want to rank them in order of x within type and give a count of how many other rows row n has higher value of x than (column pos).

e.g.

df <- data.frame(type = c("a","a","a","b","b","b"),x=c(1,77,1,34,1,8))
# for type a row 3 has a higher x than row 1 and 2 so has a pos value of 2

I can do this with:

library(plyr)
df <- data.frame(type = c("a","a","a","b","b","b"),x=c(1,77,1,34,1,8))
df <- ddply(df,.(type), function(x) x[with(x, order(x)) ,])
df <- ddply(df,.(type), transform, pos = (seq_along(x)-1) )

     type  x pos
1    a  1   0
2    a  1   1
3    a 77   2
4    b  1   0
5    b  8   1
6    b 34   2

But this approach does not take into account ties between type a row 1 and 2. Whats the easiest way to get the output where ties have the same value e.g.

     type  x pos
 1    a  1   0
 2    a  1   0
 3    a 77   2
 4    b  1   0
 5    b  8   1
 6    b 34   2

855

asked Dec 17 '12 14:12

user1320502

1 Answers

ddply(df,.(type), transform, pos = rank(x,ties.method ="min")-1)

  type  x pos
1    a  1   0
2    a 77   2
3    a  1   0
4    b 34   2
5    b  1   0
6    b  8   1

135

answered Oct 17 '22 05:10

Roland

Related questions
                            
                                Join data.table on exact date or if not the case on the nearest less than date
                            
                                R: caching/memoise for environments
                            
                                remove columns with NAs from all dataframes in list
                            
                                In R, how to get the whole command line into the sys.call() of a binary operator?
                            
                                How to delete a slot of an element in a list in R with lappy
                            
                                Reading sdmx-xml files into a dataframe in R
                            
                                R: replacing NA with value of closest point
                            
                                using k-NN in R with categorical values
                            
                                Why sometimes i cant set a class definition as slot in a s4 class? [closed]
                            
                                Will just installing this package speed up R?
                            
                                Combining or merging workspaces in R and general workspace management
                            
                                Classification with naiveBayes (e1071) does not work ($levels returns NULL)
                            
                                Exclude rows with certain time of day
                            
                                Query using geom_bar() of ggplot2 - R
                            
                                pdf device and font family "Arial" / Or: Change font name (not font) in PDF
                            
                                How to perform 10 fold cross validation with LibSVM in R?
                            
                                contrasts in anova
                            
                                suffixes in xts merge in R [closed]
                            
                                Efficient way to calculate grid quadrants a line passes through
                            
                                How to change matrix column type in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With