How to partition when ranking on a particular column?

I have a data frame like the following. I know I can compute a global rank order like this:

dt <- data.frame(
    ID = c('A1','A2','A4','A2','A1','A4','A3','A2','A1','A3'),
    Value = c(4,3,1,3,4,6,6,1,8,4)
);
> dt
   ID Value
1  A1     4
2  A2     3
3  A4     1
4  A2     3
5  A1     4
6  A4     6
7  A3     6
8  A2     1
9  A1     8
10 A3     4
dt$Order <- rank(dt$Value,ties.method = "first")
> dt
   ID Value Order
1  A1     4     5
2  A2     3     3
3  A4     1     1
4  A2     3     4
5  A1     4     6
6  A4     6     8
7  A3     6     9
8  A2     1     2
9  A1     8    10
10 A3     4     7

But how can I compute the rank order within each ID instead of a global rank order? In T-SQL, this can be done with the following syntax:

RANK() OVER ( [ < partition_by_clause > ] < order_by_clause > )

Any idea?

asked Apr 01 '12 03:04

RobinMin

1 Answer

Many options.

Using ddply from the plyr package:

library(plyr)
ddply(dt,.(ID),transform,Order = rank(Value,ties.method = "first"))
   ID Value Order
1  A1     4     1
2  A1     4     2
3  A1     8     3
4  A2     3     2
5  A2     3     3
6  A2     1     1
7  A3     6     2
8  A3     4     1
9  A4     1     1
10 A4     6     2

Or, if performance is an issue (e.g. very large data), using the data.table package:

library(data.table)
DT <- data.table(dt,key = "ID")
DT[,transform(.SD,Order = rank(Value,ties.method = "first")),by = ID]
      ID Value Order
 [1,] A1     4     1
 [2,] A1     4     2
 [3,] A1     8     3
 [4,] A2     3     2
 [5,] A2     3     3
 [6,] A2     1     1
 [7,] A4     1     1
 [8,] A4     6     2
 [9,] A3     6     2
[10,] A3     4     1
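
As a variation on the data.table approach (not part of the original answer), the rank column can also be added by reference with `:=`, which avoids copying each group through transform. This is a minimal sketch, assuming a data.table version that provides frank() (1.9.6 or later); plain rank() should work the same way here.

```r
library(data.table)

# Same sample data as in the question
dt <- data.frame(
    ID = c('A1','A2','A4','A2','A1','A4','A3','A2','A1','A3'),
    Value = c(4,3,1,3,4,6,6,1,8,4)
)

# Convert without setting a key, so the original row order is kept
DT <- as.data.table(dt)

# Add Order in place: rank Value within each ID, breaking ties by position
DT[, Order := frank(Value, ties.method = "first"), by = ID]

print(DT)
```

Because no key is set, the rows stay in their original order and only the new Order column is added.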

Or, in all its gory detail, a base R solution using split, lapply, do.call and rbind:

do.call(rbind,lapply(split(dt,dt$ID),transform,
              Order = rank(Value,ties.method = "first")))
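
If the split/lapply/rbind version feels heavy, base R's ave() may be a shorter route to the same per-group ranking. This is a sketch of an alternative, not part of the original answer; it applies a function within each level of a grouping vector and returns the results in the original row order.

```r
# Same sample data as in the question
dt <- data.frame(
    ID = c('A1','A2','A4','A2','A1','A4','A3','A2','A1','A3'),
    Value = c(4,3,1,3,4,6,6,1,8,4)
)

# Rank Value within each ID; FUN must be passed by name because it follows `...`
dt$Order <- ave(dt$Value, dt$ID,
                FUN = function(x) rank(x, ties.method = "first"))

print(dt)
```

Unlike the do.call(rbind, ...) version, this leaves the rows unsorted and does not alter the row names.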

answered Sep 30 '22 19:09

joran

Tags: dataframe, r, rank, database-partitioning
