Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

R quantile by groups with assignments

Tags:

r

tapply

quantile

I have the following df:

group = rep(seq(1,3),30)
variable = runif(90, 5.0, 7.5)
df = data.frame(group,variable)

I need to i) Define quantile by groups, ii) Assign each person to her quantile with respect to her group.

Thus, the output would look like:

id    group  variable  quantile_with_respect_to_the_group
1      1      6.430002     1
2      2      6.198008     3
          .......

There is a complicated way to do it with loops and cut function over each groups but it is not efficient at all. Does someone know a better solution ?

Thanks !

like image 699
Jb_Eyd Avatar asked Mar 13 '23 06:03

Jb_Eyd


2 Answers

Another version with dplyr/findInterval

library(dplyr)
df %>%
  group_by(group) %>% 
  mutate(Quantile = findInterval(variable, 
                quantile(variable, probs=0:4/4)))
like image 41
akrun Avatar answered Mar 17 '23 13:03

akrun


In data.table:

library(data.table)

setDT(df)[,quantile := cut(variable, quantile(variable, probs = 0:4/4),
                         labels = FALSE, include.lowest = TRUE), by = group]

>head(df)
#    group variable quantile
# 1:     1 6.103909        2
# 2:     2 6.511485        3
# 3:     3 5.091684        1
# 4:     1 6.966461        4
# 5:     2 6.613441        4
like image 117
mtoto Avatar answered Mar 17 '23 13:03

mtoto