quantile cut by group in data.table

Tags:

I would like to do quantile cuts (cut into n bins with equal number of points) for each group

qcut = function(x, n) {
  quantiles = seq(0, 1, length.out = n+1)
  cutpoints = unname(quantile(x, quantiles, na.rm = TRUE))
  cut(x, cutpoints, include.lowest = TRUE)
}

library(data.table)
dt = data.table(A = 1:10, B = c(1,1,1,1,1,2,2,2,2,2))
dt[, bin := qcut(A, 3)]
dt[, bin2 := qcut(A, 3), by = B]

dt
A     B    bin        bin2
 1:  1 1  [1,4]    [6,7.33]
 2:  2 1  [1,4]    [6,7.33]
 3:  3 1  [1,4] (7.33,8.67]
 4:  4 1  [1,4]   (8.67,10]
 5:  5 1  (4,7]   (8.67,10]
 6:  6 2  (4,7]    [6,7.33]
 7:  7 2  (4,7]    [6,7.33]
 8:  8 2 (7,10] (7.33,8.67]
 9:  9 2 (7,10]   (8.67,10]
10: 10 2 (7,10]   (8.67,10]

Here the cut without grouping is correct -- data lie in the bin. But the result by group is wrong.

How can I fix that?

913

asked Mar 22 '17 10:03

jf328

1 Answers

This is a bug in handling of factors. Please check if it is known (or fixed in the development version) and report it to the data.table bug tracker otherwise.

qcut = function(x, n) {
  quantiles = seq(0, 1, length.out = n+1)
  cutpoints = unname(quantile(x, quantiles, na.rm = TRUE))
  as.character(cut(x, cutpoints, include.lowest = TRUE))
}

dt[, bin2 := qcut(A, 3), by = B]
#     A B    bin        bin2
# 1:  1 1  [1,4]    [1,2.33]
# 2:  2 1  [1,4]    [1,2.33]
# 3:  3 1  [1,4] (2.33,3.67]
# 4:  4 1  [1,4]    (3.67,5]
# 5:  5 1  (4,7]    (3.67,5]
# 6:  6 2  (4,7]    [6,7.33]
# 7:  7 2  (4,7]    [6,7.33]
# 8:  8 2 (7,10] (7.33,8.67]
# 9:  9 2 (7,10]   (8.67,10]
#10: 10 2 (7,10]   (8.67,10]

152

answered Sep 17 '22 19:09

Roland

Related questions
                            
                                R Shiny conditionalPanel displays when condition is not met
                            
                                How to suppress zeroes when using geom_histogram with scale_y_log10
                            
                                write.xlsx function gives error when defining path with the file name but read.xlsx is fine
                            
                                dealing with an input dataset in R Shiny
                            
                                How can I get tooltips showing in dygraphs without annotation
                            
                                Create a recursive list from a list of vectors
                            
                                List all variables (and their proportions) in a subset of a dataframe
                            
                                R: can range(data.frame) exclude infinite values?
                            
                                Rstudio Git bash pop-up every time
                            
                                Conditional calculation of mean
                            
                                R - How to Speed Up Recursion and Double Summation
                            
                                ggplot2: Save individual facet_wrap facets as separate plot objects
                            
                                Collapsible tree in R
                            
                                is it necessary to center and scale data before predicting?
                            
                                React to menuItem() tab selection
                            
                                Convert a sparse matrix to a full matrix - R
                            
                                One main panel and 2 side panels
                            
                                Grouped layer control in Leaflet R
                            
                                Generate observers for dynamic number of inputs
                            
                                How to sliderInput for Dates

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

quantile cut by group in data.table

Tags:

r

data.table

quantile

jf328

People also ask

1 Answers

Roland

Recent Activity

Donate For Us