I am trying to use the cut function to create age intervals. Unfortunately, I receive NAs for values that match the lower end of the first break. For example: <pre class="prettyprint"><code>AGE <- sample(18:50, 100, replace = TRUE) AGE_GROUPS <- cut(AGE, breaks = c(18, 27, 36, 45)) DF <- data.frame(AGE, AGE_GROUPS) </code></pre> For all the values where AGE is 18 and above 45, I receive NA in the AGE_GROUPS variable. How can I make sure that the lowest bracket in AGE_GROUPS includes 18 and how can I make sure that the highest bracket includes all values >= 45?

Breaks isn't just the intermediate breaks, it is the endpoints too. You can make sure you get everything with <pre class="prettyprint"><code>breaks = c(-Inf, 18, 27, 36, 45, Inf) </code></pre> A little more conservatively, you could use <pre class="prettyprint"><code>breaks = c(0, 18, 27, 36, 45, 120) </code></pre> which can be useful for catching outlier coding errors. You may also want <code>include.lowest = TRUE</code>. See <code>?cut</code> for examples.

Cut function returns NA for intervals

Tags:

r

cut

I am trying to use the cut function to create age intervals. Unfortunately, I receive NAs for values that match the lower end of the first break.

For example:

AGE <- sample(18:50, 100, replace = TRUE)
AGE_GROUPS <- cut(AGE, breaks = c(18, 27, 36, 45))
DF <- data.frame(AGE, AGE_GROUPS)

For all the values where AGE is 18 and above 45, I receive NA in the AGE_GROUPS variable. How can I make sure that the lowest bracket in AGE_GROUPS includes 18 and how can I make sure that the highest bracket includes all values >= 45?

552

asked Dec 13 '17 20:12

Tea Tree

1 Answers

Breaks isn't just the intermediate breaks, it is the endpoints too. You can make sure you get everything with

breaks = c(-Inf, 18, 27, 36, 45, Inf)

A little more conservatively, you could use

breaks = c(0, 18, 27, 36, 45, 120)

which can be useful for catching outlier coding errors. You may also want include.lowest = TRUE. See ?cut for examples.

105

answered Sep 20 '22 07:09

Gregor Thomas

Related questions
                            
                                Error: .onLoad failed in loadNamespace() for 'tcltk', details:
                            
                                How to keep count in a recursive function in R?
                            
                                ROCR error: Format of predictions is invalid
                            
                                Using dcast.data.table with date values and aggregation
                            
                                How to add logo on ggplot2 footer
                            
                                Count number of values in R [duplicate]
                            
                                Converting to date in a character column that contains two date formats
                            
                                Counting the number of "0" in this factor
                            
                                How can I reverse numbers in a vector ONLY if they are sequential?
                            
                                clicking same plotly marker twice does not trigger events twice
                            
                                Export fitted regression splines (constructed by 'bs' or 'ns') as piecewise polynomials
                            
                                r heatmap - stat_density2d (ggmap) vs. addHeatmap (shiny leaflet)
                            
                                How to use values from a previous row and column
                            
                                Unnest one column list to many columns in tidyr
                            
                                Error in na.fail.default(as.ts(x)) : missing values in object in time series forecasting
                            
                                Map values to viridis colours in r
                            
                                Find the pair of most correlated variables
                            
                                Data frame to nested list
                            
                                creating a square matrix from a data frame [duplicate]
                            
                                Fast checking of missing values in Rcpp

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With