"Adding missing grouping variables" message in dplyr in R

Tags:

I have a portion of my script that was running fine before, but recently has been producing an odd statement after which many of my other functions do not work properly. I am trying to select the 8th and 23rd positions in a ranked list of values for each site to find the 25th and 75th percentile values for each day in a year for each site for 30 years. My approach was as follows (adapted for the four line dataset - slice(3) would be slice(23) for my full 30 year dataset usually):

library(“dplyr”)  mydata  structure(list(station_number = structure(c(1L, 1L, 1L, 1L), .Label = "01AD002", class = "factor"),  year = 1981:1984, month = c(1L, 1L, 1L, 1L), day = c(1L,  1L, 1L, 1L), value = c(113, 8.329999924, 15.60000038, 149 )), .Names = c("station_number", "year", "month", "day", "value"), class = "data.frame", row.names = c(NA, -4L))        value <- mydata$value   qu25 <- mydata %>%            group_by(month, day, station_number) %>%            arrange(desc(value)) %>%            slice(3) %>%            select(value)

Before, I would be left with a table that had one value per site to describe the 25th percentile (since the arrange function seems to order them highest to lowest). However, now when I run these lines, I get a message:

Adding missing grouping variables: `month`, `day`, `station_number`

This message doesn’t make sense to me, as the grouping variables are clearly present in my table. Also, again, this was working fine until recently. I have tried:

detatch(“plyr”) – since I have it loaded before dplyr
dplyr:: group_by – placing this directly in the group_by line
uninstalling and re-intstalling dplyr, although this was for another issue I was having

Any idea why I might be receiving this message and why it may have stopped working?

Thanks for any help.

Update: Added dput example with one site, but values for January 1st for multiple years. The hope would be that the positional value is returned once grouped, for instance slice(3) would hopefully return the 15.6 value for this smaller subset.

755

asked Jul 21 '16 18:07

acersaccharum

1 Answers

For consistency sake the grouping variables should be always present when defined earlier and thus are added when select(value) is executed. ungroup should resolve it:

qu25 <- mydata %>%    group_by(month, day, station_number) %>%   arrange(desc(value)) %>%    slice(2) %>%    ungroup() %>%   select(value)

The requested result is without warnings:

> mydata %>%  +   group_by(month, day, station_number) %>% +   arrange(desc(value)) %>%  +   slice(2) %>%  +   ungroup() %>% +   select(value) # A tibble: 1 x 1   value   <dbl> 1   113

108

answered Oct 05 '22 23:10

Drey

Related questions
                            
                                R data.table sliding window
                            
                                Network chord diagram woes in R
                            
                                Code folding in bookdown
                            
                                Does roxygen2 automatically write NAMESPACE directives for "Imports:" packages?
                            
                                Faster weighted sampling without replacement
                            
                                Insert a row in a data.table
                            
                                How to install dependencies when using "R CMD INSTALL" to install R packages?
                            
                                Conditionally replacing column values with data.table
                            
                                ggplot2 - bar plot with both stack and dodge
                            
                                R summary() equivalent in numpy
                            
                                One function to detect NaN, NA, Inf, -Inf, etc.?
                            
                                How to use reference variables by character string in a formula?
                            
                                How to properly document a S3 method of a generic from a different package, using Roxygen?
                            
                                How to apply function over each matrix element's indices
                            
                                How to add different lines for facets
                            
                                How do I convert a factor into date format?
                            
                                Convert integer to class Date
                            
                                Function not found in R doParallel 'foreach' - Error in { : task 1 failed - "could not find function "raster""
                            
                                Non-numeric Argument to Binary Operator Error in R
                            
                                Installing of SparkR

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

"Adding missing grouping variables" message in dplyr in R

Tags:

r

dplyr

acersaccharum

People also ask

1 Answers

Drey

Recent Activity

Donate For Us