Is there a function to determine if a tibble is a grouped one or not. I use the following code to create an aggregated variable without shrinking the dataset: <pre class="prettyprint"><code>mydataset %>% select(count, group) %>% group_by(group) %>% mutate(count_group = sum(count)) </code></pre> If I use mutate, I have a grouped tibble. If I use summarise, I have a simple tibble. Is there a function, like <code>as.grouped()</code> which allows to determine the character grouped of a tibble?

The functions <code>is.grouped_df()</code> and <code>is_grouped_df()</code> will both return a TRUE logical if it's a grouped tibble, and FALSE if it is not. <pre class="prettyprint"><code># Create a tibble df <- tibble(x = c(5, 2, NA)) # Group by column 'x' gdf <- group_by(df, x) # Returns FALSE is.grouped_df(df) # Returns TRUE is.grouped_df(gdf) </code></pre>

Determine if a tibble is grouped or not

Tags:

r

dplyr

Is there a function to determine if a tibble is a grouped one or not.

I use the following code to create an aggregated variable without shrinking the dataset:

mydataset %>% select(count, group) %>%
  group_by(group) %>%
  mutate(count_group = sum(count))

If I use mutate, I have a grouped tibble. If I use summarise, I have a simple tibble.

Is there a function, like as.grouped() which allows to determine the character grouped of a tibble?

404

asked Mar 07 '17 18:03

YCR

2 Answers

Information on grouping is saved as an attribute. Example:

library("tidyverse")
a <- mtcars %>% group_by(cyl)
attributes(a)
$names
 [1] "mpg"  "cyl"  "disp" "hp"   "drat" "wt"   "qsec" "vs"   "am"   "gear"
[11] "carb"

$row.names
 [1] "Mazda RX4"           "Mazda RX4 Wag"       "Datsun 710"
 [4] "Hornet 4 Drive"      "Hornet Sportabout"   "Valiant"
 [7] "Duster 360"          "Merc 240D"           "Merc 230"
[10] "Merc 280"            "Merc 280C"           "Merc 450SE"
[13] "Merc 450SL"          "Merc 450SLC"         "Cadillac Fleetwood"
[16] "Lincoln Continental" "Chrysler Imperial"   "Fiat 128"
[19] "Honda Civic"         "Toyota Corolla"      "Toyota Corona"
[22] "Dodge Challenger"    "AMC Javelin"         "Camaro Z28"
[25] "Pontiac Firebird"    "Fiat X1-9"           "Porsche 914-2"
[28] "Lotus Europa"        "Ford Pantera L"      "Ferrari Dino"
[31] "Maserati Bora"       "Volvo 142E"

$class
[1] "grouped_df" "tbl_df"     "tbl"        "data.frame"

$groups
# A tibble: 3 x 2
    cyl .rows
  <dbl> <list>
1     4 <int [11]>
2     6 <int [7]>
3     8 <int [14]>

attributes function can be used to check for the presence of grouping attribute:

any(names(attributes(a)) == "groups")

answered Oct 28 '22 03:10

Konrad

The functions is.grouped_df() and is_grouped_df() will both return a TRUE logical if it's a grouped tibble, and FALSE if it is not.

# Create a tibble    
df <- tibble(x = c(5, 2, NA))
# Group by column 'x'
gdf <- group_by(df, x)
# Returns FALSE
is.grouped_df(df)
# Returns TRUE
is.grouped_df(gdf)

answered Oct 28 '22 04:10

Steven Livingstone

Related questions
                            
                                How to create a ribbon plot?
                            
                                Set a variable using colnames(), update data.table using := operator, variable is silently updated? [duplicate]
                            
                                3D surface plot from 2D matrix
                            
                                Why is there no NA_logical_
                            
                                RODBC loses time values of datetime when result set is large
                            
                                Calling a Rcpp function from another Rcpp function while building an R package
                            
                                Print, cat, paste in R separated by newline character
                            
                                R max function ignore NA
                            
                                pair-wise duplicate removal from dataframe [duplicate]
                            
                                Incorrect behavior with dplyr's left_join?
                            
                                Get rid of auto spacing using bquote in r
                            
                                UI elements for selecting date and time (not just date) in shiny
                            
                                A better way to push and pop to/from lists in R?
                            
                                Add a page refresh button by using R Shiny
                            
                                R: lapply function - skipping the current function loop
                            
                                Use dplyr's summarise and summarise_each together?
                            
                                Calculate marginal tax rates using R
                            
                                R print table using message
                            
                                R gsub a single double quotation mark
                            
                                geom_tile single color as 0, then color scale

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With