I am using group_split in dplyr and I am struggling to name the list after I have split by more than one column. I know how to do this when we group by one column here but I am not sure how to do this when splitting by two columns I can't share the data but if using the iris dataset, it would be similar to this (in my case both columns are factors) <pre class="prettyprint"><code>iris %>% group_split(Species, Petal.Width) </code></pre>

Use <code>dplyr::group_keys()</code> to get the grouping variables. <pre class="prettyprint lang-r prettyprint-override"><code>library(dplyr) library(stringr) # make grouped data frame iris_group <- iris %>% group_by(Species, Petal.Width) # get group keys group_name_df <- group_keys(iris_group) %>% mutate(group_name = str_c(as.character(Species),"-",Petal.Width)) # get name for each group group_name <- group_name_df$group_name # assign name to each split table df_list <- group_split(iris_group) %>% setNames(group_name) </code></pre> <pre class="prettyprint"><code>> group_name_df # A tibble: 27 x 3 Species Petal.Width group_name <fct> <dbl> <chr> 1 setosa 0.1 setosa-0.1 2 setosa 0.2 setosa-0.2 3 setosa 0.3 setosa-0.3 4 setosa 0.4 setosa-0.4 5 setosa 0.5 setosa-0.5 6 setosa 0.6 setosa-0.6 7 versicolor 1 versicolor-1 8 versicolor 1.1 versicolor-1.1 9 versicolor 1.2 versicolor-1.2 10 versicolor 1.3 versicolor-1.3 # ... with 17 more rows </code></pre> <pre class="prettyprint"><code>> df_list $`setosa-0.1` # A tibble: 5 x 5 Sepal.Length Sepal.Width Petal.Length Petal.Width Species <dbl> <dbl> <dbl> <dbl> <fct> 1 4.9 3.1 1.5 0.1 setosa 2 4.8 3 1.4 0.1 setosa 3 4.3 3 1.1 0.1 setosa 4 5.2 4.1 1.5 0.1 setosa 5 4.9 3.6 1.4 0.1 setosa $`setosa-0.2` # A tibble: 29 x 5 Sepal.Length Sepal.Width Petal.Length Petal.Width Species <dbl> <dbl> <dbl> <dbl> <fct> . . . </code></pre>

How to name a list of a group_split in dplyr when grouped by more than one column

Tags:

r

dplyr

I am using group_split in dplyr and I am struggling to name the list after I have split by more than one column.

I know how to do this when we group by one column here but I am not sure how to do this when splitting by two columns

I can't share the data but if using the iris dataset, it would be similar to this (in my case both columns are factors)

iris %>%
group_split(Species, Petal.Width)

385

asked Jul 30 '19 15:07

Scott

1 Answers

Use dplyr::group_keys() to get the grouping variables.

library(dplyr)
library(stringr)
# make grouped data frame
iris_group <- iris %>%
    group_by(Species, Petal.Width)

# get group keys
group_name_df <- group_keys(iris_group) %>%
    mutate(group_name = str_c(as.character(Species),"-",Petal.Width))

# get name for each group
group_name <- group_name_df$group_name

# assign name to each split table
df_list <- group_split(iris_group) %>%
    setNames(group_name)

> group_name_df
# A tibble: 27 x 3
   Species    Petal.Width group_name    
   <fct>            <dbl> <chr>         
 1 setosa             0.1 setosa-0.1    
 2 setosa             0.2 setosa-0.2    
 3 setosa             0.3 setosa-0.3    
 4 setosa             0.4 setosa-0.4    
 5 setosa             0.5 setosa-0.5    
 6 setosa             0.6 setosa-0.6    
 7 versicolor         1   versicolor-1  
 8 versicolor         1.1 versicolor-1.1
 9 versicolor         1.2 versicolor-1.2
10 versicolor         1.3 versicolor-1.3
# ... with 17 more rows

> df_list 
$`setosa-0.1`
# A tibble: 5 x 5
  Sepal.Length Sepal.Width Petal.Length Petal.Width Species
         <dbl>       <dbl>        <dbl>       <dbl> <fct>  
1          4.9         3.1          1.5         0.1 setosa 
2          4.8         3            1.4         0.1 setosa 
3          4.3         3            1.1         0.1 setosa 
4          5.2         4.1          1.5         0.1 setosa 
5          4.9         3.6          1.4         0.1 setosa 

$`setosa-0.2`
# A tibble: 29 x 5
   Sepal.Length Sepal.Width Petal.Length Petal.Width Species
          <dbl>       <dbl>        <dbl>       <dbl> <fct>  
.
.
.

112

answered Nov 11 '22 16:11

yusuzech

Related questions
                            
                                Installing tidyverse on Ubuntu 18.x & R 3.4.4/3.5.1
                            
                                R - finding pattern in a column and replacing it (more efficient solution)
                            
                                How to extract stan code from rstanarm object
                            
                                create a matrix in `R` and each element in that matrix is another matrix
                            
                                Function parameter; passing variable name without quotes
                            
                                Make Y-axis start at 1 instead of 0 within ggplot bar chart
                            
                                Is there a way to make a kable without lines/borders for pdf?
                            
                                Icons in data table in Shiny
                            
                                join data frames and replace one column with another
                            
                                How to fix an error when adding a manual scale in ggplot?
                            
                                How to change alpha in geom_sf?
                            
                                In R: How to replace NA in a Vector found between two integers
                            
                                autoplot does not accept ts object
                            
                                How to stop ggrepel labels moving between gganimate frames in R/ggplot2?
                            
                                Mutate_if or mutate_at in dplyr with Dates
                            
                                How to generate README.md from README.Rmd for R package?
                            
                                "recursive" self join in data.table
                            
                                How to solve an equation for a given variable in R?
                            
                                How to do faster list-column operations inside data.table
                            
                                str_extract_all: return all patterns found in string concatenated as vector

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With