Consider the following dataset: <pre class="prettyprint"><code>df = data.frame(id = c(1,1,1,2,2,2,3,3,3), time = c(1,2,3,1,2,3,1,2,3), x = c(8,8,9,7,7,7,7,7,8), id_x = c(1,1,2,3,3,3,4,4,5)) </code></pre> I want to compute <code>id_x</code> which identifies each unique combination of variables <code>id</code> and <code>x</code> (preferably using <code>dplyr</code>). In Stata, I can do the following: <pre class="prettyprint"><code>Stata clear input id time x 1 1 8 1 2 8 1 3 9 2 1 7 2 2 7 2 3 7 3 1 7 3 2 7 3 3 8 end egen id_x = group(id, x) list, separator(0) +----------------------+ | id time x id_x | |----------------------| 1. | 1 1 8 1 | 2. | 1 2 8 1 | 3. | 1 3 9 2 | 4. | 2 1 7 3 | 5. | 2 2 7 3 | 6. | 2 3 7 3 | 7. | 3 1 7 4 | 8. | 3 2 7 4 | 9. | 3 3 8 5 | +----------------------+ </code></pre>

We can use <code>dplyr::group_indices</code>: <pre class="prettyprint lang-r prettyprint-override"><code>library(dplyr) #df1 %>% mutate(id_xx = group_indices(.,id,x)) df1 %>% group_by(id,x) %>% mutate(id_xx = group_indices()) #> # A tibble: 9 x 5 #> # Groups: id, x [5] #> id time x id_x id_xx #> <dbl> <dbl> <dbl> <dbl> <int> #> 1 1 1 8 1 1 #> 2 1 2 8 1 1 #> 3 1 3 9 2 2 #> 4 2 1 7 3 3 #> 5 2 2 7 3 3 #> 6 2 3 7 3 3 #> 7 3 1 7 4 4 #> 8 3 2 7 4 4 #> 9 3 3 8 5 5 </code></pre> <h3>Data:</h3> <pre class="prettyprint lang-r prettyprint-override"><code>df1 <- data.frame(id = c(1,1,1,2,2,2,3,3,3), time = c(1,2,3,1,2,3,1,2,3), x = c(8,8,9,7,7,7,7,7,8), id_x = c(1,1,2,3,3,3,4,4,5)) </code></pre>

While M-- answer was completely correct answer at the time of writing, <code>dplyr</code> has deprecated <code>group_indices()</code>, so the code is now <pre class="prettyprint"><code>df1 %>% group_by(complex, palliative) %>% mutate(cplx_pal = cur_group_id()) </code></pre>

Equivalent for Stata's egen group() function

Tags:

dataframe

r

dplyr

stata

Consider the following dataset:

df = data.frame(id = c(1,1,1,2,2,2,3,3,3), 
                time = c(1,2,3,1,2,3,1,2,3), 
                x = c(8,8,9,7,7,7,7,7,8), 
                id_x = c(1,1,2,3,3,3,4,4,5))

I want to compute id_x which identifies each unique combination of variables id and x (preferably using dplyr).

In Stata, I can do the following:

Stata
clear

input id time x
1 1 8
1 2 8
1 3 9
2 1 7
2 2 7
2 3 7
3 1 7
3 2 7
3 3 8
end

egen id_x = group(id, x)

list, separator(0)

     +----------------------+
     | id   time   x   id_x |
     |----------------------|
  1. |  1      1   8      1 |
  2. |  1      2   8      1 |
  3. |  1      3   9      2 |
  4. |  2      1   7      3 |
  5. |  2      2   7      3 |
  6. |  2      3   7      3 |
  7. |  3      1   7      4 |
  8. |  3      2   7      4 |
  9. |  3      3   8      5 |
     +----------------------+

402

asked Jun 21 '19 20:06

safex

2 Answers

We can use dplyr::group_indices:

library(dplyr)

#df1 %>% mutate(id_xx = group_indices(.,id,x))
df1 %>% group_by(id,x) %>% mutate(id_xx = group_indices())
#> # A tibble: 9 x 5
#> # Groups:   id, x [5]
#>      id  time     x  id_x id_xx
#>   <dbl> <dbl> <dbl> <dbl> <int>
#> 1     1     1     8     1     1
#> 2     1     2     8     1     1
#> 3     1     3     9     2     2
#> 4     2     1     7     3     3
#> 5     2     2     7     3     3
#> 6     2     3     7     3     3
#> 7     3     1     7     4     4
#> 8     3     2     7     4     4
#> 9     3     3     8     5     5

Data:

df1 <-  data.frame(id = c(1,1,1,2,2,2,3,3,3), 
                time = c(1,2,3,1,2,3,1,2,3), 
                x = c(8,8,9,7,7,7,7,7,8), 
                id_x = c(1,1,2,3,3,3,4,4,5))

182

answered Oct 26 '22 15:10

M--

While M-- answer was completely correct answer at the time of writing, dplyr has deprecated group_indices(), so the code is now

df1 %>% group_by(complex, palliative) %>% mutate(cplx_pal = cur_group_id())

answered Oct 26 '22 13:10

Brent

Related questions
                            
                                In sync sliderInput and textInput
                            
                                ggplot2 geom_tile: how to have no spacing between lines when plotting non-continuous data
                            
                                Source nested R files within Rmarkdown document
                            
                                using mget within a function in R
                            
                                R how to install a specified version of a bioconductor package?
                            
                                R - file.copy function
                            
                                gather on first two rows
                            
                                Display python plotly graph in RMarkdown html document
                            
                                ggplot2 add a guide for abbreviations
                            
                                How to set the font size of data label in fviz_pca_var of factoextra
                            
                                Rstudio is painfully slow
                            
                                Meaning of error using . shorthand inside dplyr function
                            
                                Different hard threshold for each column
                            
                                Copy On Modify; What Happens When You Run This Code? x <- list(1:10); x[[2]] <- x
                            
                                How does subsetting with NA work?
                            
                                Generate progress bar in modal in shiny app, that closes automatically
                            
                                Visualising big set of points with third feature as a color - a way to improve a speed
                            
                                converting a dgCMatrix to data frame
                            
                                What is causing this error? Coefficients not defined because of singularities
                            
                                Adding boxplot below density plot

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With