So I am trying to translate some dplyr code. I have tried to get help from a package that translates dplyr to data.table but it still does not work. The error is with <code>row_number</code> from <code>dplyr</code>.. I need all the steps in the <code>dplyr</code> code (even though they don't make sense here with <code>mtcars</code>) <pre class="prettyprint lang-r prettyprint-override"><code>library(dplyr) library(dtplyr) # from https://github.com/tidyverse/dtplyr library(data.table) mtcars %>% distinct(mpg, .keep_all = TRUE) %>% group_by(am) %>% arrange(mpg, .by_group = TRUE) %>% mutate(row_num = LETTERS[row_number()]) %>% ungroup() # using dtplyr dt <- lazy_dt(mtcars) dt %>% distinct(mpg, .keep_all = TRUE) %>% group_by(am) %>% arrange(mpg, .by_group = TRUE) %>% mutate(row_num = LETTERS[row_number()]) %>% ungroup() %>% show_query() #> unique(`_DT1`, by = "mpg")[order(am, mpg)][, `:=`(row_num = c("A", #> "B", "C", "D", "E", "F", "G", "H", "I", "J", "K", "L", "M", "N", #> "O", "P", "Q", "R", "S", "T", "U", "V", "W", "X", "Y", "Z")[row_number()]), #> keyby = .(am)] # I then use the query from dtplyr DT <- as.data.table(mtcars) unique(DT, by = "mpg")[order(am, mpg)][, `:=`(row_num = c("A", "B", "C", "D", "E", "F", "G", "H", "I", "J", "K", "L", "M", "N", "O", "P", "Q", "R", "S", "T", "U", "V", "W", "X", "Y", "Z")[row_number()]), keyby = .(am)] #> row_number() should only be called in a data context </code></pre> Created on 2019-07-14 by the reprex package (v0.3.0)

Might I recommend the rowid function? It does the grouping step "under the hood" you might find it looks cleaner: <pre class="prettyprint"><code>unique(DT, by='mpg')[order(am, mpg), row_num := LETTERS[rowid(am)]] </code></pre> if you love chaining, you could also get everything inside <code>[]</code>: <pre class="prettyprint"><code>DT[ , .SD[1L], by = mpg ][order(am, mpg), row_num := LETTERS[rowid(am)]] </code></pre>

Translating dplyr to data.table

Tags:

r

data.table

dplyr

dtplyr

So I am trying to translate some dplyr code. I have tried to get help from a package that translates dplyr to data.table but it still does not work. The error is with row_number from dplyr..

I need all the steps in the dplyr code (even though they don't make sense here with mtcars)

Click to copy

library(dplyr)
library(dtplyr) # from https://github.com/tidyverse/dtplyr
library(data.table)

mtcars %>% 
  distinct(mpg, .keep_all = TRUE) %>% 
  group_by(am) %>% 
  arrange(mpg, .by_group = TRUE) %>% 
  mutate(row_num = LETTERS[row_number()]) %>% 
  ungroup() 

# using dtplyr
dt <- lazy_dt(mtcars)

dt %>% 
  distinct(mpg, .keep_all = TRUE) %>% 
  group_by(am) %>% 
  arrange(mpg, .by_group = TRUE) %>% 
  mutate(row_num = LETTERS[row_number()]) %>% 
  ungroup() %>% 
  show_query()
#> unique(`_DT1`, by = "mpg")[order(am, mpg)][, `:=`(row_num = c("A", 
#> "B", "C", "D", "E", "F", "G", "H", "I", "J", "K", "L", "M", "N", 
#> "O", "P", "Q", "R", "S", "T", "U", "V", "W", "X", "Y", "Z")[row_number()]), 
#>     keyby = .(am)]

# I then use the query from dtplyr 
DT <- as.data.table(mtcars)
unique(DT, by = "mpg")[order(am, mpg)][, `:=`(row_num = c("A", 
                                                              "B", "C", "D", "E", "F", "G", 
                                                              "H", "I", "J", "K", "L", "M", 
                                                              "N", "O", "P", "Q", "R", "S", 
                                                              "T", "U", "V", "W", "X", "Y", 
                                                              "Z")[row_number()]), keyby = .(am)]

#> row_number() should only be called in a data context

^{Created on 2019-07-14 by the reprex package (v0.3.0)}

881

asked Jul 13 '19 23:07

xhr489

Video Answer

1 Answers

Might I recommend the rowid function? It does the grouping step "under the hood" you might find it looks cleaner:

Click to copy

unique(DT, by='mpg')[order(am, mpg), row_num := LETTERS[rowid(am)]]

if you love chaining, you could also get everything inside []:

Click to copy

DT[ , .SD[1L], by = mpg
   ][order(am, mpg), row_num := LETTERS[rowid(am)]]

149

answered Oct 12 '22 08:10

MichaelChirico

Related questions
                            
                                How to look for a certain part in a string and only keep that part
                            
                                Picking individual colours from a RColorBrewer palette as a scale_colour_manual() value in ggplot2
                            
                                R: removing numbers at begin and end of a string
                            
                                R-invalid multibyte string 1
                            
                                How to plot 2 categorical variables on X-axis and two continuous variables as "fill" using ggplot2 package?
                            
                                List file information in a text file for all the files in a directory
                            
                                R -apply- convert many columns from numeric to factor
                            
                                package ‘tidyverse’ is not available
                            
                                Moving some rows of a data frame to the end based on a match vector
                            
                                Remove columns the tidyeval way
                            
                                Color points by their occurrence count in ggplot2 geom_count
                            
                                Change the order of stacked fill columns in ggplot2
                            
                                extracting more than 20 variables by importance via varImp
                            
                                gcc: error: libgomp.spec: No such file or directory with Amazon Linux 2017.09.1
                            
                                R markdown to PDF - Printing console output
                            
                                Extraction of POSIXlt component runs fine in R 3.4.4, but errors in R 3.5.0. Why?
                            
                                Create ggplot2 function and specify arguments as variables in data as per ggplot2 standard functionality
                            
                                Automatically extracting strings with mismatched spellings from a column and replacing them in R [closed]
                            
                                Convert multiple columns of a data frame from string to numeric in R
                            
                                How to remove certain items from a vector?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Translating dplyr to data.table

Tags:

r

data.table

dplyr

dtplyr

xhr489

People also ask

Video Answer

1 Answers

MichaelChirico

Recent Activity

Donate For Us