Shift with dynamic n (number of position lead / lag by)

Tags:

r

data.table

I have the below df:

df <- data.table(user = c('a', 'a', 'a', 'b', 'b')
                 , spend = 1:5
                 , shift_by = c(1,1,2,1,1)
                 ); df

   user spend shift_by
1:    a     1        1
2:    a     2        1
3:    a     3        2
4:    b     4        1
5:    b     5        1

I am looking to create a lead lag column only this time the n parameter in data.table's shift function is dynamic and takes df$shiftby as input. My expected result is:

df[, spend_shifted := c(NA, 1, 1, NA, 4)]; df

   user spend shift_by spend_shifted
1:    a     1        1            NA
2:    a     2        1             1
3:    a     3        2             1
4:    b     4        1            NA
5:    b     5        1             4

However, with the below attempt it gives:

df[, spend_shifted := shift(x=spend, n=shift_by, type="lag"), user]; df

   user spend shift_by spend_shifted
1:    a     1        1            NA
2:    a     2        1            NA
3:    a     3        2            NA
4:    b     4        1            NA
5:    b     5        1            NA

This is the closest example I could find. However, I need a group by and am after a data.table solution because of speed. Truly look forward to finding any ideas.

839

asked Nov 02 '21 14:11

Sweepy Dodo

Video Answer

1 Answers

I believe this will work. You can drop the newindex-column afterward.

df[, newindex := rowid(user) - shift_by]
df[newindex < 0, newindex := 0]
df[newindex > 0, spend_shifted := df[, spend[newindex], by = .(user)]$V1]
#    user spend shift_by newindex spend_shifted
# 1:    a     1        1        0            NA
# 2:    a     2        1        1             1
# 3:    a     3        2        1             1
# 4:    b     4        1        0            NA
# 5:    b     5        1        1             4

171

answered Oct 26 '22 02:10

Wimpel

Related questions
                            
                                How to correctly output Plotly plots in shiny?
                            
                                Using dplyr summarize with different operations for multiple columns
                            
                                All combinations of letters/numbers under specific conditions
                            
                                r - Convert output from sf::st_within to vector
                            
                                R - ggplot2 time series x-axis to show last day of the month
                            
                                Image output in shiny app
                            
                                Convert an integer to a string in R
                            
                                R Caret Package Error - At least one of the class levels is not a valid R variable name
                            
                                Replacing zeroes with NA for values preceding non-zero
                            
                                R - Fitting a grid over a City Map and inputting data into grid squares
                            
                                ggplot scale_color_manual with breaks does not match expected order
                            
                                Generate all unique combinations from a vector with repeating elements
                            
                                R: What are dates in a dates vector: dates or numeric values? (difference between x[i] and i)
                            
                                Clear R environment of all objetcs & packages
                            
                                Separate a shopping list into multiple columns
                            
                                How to create a geom line plot with single geom point at the end with legend
                            
                                Combine: rowwise(), mutate(), across(), for multiple functions
                            
                                Delete duplicates between groups in R
                            
                                Iterate sequentially over two lists in R
                            
                                How can I trust a library in R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With