Cumulative product of (1-previous_record)*current_record

Tags:

The data frame contains two variables (time and rate) and 10 observations

time <- seq(1:10) 
rate <- 1-(0.99^time)
dat <- data.frame(time, rate)

I need to add a new column (called new_rate).

new_rate is defined as follows

Note: new_rate_1 is the first observation of new the column new_rate, etc.

new_rate_1 = rate_1
new_rate_2 = (1-rate_1)*rate_2
new_rate_3 = (1-rate_1)*(1-rate_2)*rate_3
new_rate_4 = (1-rate_1)*(1-rate_2)*(1-rate_3)*rate_4
...
new_rate_10 = (1-rate_1)*(1-rate_2)*(1-rate_3)*(1-rate_4)*(1-rate_5)*(1-rate_6)*(1-rate_7)*(1-rate_8)*(1-rate_9)*rate_10

How this can be done in base R or dplyr?

461

asked Jul 23 '20 00:07

user9292

Video Answer

2 Answers

cumprod to the rescue (hat-tip to @Cole for simplifying the code):

dat$rate * c(1, cumprod(1 - head(dat$rate, -1)))

The logic is that you are essentially doing a cumulative product of 1 - dat$rate, multiplied by the current step.
At the first step, you can just keep the existing value, but then you need to offset the two vectors so that the multiplication gives the desired result.

Proof:

out <- c(
dat$rate[1],
(1-dat$rate[1])*dat$rate[2],
(1-dat$rate[1])*(1-dat$rate[2])*dat$rate[3],
(1-dat$rate[1])*(1-dat$rate[2])*(1-dat$rate[3])*dat$rate[4],
(1-dat$rate[1])*(1-dat$rate[2])*(1-dat$rate[3])*(1-dat$rate[4])*dat$rate[5],
(1-dat$rate[1])*(1-dat$rate[2])*(1-dat$rate[3])*(1-dat$rate[4])*(1-dat$rate[5])*dat$rate[6],
(1-dat$rate[1])*(1-dat$rate[2])*(1-dat$rate[3])*(1-dat$rate[4])*(1-dat$rate[5])*(1-dat$rate[6])*dat$rate[7],
(1-dat$rate[1])*(1-dat$rate[2])*(1-dat$rate[3])*(1-dat$rate[4])*(1-dat$rate[5])*(1-dat$rate[6])*(1-dat$rate[7])*dat$rate[8],
(1-dat$rate[1])*(1-dat$rate[2])*(1-dat$rate[3])*(1-dat$rate[4])*(1-dat$rate[5])*(1-dat$rate[6])*(1-dat$rate[7])*(1-dat$rate[8])*dat$rate[9],
(1-dat$rate[1])*(1-dat$rate[2])*(1-dat$rate[3])*(1-dat$rate[4])*(1-dat$rate[5])*(1-dat$rate[6])*(1-dat$rate[7])*(1-dat$rate[8])*(1-dat$rate[9])*dat$rate[10]
)

all.equal(
  dat$rate * c(1, cumprod(1 - head(dat$rate, -1))),
  out
)
#[1] TRUE

134

answered Sep 22 '22 16:09

thelatemail

A straightforward math approach using cumprod should work

> c(1, head(cumprod(1 - rate), -1)) * rate
 [1] 0.01000000 0.01970100 0.02881885 0.03709807 0.04432372 0.05033049
 [7] 0.05500858 0.05830607 0.06022773 0.06083074

If you want to practice with recursions, you can try the method below

f <- function(v, k = length(v)) {
    if (k == 1) {
        return(v[k])
    }
    u <- f(v, k - 1)
    c(u, tail(u, 1) * (1 / v[k - 1] - 1) * v[k])
}

such that

> f(rate)
 [1] 0.01000000 0.01970100 0.02881885 0.03709807 0.04432372 0.05033049
 [7] 0.05500858 0.05830607 0.06022773 0.06083074

answered Sep 22 '22 16:09

ThomasIsCoding

Related questions
                            
                                Subset a dataframe using a logical vector with $
                            
                                How to add only missing Dates in Dataframe
                            
                                Pass optional arguments to function, three dots
                            
                                Restrain scattered jitter points within a violin plot by ggplot2
                            
                                ggplot2 Stacked Bar Chart - Each Bar being 100% and with percenage labels inside each bar
                            
                                R: Calculating distance in miles from one point to another
                            
                                How to compose a list of functions
                            
                                ggplot() scaling with scale::percent_format() producing strange results
                            
                                Plot y = mx + c with ggplot
                            
                                Blogdown kable tables formatting (ugly)
                            
                                Handling empty strings in string detection
                            
                                R shiny dynamic UI in insertUI
                            
                                How to convert a numeric value into a Date value
                            
                                How to filter an R simple features collection using sf methods like st_intersects()?
                            
                                R return true or false per row if string contains any of a list of words
                            
                                How to find the number of times row elements switch from negative to positive (cycles) for each factor level
                            
                                Replacement of plyr::cbind.fill in dplyr?
                            
                                Left-adjust (hjust = 0) vertical x axis labels on facets with free scale
                            
                                How to group rows and get their cell associations layed out in a list form in r?
                            
                                How to establish if the dates in a column are unique?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Cumulative product of (1-previous_record)*current_record

Tags:

iteration

r

dplyr

rolling-computation

accumulate