Lag with missing data

Tags: r, diff, missing-data

I have a dataset of state-level approval ratings, and I need to lag one of the variables by two years.

The data is annual and spans 1970 to 2008. Obviously, lagging will cost me some observations (e.g., 1970 has no 1968 value to draw on), and I'm fine with losing those. But when I run the lag with diff, I get the following error saying that the replacement does not match the data:

> df$lagvar <- diff(df$var, lag=2)
Error in `$<-.data.frame`(`*tmp*`, "lagvar", value = c(-0.4262501,  : 
replacement has 230 rows, data has 232

I've searched around, but cannot find a solution. Any ideas on how to get around this?

351

asked May 01 '13 21:05

user2340913

1 Answer

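diff does not pad its result with leading NAs. With lag=2 the result is two elements shorter than the input (230 values for 232 rows), which is why the assignment fails. You have to add the NAs yourself.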

df$lagvar <- c(NA, NA, diff(df$var, lag=2))
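To make the length mismatch concrete, here is a minimal sketch on a toy data frame. The years and values are made up for illustration and only stand in for the asker's df. It also shows, as an aside, how to get the value from two years earlier rather than the two-year difference, in case that is what "lag" was meant to be.

```r
# Toy data standing in for df (values are invented for illustration)
df <- data.frame(year = 1970:1975, var = c(10, 12, 15, 11, 18, 20))

# diff() with lag = 2 computes var[t] - var[t-2], so it has 2 fewer elements
d <- diff(df$var, lag = 2)
length(d)    # two shorter than nrow(df), hence the "replacement has ... rows" error

# Pad the front with NA so the result lines up with the original rows
df$lagvar <- c(NA, NA, d)

# If the lagged *value* is wanted instead of the difference,
# shift the column down by two and drop the last two elements
df$var_lag2 <- c(NA, NA, head(df$var, -2))

df
```

Note that diff gives the change over two years, while the shifted column gives the level from two years earlier; which one is appropriate depends on the model.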

You could write a simple wrapper function to do it for you. Something like this, perhaps:

mydiff <- function(x, ...) {
  d <- diff(x, ...)
  # diff() drops the leading elements; pad with NA to restore the original length
  c(rep(NA, NROW(x) - NROW(d)), d)
}
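Because the question describes state-level data, the difference probably should not run across state boundaries. The following is a sketch of one way to apply the wrapper within each state using base R's ave. The column names state and year and all values are assumptions for illustration; the original post does not show the data frame's layout.

```r
# Same wrapper as above, repeated so this block runs on its own
mydiff <- function(x, ...) {
  d <- diff(x, ...)
  c(rep(NA, NROW(x) - NROW(d)), d)
}

# Hypothetical panel: two states, four years each (made-up values)
df <- data.frame(
  state = rep(c("AL", "AK"), each = 4),
  year  = rep(1970:1973, times = 2),
  var   = c(50, 52, 55, 51, 40, 43, 41, 47)
)

# Sort by year within state so that "lag = 2" really means two years back
df <- df[order(df$state, df$year), ]

# ave() applies the function separately per state and returns a vector
# of the same length as df$var, so the first two rows of each state are NA
df$lagvar <- ave(df$var, df$state, FUN = function(x) mydiff(x, lag = 2))

df
```

This assumes every state has one row per year with no gaps; if years are missing, positions no longer correspond to calendar years and the data would need to be completed first.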

109

answered Nov 15 '22 00:11

Joshua Ulrich
