How do I Difference Panel Data in R

Tags:

I am wondering if there is any easy R commands or packages that will all allow me to easily add variables to data.frames which are the "difference" or change of over time of those variables.

If my data looked like this:

set.seed(1)
MyData <- data.frame(Day=0:9 %% 5+1, 
                 Price=rpois(10,10),
                 Good=rep(c("apples","oranges"), each=5))
MyData

   Day Price    Good
1    1     8  apples
2    2    10  apples
3    3     7  apples
4    4    11  apples
5    5    14  apples
6    1    12 oranges
7    2    11 oranges
8    3     9 oranges
9    4    14 oranges
10   5    11 oranges

Then after "first differencing" the price variable, my data would look like this.

   Day Price    Good P1d
1    1     8  apples  NA
2    2    10  apples   2
3    3     7  apples  -3
4    4    11  apples   4
5    5    14  apples   3
6    1    12 oranges  NA
7    2    11 oranges  -1
8    3     9 oranges  -2
9    4    14 oranges   5
10   5    11 oranges  -3

449

asked Mar 21 '14 11:03

Francis Smart

2 Answers

ave

transform(MyData, P1d = ave(Price, Good, FUN = function(x) c(NA, diff(x))))

ave/gsubfn

The last solution can be shorteneed slightly using fn$ in the gsubfn package:

library(gsubfn)
transform(MyData, P1d = fn$ave(Price, Good, FUN = ~ c(NA, diff(x))))

dplyr

library(dplyr)

MyData %>% 
  group_by(Good) %>% 
  mutate(P1d = Price - lag(Price)) %>% 
  ungroup

data.table

library(data.table)

dt <- data.table(MyData)
dt[, P1d := c(NA, diff(Price)), by = Good]

Update

dplyr now uses %>% instead of %.% .

124

answered Sep 29 '22 16:09

G. Grothendieck

One can easily do it like this:

library(reshape2)
library(dplyr)

MyNewData <- 
 MyData %.%
 melt(id = c("Good", "Day")) %.%
 dcast(Day ~ Good) %.%
 mutate(apples  = apples - lag(apples),
     oranges = oranges - lag(oranges)) %.%
 melt(id = "Day", variable.name = "Good", value.name = "P1d") %.%
 merge(MyData) %.%
 arrange(Good, Day)

Regards

answered Sep 29 '22 16:09

Miha Trošt

Related questions
                            
                                Largest possible values in R [duplicate]
                            
                                How to get frequencies then add it as a variable in an array?
                            
                                Extract name hierarchy for each leaf of a nested list
                            
                                R pairwise product
                            
                                Format to find day of week not working in Windows
                            
                                Overlapping Genomic Ranges
                            
                                time series with 10 min frequency in R
                            
                                Partial string matching with grep and regular expressions
                            
                                R, get longitude/latitude data for cities and add it to my dataframe [closed]
                            
                                Evaluate an expression within an environment inside a function
                            
                                How do I make the y-axis values bold in R?
                            
                                Right (or left) side trimmed mean
                            
                                Solve simple equation in R
                            
                                R tm package: utf-8 text
                            
                                confusion matrix from rpart
                            
                                Subset not based on exact match, but partial in R
                            
                                Plot - Invalid xlim value
                            
                                hydrological year time series
                            
                                Adding a bar chart to a ggplot2 legend
                            
                                Trim a string to a specific number of characters in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I Difference Panel Data in R

Tags:

r

time-series

panel-data