Get the (t-1) data within groups

Tags:

Apologies if this has been asked before, but I couldn't find any question which answers this exactly. I have a data like this:

Project        Date   price
      A   30/3/2013    2082
      B   19/3/2013    1567
      B   22/2/2013    1642
      C   12/4/2013    1575
      C    5/6/2013    1582

I want to have a column with last-instance prices by group. For example, for row 2, the last instance price for same group will be 1642. The final data will look somewhat like this:

Project        Date   price   lastPrice
      A   30/3/2013    2082           0
      B   19/3/2013    1567        1642
      B   22/2/2013    1642           0 
      C   12/4/2013    1575           0
      C    5/6/2013    1582        1575

How to do this? The main issue I'm facing is that the data may not be ordered by date so its not as if I can just take the last cell.

228

asked Jul 08 '15 07:07

UD1989

1 Answers

Here's an option. I'd also recommend to use NAs instead if 0 because 0 could be actual price.

library(dplyr)
df %>% 
  arrange(as.Date(Date, format = "%d/%m/%Y")) %>%
  group_by(Project) %>%
  mutate(lastPrice = lag(price))

# Source: local data frame [5 x 4]
# Groups: Project
# 
#   Project      Date price lastPrice
# 1       B 22/2/2013  1642        NA
# 2       B 19/3/2013  1567      1642
# 3       A 30/3/2013  2082        NA
# 4       C 12/4/2013  1575        NA
# 5       C  5/6/2013  1582      1575

Another option is to use shift from the devel version of data.table

library(data.table) ## v >= 1.9.5
setDT(df)[order(as.Date(Date, format = "%d/%m/%Y")), 
                lastPrice := shift(price), 
                by = Project]

#    Project      Date price lastPrice
# 1:       A 30/3/2013  2082        NA
# 2:       B 19/3/2013  1567      1642
# 3:       B 22/2/2013  1642        NA
# 4:       C 12/4/2013  1575        NA
# 5:       C  5/6/2013  1582      1575

Or with base R

df <- df[order(df$Project, as.Date(df$Date, format = "%d/%m/%Y")), ]
within(df, lastPrice <- ave(price, Project, FUN = function(x) c(NA, x[-length(x)])))
#   Project      Date price lastPrice
# 1       A 30/3/2013  2082        NA
# 3       B 22/2/2013  1642        NA
# 2       B 19/3/2013  1567      1642
# 4       C 12/4/2013  1575        NA
# 5       C  5/6/2013  1582      1575

As a side note, it is better to keep your date column in a Date class in the first place, so I'd recommend doing df$Date <- as.Date(df$Date, format = "%d/%m/%Y") once and for all.

176

answered Nov 03 '22 08:11

David Arenburg

Related questions
                            
                                R fastest way to get values from list as vector [duplicate]
                            
                                Operation on multiple(70) columns by another column in R
                            
                                Reformat data frame using with months spread and ordered by their calender order in R [duplicate]
                            
                                Give common missing argument in functional
                            
                                Novice plotting coordinates on map in R with csv file
                            
                                Randomly splitting data from a grouped dataset
                            
                                Select elements in named vector
                            
                                ggplot2 legend for abline and stat_smooth
                            
                                Can I use index.html and ui.r for my r shiny interface?
                            
                                How to get output file to open automatically using render() function in RMarkdown
                            
                                Write a list, as seen in R console output, into a text file
                            
                                R: return row and column numbers of matches in a data frame
                            
                                Piecewise regression with a straight line and a horizontal line joining at a break point
                            
                                Using colon (':') to access elements in an array in C++ (in Rcpp)
                            
                                R: Kaggle Titanic Dataset Random Forest NAs introduced by coercion
                            
                                Need to access Google Custom search api through R
                            
                                Adding labels to pie chart in R... Radiating "spokes"?
                            
                                Set color for NA Value with spplot in R
                            
                                R - Scraping an HTML table with rvest when there are missing <tr> tags
                            
                                R: Insert multiple rows (variable number) in data frame

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Get the (t-1) data within groups

Tags:

date

r

apply

UD1989

People also ask

1 Answers

David Arenburg

Recent Activity

Donate For Us