Filling data frame with previous row value

Tags:

I have a data frame that has 2 columns.

column1 has random numbers in column2 is a place holding column for what i want column3 to look like

  random    temp
0.502423373 1
0.687594055 0
0.741883739 0
0.445364032 0
0.50626137  0.5
0.516364981 0
...

I want to fill column3 so it takes the last non-zero number (1 or .5 in this example) and continuously fills the following rows with that value until it hits a row with a different number. then it repeats the process for the entire column.

random     temp state
0.502423373 1   1
0.687594055 0   1
0.741883739 0   1
0.445364032 0   1
0.50626137  0.5 0.5
0.516364981 0   0.5
0.807804708 0   0.5
0.247948445 0   0.5
0.46573337  0   0.5
0.103705154 0   0.5
0.079625868 1   1
0.938928944 0   1
0.677713019 0   1
0.112231619 0   1
0.165907178 0   1
0.836195267 0   1
0.387712998 1   1
0.147737077 0   1
0.439281543 0.5 0.5
0.089013503 0   0.5
0.84174743  0   0.5
0.931738707 0   0.5
0.807955172 1   1

thanks for any and all help

276

asked Dec 06 '13 04:12

user2813055

2 Answers

Perhaps you can make use of na.locf from the "zoo" package after setting values of "0" to NA. Assuming your data.frame is called "mydf":

mydf$state <- mydf$temp
mydf$state[mydf$state == 0] <- NA

library(zoo)
mydf$state <- na.locf(mydf$state)
#      random temp state
# 1 0.5024234  1.0   1.0
# 2 0.6875941  0.0   1.0
# 3 0.7418837  0.0   1.0
# 4 0.4453640  0.0   1.0
# 5 0.5062614  0.5   0.5
# 6 0.5163650  0.0   0.5

If there were NA values in your original data.frame in the "temp" column, and you wanted to keep them as NA in the newly generated "state" column too, that's easy to take care of. Just add one more line to reintroduce the NA values:

mydf$state[is.na(mydf$temp)] <- NA

answered Oct 20 '22 05:10

A5C1D2H2I1M1N2O1R2T1

Inspired by the solution of @Ananda Mahto, this is an adaption of the internal code of na.locf that works directly with 0's instead of NAs. Then you don't need the zoo package and you don't need to do the preprocessing of changing the values to NA. Benchmarktests show that this is about 10 times faster than the original version.

locf.0 <- function(x) {
  L <- x!=0
  idx <- c(0, which(L))[cumsum(L) + 1]
  return(x[idx])
} 
mydf$state <- locf.0(mydf$temp)

answered Oct 20 '22 04:10

shadow

Related questions
                            
                                Sequential citation numbering in R: separate numbers by hyphen, if sequential - add comma if not
                            
                                Efficiently convert a date column in data.table
                            
                                Opposite of unnest_tokens
                            
                                Get the name of a list item created with purrr::map
                            
                                Atom editor r-language error - Failed to load snippets
                            
                                if command to test for integer(0)
                            
                                save an R dataframe with the name specified by a string
                            
                                XPT to CSV Conversion? [closed]
                            
                                Multiple density graphs different groups (based on factor level) using plyr
                            
                                How to create a datetime object from separate date fields?
                            
                                Legends in R plots
                            
                                Plotting data against time in R
                            
                                Subset R data frame contingent on the value of duplicate variables
                            
                                how to install R packages "RNetCDF" and "ncdf" on Ubuntu?
                            
                                Producing numeric sequences in R using standard patterns
                            
                                Paste together each pair of columns in a data frame in R?
                            
                                Delete a period and a number at the end of a character string
                            
                                Increasing the legend range in geom_tile manually
                            
                                How do you draw a boxplot without specifying x axis?
                            
                                How to convert factor to numeric in R without NAs introduced by coercion warning message

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Filling data frame with previous row value

Tags:

dataframe

r

calculated-columns

user2813055

People also ask

2 Answers

A5C1D2H2I1M1N2O1R2T1

shadow

Recent Activity

Donate For Us