Add missing value in column with value from row above

Tags:

r

Every week I get an incomplete dataset for an analysis. It looks like this:

df1 <- data.frame(var1 = c("a","","","b",""), 
             var2 = c("x","y","z","x","z"))

Some var1 values are missing. The dataset should end up looking like this:

df2 <- data.frame(var1 = c("a","a","a","b","b"), 
             var2 = c("x","y","z","x","z"))

Currently I use an Excel macro to do this, which makes it harder to automate the analysis. I would like to do this in R instead, but I have no idea how.

Thanks for your help.

QUESTION UPDATE AFTER COMMENT

var2 is not relevant to my question. The only thing I am trying to do is get from df1 to df2.

df1 <- data.frame(var1 = c("a","","","b",""))
df2 <- data.frame(var1 = c("a","a","a","b","b"))

683

asked Mar 01 '12 10:03

jeroen81

2 Answers

Here is one way of doing it, using run-length encoding (rle) and its inverse, inverse.rle:

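fillTheBlanks <- function(x, missing = "") {
  # Run-length encode the column so each run of blanks becomes a single entry
  runs <- rle(as.character(x))
  # Positions of blank runs; a leading blank run has nothing above it to copy
  empty <- setdiff(which(runs$values == missing), 1)
  # Replace each blank run's value with the value of the run before it
  runs$values[empty] <- runs$values[empty - 1]
  # Expand the runs back into a full-length vector
  inverse.rle(runs)
}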

df1$var1 <- fillTheBlanks(df1$var1)

The results:

df1

  var1 var2
1    a    x
2    a    y
3    a    z
4    b    x
5    b    z
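
To see why this works, it may help to look at the intermediate run-length encoding for the example column. The following is only an illustrative walk-through of the same idea on a plain character vector (the variable names x, runs, blank and filled are made up for this sketch):

```r
x <- c("a", "", "", "b", "")

# Run-length encoding collapses consecutive equal values into runs
runs <- rle(x)
runs$values    # "a" ""  "b" ""
runs$lengths   # 1 2 1 1

# Blank runs sit at positions 2 and 4; copy the value of the run before each
blank <- which(runs$values == "")
runs$values[blank] <- runs$values[blank - 1]

# Expanding the runs again gives the filled vector
filled <- inverse.rle(runs)
filled         # "a" "a" "a" "b" "b"
```

Because rle merges consecutive equal values, two blank runs can never be adjacent, so the run before a blank run always holds a real value (unless the blank run is the very first one).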

136

answered Oct 14 '22 10:10

Andrie

Here is a simpler way:

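library(zoo)
# Treat empty strings as missing values
df1$var1[df1$var1 == ""] <- NA
# Carry the last non-missing value forward; na.rm = FALSE keeps a leading NA
# so the result has the same length as the column
df1$var1 <- na.locf(df1$var1, na.rm = FALSE)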
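
If adding a dependency on zoo is not an option, the same "carry the last value forward" idea can probably be sketched in base R with cumsum. This is a minimal sketch, not part of the zoo package; the function name fill_down and its arguments are hypothetical:

```r
# Hypothetical helper: fill blanks with the most recent non-blank value
fill_down <- function(x, missing = "") {
  x <- as.character(x)
  # TRUE where the element holds a real value
  has_value <- !is.na(x) & x != missing
  # For each position, count how many real values have been seen so far
  idx <- cumsum(has_value)
  # Prepend NA so that positions before the first real value (idx == 0) map to NA
  c(NA_character_, x[has_value])[idx + 1]
}

df1 <- data.frame(var1 = c("a", "", "", "b", ""))
df1$var1 <- fill_down(df1$var1)
```

A leading blank has no value above it, so this sketch leaves it as NA rather than dropping it, which keeps the result the same length as the input.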

answered Oct 14 '22 08:10

Andrei
