Replace rows in one data frame if they appear in another data frame

Tags:

r

I have the following two data frames:

df1

id   V1 V2 V3
210  4  NA 7
220  NA NA NA
230  2  0  1
240  4  NA NA
250  1  9  2
260  6  5  NA
270  0  NA 3

df2

id   V1 V2 V3
210  4  3  7
240  4  3  NA
270  0  3 3

df2 is all the instances where df1 has NA in V2 and at least one numeric value in V1 or V3. Where this condition holds, I have changed the NAs in V2 to '3'.

I would now like to put these dfs back together. Specifically, I would like to replace all the rows in df1 that appear in df2. My expected output is this:

id   V1 V2 V3
210  4  3 7
220  NA NA NA
230  2  0  1
240  4  3 NA
250  1  9  2
260  6  5  NA
270  0  3 3

I have looked at this question, but it does this based on specific values in the df. And this question is similarly answered by specifying the actual values to replace. My real df is huge and all I want to do is put the two dfs together, replacing the rows that appear in both with df2.

492

asked Jun 18 '15 09:06

szi

1 Answers

A simple match call that will identify the instances that match df2$id within df1$id (in the correct appearance order) will solve this problem

df1[match(df2$id, df1$id), ] <- df2
df1
#    id V1 V2 V3
# 1 210  4  3  7
# 2 220 NA NA NA
# 3 230  2  0  1
# 4 240  4  3 NA
# 5 250  1  9  2
# 6 260  6  5 NA
# 7 270  0  3  3

Edit: As @plafort points out, you could avoid creating df2 in the first place, but I would go with vectorized approach instead of using apply. For example

indx <- rowSums(is.na(df1)) != (ncol(df1) - 1) & is.na(df1$V2)
df1[indx, "V2"] <- 3

137

answered Sep 19 '22 00:09

David Arenburg

Related questions
                            
                                Replicate vector in R
                            
                                read.xls - read in variable-length list of sheets, with their names
                            
                                Subsetting a dataframe by the amount of repetition [duplicate]
                            
                                R grouping by condition in data.table
                            
                                get first entries in rows of list?
                            
                                Print correlation data in same plot position across facets
                            
                                How to display "beautiful" glm and multinom table with Rmd and Knit HTML?
                            
                                Fast correlation in R using C and parallelization
                            
                                How to use msgbox in R [closed]
                            
                                geom_ribbon doesn't work - Error in eval(expr, envir, enclos) : object 'variable' not found
                            
                                data.table or dplyr - data manipulation
                            
                                How to sort all dataframes in a list of dataframes on the same column?
                            
                                convert to local time zone using latitude and longitude?
                            
                                Count occurrences of value in a set of variables in R (per row)
                            
                                Too many open devices r
                            
                                Add a segment only to one facet using ggplot2
                            
                                filter rows by a function over values of each row, data.table
                            
                                Get Value of last non-empty column for each row
                            
                                R - replace part of a string using wildcards
                            
                                Subtract specific rows

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Replace rows in one data frame if they appear in another data frame

Tags:

r

szi

People also ask

1 Answers

David Arenburg

Recent Activity

Donate For Us