Merging two data.frames by key column

Tags:

I have two dataframes. In the first one, I have a KEY/ID column and two variables:

In the second dataframe, I have a KEY/ID column and a third variable

I would like to extract the rows of the first dataframe that are also in the second dataframe by matching them according to the KEY column. I would also like to add the V3 column to final dataset.

KEY V1 V2 V3 
1   10  2  5
2   20  4 10 
3   30  6 20

This are my attempts by using the subset and the merge function

subset(data1, data1$KEY == data2$KEY) 
merge(data1, data2, by.x = "KEY", by.y = "KEY")

None of them does the task.

Any hint would be appreaciated. Thank you!

495

asked May 09 '14 09:05

user3618451

Video Answer

2 Answers

merge(data1, data2, by="KEY") should do it!

answered Sep 28 '22 11:09

Christian Borck

If what you want is an inner join, then your attempt should do it. If it doesn't check the formats of Key columns in both the table using class(data1$key).

Apart from these and the merge suggested by Christian, you can use -

library(plyr)
join(data1, data2, by="KEY", type="inner")

library(data.table)
setkey(data1, KEY)
setkey(data2, KEY)
data1[,list(data1,data2)]

answered Sep 28 '22 10:09

RHelp

Related questions
                            
                                Checking if an r package is currently attached
                            
                                How to have R corrplot title position correct?
                            
                                Avoid overlapping x-axis labels in ggplot facet grid
                            
                                Error: Data source must be a dictionary (dplyr)
                            
                                Replace the spaces between multiple (3+) capital letters
                            
                                Explaining the forecasts from an ARIMA model
                            
                                Efficiently compute the row sums of a 3d array in R
                            
                                Order data frame by two columns in R
                            
                                How to set up an R based service on a web page [closed]
                            
                                wide to long multiple measures each time
                            
                                combining two plots in r
                            
                                Circular plot with vectors in R
                            
                                Creating line plot with time scale and labels in r
                            
                                Trying to get tf-idf weighting working in R
                            
                                Extract only coefficients whose p values are significant from a logistic model
                            
                                Getting driving distance between two points (lat, lon) using R and Google Map API
                            
                                Vary colors of axis labels in R based on another variable
                            
                                Is there an expression in `R` for "output of the last command"? [duplicate]
                            
                                Plotting points with color and shape based on data variables
                            
                                Labeling center of map polygons in R ggplot

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Merging two data.frames by key column

Tags:

dataframe

r

subset