Matching multiple columns on different data frames and getting other column as result

Tags:

I got two big data frames, one (df1) has this structure

   chr    init
1  12  25289552
2   3 180418785
3   3 180434779

The other (df2) has this

    V1    V2     V3
10  1     69094 medium
11  1     69094 medium
12  12 25289552 high
13  1     69095 medium
14  3 180418785 medium
15  3 180434779 low

What I'm trying to do is to add the column V3 of df2 to df1, to get the info of the mutation

   chr    init  Mut
1  12  25289552 high
2   3 180418785 medium
3   3 180434779 low

I'm trying loading both into R and then doing a for loop using match but it doesn't work. Do you know any special way to do this? I am also open to do using awk or something similar

868

asked Nov 08 '12 10:11

user976991

1 Answers

Use merge

df1 <- read.table(text='  chr    init
1  12  25289552
2   3 180418785
3   3 180434779', header=TRUE)


df2 <- read.table(text='    V1    V2     V3
10  1     69094 medium
11  1     69094 medium
12  12 25289552 high
13  1     69095 medium
14  3 180418785 medium
15  3 180434779 low', header=TRUE)


merge(df1, df2, by.x='init', by.y='V2') # this works!
       init chr V1     V3
1  25289552  12 12   high
2 180418785   3  3 medium
3 180434779   3  3    low

To get your desired output the way you show it

output <- merge(df1, df2, by.x='init', by.y='V2')[, c(2,1,4)]
colnames(output)[3] <- 'Mut' 
output
  chr      init    Mut
1  12  25289552   high
2   3 180418785 medium
3   3 180434779    low

answered Oct 05 '22 22:10

Jilber Urbina

Related questions
                            
                                How to get the name of each element of a list using lapply()?
                            
                                Removing the levels attribute in the output - R
                            
                                filtering with multiple conditions on many columns using dplyr
                            
                                Heatmap plot by value using ggmap
                            
                                How to do range grouping on a column using dplyr?
                            
                                Error in sending email through Gmail by using mailR
                            
                                tidyr use separate_rows over multiple columns
                            
                                What is difference between eval_metric and feval in xgboost?
                            
                                Google Analytics does not work with blogdown
                            
                                Add group mean line to barplot with ggplot2
                            
                                How do we configure shinyserver open source to support concurrent users
                            
                                Automatically coerce all column types of one data frame to the type of another prior to binding
                            
                                What is the best way to avoid passing a data frame around?
                            
                                Recommendations for database with R [closed]
                            
                                Alternatives to system() in R for calling sed, rsync, ssh etc.: Do functions exist, should I write my own, or am I missing the point?
                            
                                Plotting a raster behind a shapefile
                            
                                Add a vertical line with ggplot when x-axis is a factor
                            
                                obtain hour from DateTime vector
                            
                                R: eval(parse(...)) is often suboptimal
                            
                                Comparison of Python and R vocabularies

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Matching multiple columns on different data frames and getting other column as result

Tags:

matching

dataframe

r

multiple-columns

user976991

People also ask

1 Answers

Jilber Urbina

Recent Activity

Donate For Us