I want to merge two data frames keeping the original row order of one of them (<code>df.2</code> in the example below). Here are some sample data (all values from <code>class</code> column are defined in both data frames): <pre class="prettyprint"><code>df.1 <- data.frame(class = c(1, 2, 3), prob = c(0.5, 0.7, 0.3)) df.2 <- data.frame(object = c('A', 'B', 'D', 'F', 'C'), class = c(2, 1, 2, 3, 1)) </code></pre> If I do: <pre class="prettyprint"><code>merge(df.2, df.1) </code></pre> Output is: <pre class="prettyprint"><code> class object prob 1 1 B 0.5 2 1 C 0.5 3 2 A 0.7 4 2 D 0.7 5 3 F 0.3 </code></pre> If I add <code>sort = FALSE</code>: <pre class="prettyprint"><code>merge(df.2, df.1, sort = F) </code></pre> Result is: <pre class="prettyprint"><code> class object prob 1 2 A 0.7 2 2 D 0.7 3 1 B 0.5 4 1 C 0.5 5 3 F 0.3 </code></pre> But what I would like is: <pre class="prettyprint"><code> class object prob 1 2 A 0.7 2 1 B 0.5 3 2 D 0.7 4 3 F 0.3 5 1 C 0.5 </code></pre>

You just need to create a variable which gives the row number in df.2. Then, once you have merged your data, you sort the new data set according to this variable. Here is an example : <pre class="prettyprint"><code>df.1<-data.frame(class=c(1,2,3), prob=c(0.5,0.7,0.3)) df.2<-data.frame(object=c('A','B','D','F','C'), class=c(2,1,2,3,1)) df.2$id <- 1:nrow(df.2) out <- merge(df.2,df.1, by = "class") out[order(out$id), ] </code></pre>

Check out the join function in the plyr package. It's like merge, but it allows you to keep the row order of one of the data sets. Overall, it's more flexible than merge. Using your example data, we would use <code>join</code> like this: <pre class="prettyprint"><code>> join(df.2,df.1) Joining by: class object class prob 1 A 2 0.7 2 B 1 0.5 3 D 2 0.7 4 F 3 0.3 5 C 1 0.5 </code></pre> Here are a couple of links describing fixes to the merge function for keeping the row order: http://www.r-statistics.com/2012/01/merging-two-data-frame-objects-while-preserving-the-rows-order/ http://r.789695.n4.nabble.com/patching-merge-to-allow-the-user-to-keep-the-order-of-one-of-the-two-data-frame-objects-merged-td4296561.html

Merge two data frames while keeping the original row order

Tags:

merge

sorting

dataframe

r

I want to merge two data frames keeping the original row order of one of them (df.2 in the example below).

Here are some sample data (all values from class column are defined in both data frames):

df.1 <- data.frame(class = c(1, 2, 3), prob = c(0.5, 0.7, 0.3)) df.2 <- data.frame(object = c('A', 'B', 'D', 'F', 'C'), class = c(2, 1, 2, 3, 1))

If I do:

merge(df.2, df.1)

Output is:

  class object prob 1     1      B  0.5 2     1      C  0.5 3     2      A  0.7 4     2      D  0.7 5     3      F  0.3

If I add sort = FALSE:

merge(df.2, df.1, sort = F)

Result is:

  class object prob 1     2      A  0.7 2     2      D  0.7 3     1      B  0.5 4     1      C  0.5 5     3      F  0.3

But what I would like is:

  class object prob 1     2      A  0.7 2     1      B  0.5 3     2      D  0.7 4     3      F  0.3     5     1      C  0.5

582

asked Jul 26 '13 09:07

DJack

2 Answers

You just need to create a variable which gives the row number in df.2. Then, once you have merged your data, you sort the new data set according to this variable. Here is an example :

df.1<-data.frame(class=c(1,2,3), prob=c(0.5,0.7,0.3)) df.2<-data.frame(object=c('A','B','D','F','C'), class=c(2,1,2,3,1)) df.2$id  <- 1:nrow(df.2) out  <- merge(df.2,df.1, by = "class") out[order(out$id), ]

100

answered Oct 25 '22 04:10

PAC

Check out the join function in the plyr package. It's like merge, but it allows you to keep the row order of one of the data sets. Overall, it's more flexible than merge.

Using your example data, we would use join like this:

> join(df.2,df.1) Joining by: class   object class prob 1      A     2  0.7 2      B     1  0.5 3      D     2  0.7 4      F     3  0.3 5      C     1  0.5

Here are a couple of links describing fixes to the merge function for keeping the row order:

http://www.r-statistics.com/2012/01/merging-two-data-frame-objects-while-preserving-the-rows-order/

http://r.789695.n4.nabble.com/patching-merge-to-allow-the-user-to-keep-the-order-of-one-of-the-two-data-frame-objects-merged-td4296561.html

answered Oct 25 '22 03:10

user2635373

Related questions
                            
                                Explain the quantile() function in R
                            
                                When does it pay off to use S4 methods in R programming
                            
                                Overlay normal curve to histogram in R
                            
                                Control ggplot2 legend look without affecting the plot
                            
                                How to turn a vector into a matrix in R?
                            
                                Re-ordering factor levels in data frame [duplicate]
                            
                                Boxplot show the value of mean
                            
                                Converting two columns of a data frame to a named vector
                            
                                Efficiently convert backslash to forward slash in R
                            
                                dplyr filter: Get rows with minimum of variable, but only the first if multiple minima
                            
                                How to save data file into .RData?
                            
                                Any way to make plot points in scatterplot more transparent in R?
                            
                                Split string based on alternating character in R
                            
                                Windows 7, update.packages problem: "unable to move temporary installation"?
                            
                                Combine base and ggplot graphics in R figure window
                            
                                Add multiple columns to R data.table in one function call?
                            
                                How to leave the R browser() mode in the console window?
                            
                                R: 2 functions with the same name in 2 different packages
                            
                                How can I print when using %dopar%
                            
                                How to declare a vector of zeros in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With