Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

R: Sort multiple columns by another data.frame?

Tags:

sorting

r

I'm trying to make sense of how to sort one data.frame based on multiple columns in another. This question does this with vectors. Can someone suggest a way to do the equivalent with data.frames?

Here's some sample data.

x1 <- data.frame(a=1:5, b=letters[1:5], c=rnorm(5))
x2 <- data.frame(a=c(4,4,2), b=c("d", "d", "b"), d=rnorm(3))

So I want to sort x2 by the first two columns of x1. My actual data is much more complicated, but this replicates the idea...

like image 587
griffin Avatar asked Oct 21 '10 17:10

griffin


1 Answers

It really depends on what your data really looks like. As it looks right now, you only need one column to sort, and that is easily done by:

x2[order(match(x2[,1],x1[,1])),]

If you need more than one column, this becomes a bit trickier. You will have to specify which one you want to sort first on, and which one second, eg :

x1 <- data.frame(a=rep(1:3,2), b=rep(letters[2:4],each=2), c=rnorm(6))
x2 <- data.frame(a=c(3,3,2), b=c("c", "d", "b"), d=rnorm(3))


x2[order(match(
  paste(x2[,1],x2[,2]),
  paste(x1[,1],x1[,2]))
),]

This sorts on the first column first, and then on the second. You have to keep in mind that you need all combinations in x2 also in x1. T

like image 170
Joris Meys Avatar answered Sep 29 '22 12:09

Joris Meys