Subset a matrix using a column from another matrix in R

Tags:

I have a matrix X1 with 6 columns. Column 3 in this X1 matrix contains RouteNo. I also have a vector V1 which is extracted from another matrix. Few values from this vector matches with RouteNo in X1. The task is to take a subset from matrix X1 where RouteNo from X1 matches with RouteNo from V1. V1 contains extra RouteNo than in matrix X1.

Click to copy

> X1
    V1 V2       V3 V4   V5 V6
1    1  2 84072082  1 2000  0
2    2  2 84046006  1 2000  0
3    3  2 84046006  1 2001  0
4    4  2 84046006  1 2002  0
5    5  2 84021002  1 2002  0
6    6  2 84021002  1 2003  0
7    7  2 84021002  1 2003  0
8    8  2 84021002  1 2004  0
9    9  2 84021002  1 2005  0
10  10  2 84021002  1 2005  0
11  11  2 12468015  1 2006  0
12  12  2 12468015  1 2007  0
13  96  2 12468015  2 2000  0
> V1
 [1] 84021001 84021002 84021105 84046006 84046007 84046008 84046009 84046011 84046013 84046014
> n2 = subset(X1, subset = X1[,3] %in% V1)
> dim(n2)
[1] 0 6

I tried using subset function but I am not getting the desired result. I expect to get the matrix as below

Click to copy

2    2  2 84046006  1 2000  0
3    3  2 84046006  1 2001  0
4    4  2 84046006  1 2002  0
5    5  2 84021002  1 2002  0
6    6  2 84021002  1 2003  0
7    7  2 84021002  1 2003  0
8    8  2 84021002  1 2004  0
9    9  2 84021002  1 2005  0

Is there any other way to get the result? Any help is appreciated. Thank in advance.

940

asked Dec 12 '11 20:12

NB_R

1 Answers

You are running into problems with scoping. You have a column named V1 in your data.frame x1. Change your look up vector to a name that isn't a column name and everything should be fine, i.e.:

Click to copy

subset(x1, V3 %in% v1)

or use [ to index directly

Click to copy

x1[x1$V3 %in% V1,]

The proof is in the pudding:

Click to copy

txt1 <- "    V1 V2       V3 V4   V5 V6
1    1  2 84072082  1 2000  0
2    2  2 84046006  1 2000  0
3    3  2 84046006  1 2001  0
4    4  2 84046006  1 2002  0
5    5  2 84021002  1 2002  0
6    6  2 84021002  1 2003  0
7    7  2 84021002  1 2003  0
8    8  2 84021002  1 2004  0
9    9  2 84021002  1 2005  0
10  10  2 84021002  1 2005  0
11  11  2 12468015  1 2006  0
12  12  2 12468015  1 2007  0
13  96  2 12468015  2 2000  0"
txt2 <- "84021001 84021002 84021105 84046006 84046007 84046008 84046009 84046011 84046013 84046014"

x1 <- read.table(textConnection(txt1))
#Note the lowercase
v1 <- read.table(textConnection(txt2))
#Make "V1" as you have it
V1 <- v1 

> #Bad
> dim(subset(x1, V3 %in% V1))
[1] 0 6
> #Good
> dim(subset(x1, V3 %in% v1))
[1] 9 6
#Does subset method equal the direct indexing method
> all.equal(subset(x1, V3 %in% v1),x1[x1$V3 %in% V1,])
[1] TRUE

103

answered Oct 13 '22 02:10

Chase

Related questions
                            
                                R Question Number of Unique Combinations of A,A,A,A,B,B,B,B,B
                            
                                R: How to fit a large dataset with a combination of distributions?
                            
                                running an R script in batch mode without the command prompt popping up
                            
                                Assigning results of a for loop to an empty matrix
                            
                                Select several subsets by taking different row interval and appy function to all subsets
                            
                                Documenting R.oo classes/methods with Roxygen
                            
                                How do you label a horizontal line when the x axis is categorical?
                            
                                Quadratically constrained quadratic programming in R
                            
                                Calling R from Java - Faster alternative to RCaller
                            
                                R and Brew:syntax issue
                            
                                Create vector of random numbers (size of vector not known at run-time)
                            
                                coerce a multiple output in a new dataframe using ddply
                            
                                how to fill contour colors and write axes names in RSM (R)
                            
                                How I can load a R script into JRI and execute from Java?
                            
                                Porting existing C++ code to R
                            
                                Rgooglemaps plotting of text
                            
                                RStudio not picking the encoding I'm telling it to use when reading a file
                            
                                Run a system command as sudo from R?
                            
                                Running cor() (or any variant) over a sparse matrix in R
                            
                                Trouble installing rgdal

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Subset a matrix using a column from another matrix in R

Tags:

r

subset

NB_R

People also ask

1 Answers

Chase

Recent Activity

Donate For Us