Remove duplicated columns in matrix

Tags:

r

I have a data set of dimension 401 x 5677. Some columns of this matrix are identical but have different column names. I want to keep only one copy of each repeated column, and also get the indices j of the columns that were removed.

Let us use the following example matrix:

B=matrix(c(1,4,0,2,56,7,1,4,0,33,2,5), nrow=3)
colnames(B)<-c("a","b","c","d")

What I did so far (on my real matrix G) is:

corrG<-cor(G) 
Gtest=G
for (i in 1:nrow(corrG)){
  for (j in 1:ncol(corrG)){
    if (i<j && corrG[i,j]==1){ 
      Gtest[,j]=NA
    }
  }
}
Gfinal<-Gtest[,complete.cases(t(Gtest))]

My code returns a matrix that still contains (!) some duplicated columns. Any help?


asked Apr 09 '13 14:04

Danai C.

1 Answer

Try the duplicated function on the transpose of the matrix.

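# duplicated() compares the rows of a matrix, so transpose to compare columns
duplicated.columns <- duplicated(t(your.matrix))
# keep only the first occurrence of each column
new.matrix <- your.matrix[, !duplicated.columns]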
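
As a rough sketch, here is the same idea applied to the example matrix B from the question, which also recovers the indices of the removed columns that the question asks for. The names dup, removed.idx, removed.names and B.unique are only illustrative and do not come from the original post.

```r
# Example matrix from the question: columns "a" and "c" are identical
B <- matrix(c(1, 4, 0, 2, 56, 7, 1, 4, 0, 33, 2, 5), nrow = 3)
colnames(B) <- c("a", "b", "c", "d")

# duplicated() compares the rows of a matrix, so transpose to compare columns;
# the first occurrence of each column is marked FALSE, later copies TRUE
dup <- duplicated(t(B))

# Indices and names of the columns that will be dropped
removed.idx <- which(dup)
removed.names <- colnames(B)[dup]

# drop = FALSE keeps the result a matrix even if only one column survives
B.unique <- B[, !dup, drop = FALSE]
```

Here removed.idx should point at column "c", the second copy of "a". Note that duplicated tests for exactly equal values, whereas a test like cor(...) == 1 checks for perfect correlation and can be affected by floating-point rounding, which may be one reason the loop in the question misses some columns. If I recall correctly, the matrix method of duplicated also accepts MARGIN = 2, which would avoid the transpose.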


answered Oct 12 '22 23:10

Nishanth
