Fastest way for calculating rank of 2*2 matrix?

Tags:

The recommended way to calculate the rank of a matrix in R seems to be qr:

X <- matrix(c(1, 2, 3, 4), ncol = 2, byrow=T)
Y <- matrix(c(1.0, 1, 1, 1), ncol = 2, byrow=T)
qr(X)$rank
[1] 2
qr(Y)$rank
[1] 1

I was able to improve efficiency by modifying this function for my specific case:

qr2 <- function (x, tol = 1e-07) { 
  if (!is.double(x)) 
  storage.mode(x) <- "double"
  p <- as.integer(2)
  n <- as.integer(2)
  res <- .Fortran("dqrdc2", qr = x, n, n, p, as.double(tol),
                  rank = integer(1L), qraux = double(p), pivot = as.integer(1L:p), 
                  double(2 * p), PACKAGE = "base")[c(1, 6, 7, 8)]
  class(res) <- "qr"
  res}

qr2(X)$rank
[1] 2
qr2(Y)$rank
[1] 1

library(microbenchmark)
microbenchmark(qr(X)$rank,qr2(X)$rank,times=1000)
Unit: microseconds
         expr    min     lq median     uq      max
1  qr(X)$rank 41.577 44.041 45.580 46.812 1302.091
2 qr2(X)$rank 19.403 21.251 23.099 24.331   80.997

Using R, is it possible to calculate the rank of a 2*2 matrix even faster?

989

asked Aug 30 '12 16:08

2 Answers

We can do even better using RcppEigen.

// [[Rcpp::depends(RcppEigen)]]
#include <RcppEigen.h>
using namespace Rcpp;
using   Eigen::Map;
using   Eigen::MatrixXd;
using   Eigen::FullPivHouseholderQR;
typedef  Map<MatrixXd>  MapMatd;

//calculate rank of a matrix using QR decomposition with pivoting 

// [[Rcpp::export]]
int rankEigen(NumericMatrix  m) {
   const MapMatd  X(as<MapMatd>(m));
   FullPivHouseholderQR<MatrixXd> qr(X);
   qr.setThreshold(1e-7);
   return qr.rank();
}

Benchmarks:

microbenchmark(rankEigen(X), qr3(X),times=1000)
Unit: microseconds
         expr   min    lq median    uq    max neval
 rankEigen(X) 1.849 2.465  2.773 3.081 18.171  1000
       qr3(X) 5.852 6.469  7.084 7.392 48.352  1000

However, the tolerance is not exactly the same as in LINPACK, because of different tolerance definitions:

test <- sapply(1:200, function(i) {
  Y <- matrix(c(10^(-i), 10^(-i), 10^(-i), 10^(-i)), ncol = 2, byrow=T)
  qr3(Y) ==  rankEigen(Y)
})

which.min(test)
#[1] 159

The threshold in FullPivHouseholderQR is defined as:

A pivot will be considered nonzero if its absolute value is strictly greater than |pivot|≤ threshold * |maxpivot| where maxpivot is the biggest pivot.

191

answered Sep 27 '22 18:09

Roland

Sure, just get rid of more stuff you don't need (because you know what the values are), don't do any checks, set DUP=FALSE, and only return what you want:

qr3 <- function (x, tol = 1e-07) {
  .Fortran("dqrdc2", qr=x*1.0, 2L, 2L, 2L, tol*1.0,
           rank = 0L, qraux = double(2L), pivot = c(1L,2L), 
           double(4L), DUP = FALSE, PACKAGE = "base")[[6L]]
}
microbenchmark(qr(X)$rank,qr2(X)$rank,qr3(X),times=1000)
# Unit: microseconds
#          expr    min      lq  median      uq     max
# 1  qr(X)$rank 33.303 34.2725 34.9720 35.5180 737.599
# 2 qr2(X)$rank 18.334 18.9780 19.4935 19.9240  38.063
# 3      qr3(X)  6.536  7.2100  8.3550  8.5995 657.099

I'm not an advocate of removing checks, but they do slow things down. x*1.0 and tol*1.0 ensure doubles, so that's kind-of a check and adds a little overhead. Also note that DUP=FALSE can potentially be dangerous, since you can alter the input object(s).

answered Sep 27 '22 19:09

Joshua Ulrich

Related questions
                            
                                Best way to check if a key exists in a Dictionary before adding it?
                            
                                Where to run complex algorithms? Server side or Client side? [closed]
                            
                                Can I disable checking for zero division every time the division happens?
                            
                                How to know which count query is the fastest?
                            
                                Speeding up an .exe created with Pyinstaller
                            
                                Efficient sampling from nested lists
                            
                                C#: Memory-efficient search through 2 million objects without external dependencies
                            
                                Understanding Ruby on Rails render times
                            
                                Variable number of arguments without boxing the value-types?
                            
                                Chat application using django
                            
                                How to import *huge* chunks of data to PostgreSQL?
                            
                                Why is pointer access slower than vector::iterator access? (compiler code generation)
                            
                                Python For Loop Slowing With Time
                            
                                How do optimizing compilers decide when and how much to unroll a loop?
                            
                                How to set fetchSize for iBatis select statement
                            
                                Suggested speed improvement when defining string with value immediately, instead of delaying
                            
                                Efficiently compute histogram of pairwise differences in a large vector in R?
                            
                                Techniques for keeping data in the cache, locality?
                            
                                Improving rendering performance with Jbuilder and Rails 3
                            
                                Make full site HTTPS / SSL? What performance / SEO issues & best practices still apply in 2012? [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Fastest way for calculating rank of 2*2 matrix?

Tags:

performance

r

matrix

rank

Roland

People also ask

2 Answers

Roland

Joshua Ulrich

Recent Activity

Donate For Us