I would like to speed up my calculations and obtain the result without the loop in function `m`. Reproducible example:
N <- 2500
n <- 500
r <- replicate(1000, sample(N, n))

m <- function(r, N) {
  ic <- matrix(0, nrow = N, ncol = N)
  for (i in 1:ncol(r)) {
    p <- r[, i]
    ic[p, p] <- ic[p, p] + 1
  }
  ic
}
system.time(ic <- m(r, N))
# user system elapsed
# 6.25 0.51 6.76
isSymmetric(ic)
# [1] TRUE
In every iteration of the for loop we are dealing with a matrix, not a vector, so how could this be vectorized?
@joel.wilson The purpose of this function is to calculate pairwise frequencies of elements, so that we can then estimate pairwise inclusion probabilities.
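To make that purpose concrete, here is a minimal sketch with toy sizes chosen purely for illustration (the function `m()` is reproduced from the question): `ic[i, j]` counts how many samples contain both units `i` and `j`, so dividing by the number of samples gives an estimate of the pairwise inclusion probabilities.

```r
# m() reproduced from the question, run on toy sizes
m <- function(r, N) {
  ic <- matrix(0, nrow = N, ncol = N)
  for (i in 1:ncol(r)) {
    p <- r[, i]
    ic[p, p] <- ic[p, p] + 1
  }
  ic
}

set.seed(1)
N_small <- 5
r_small <- replicate(3, sample(N_small, 2))  # 3 samples of size 2
ic_small <- m(r_small, N_small)
# ic_small[i, j] = number of samples containing both i and j;
# the diagonal counts how often each unit appeared at all.
ic_small / ncol(r_small)  # estimated pairwise inclusion probabilities
```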
Thanks to @Khashaa and @alexis_laz. Benchmarks:
> require(rbenchmark)
> benchmark(m(r, N),
+ m1(r, N),
+ mvec(r, N),
+ alexis(r, N),
+ replications = 10, order = "elapsed")
test replications elapsed relative user.self sys.self user.child sys.child
4 alexis(r, N) 10 4.73 1.000 4.63 0.11 NA NA
3 mvec(r, N) 10 5.36 1.133 5.18 0.18 NA NA
2 m1(r, N) 10 5.48 1.159 5.29 0.19 NA NA
1 m(r, N) 10 61.41 12.983 60.43 0.90 NA NA
This should be significantly faster, as it avoids the repeated double-indexing assignments into the large N × N matrix and replaces them with a single `tcrossprod()`:
m1 <- function(r, N) {
  ic <- matrix(0, nrow = N, ncol = ncol(r))
  for (i in 1:ncol(r)) {
    p <- r[, i]
    ic[, i][p] <- 1
  }
  tcrossprod(ic)
}
system.time(ic1 <- m1(r, N))
# user system elapsed
# 0.53 0.01 0.55
all.equal(ic, ic1)
# [1] TRUE
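The identity behind this answer: let `A` be the N × k indicator matrix with `A[i, s] = 1` exactly when unit `i` is in sample `s`. Then `(A %*% t(A))[i, j]` is the number of samples containing both `i` and `j`, which is precisely what `m()` counts. A small self-contained check (both functions reproduced from the thread, with `m1`'s assignment written in the equivalent `ic[r[, i], i] <- 1` form; toy sizes chosen for speed):

```r
# Original version: double-indexed accumulation into an N x N matrix
m <- function(r, N) {
  ic <- matrix(0, nrow = N, ncol = N)
  for (i in 1:ncol(r)) {
    p <- r[, i]
    ic[p, p] <- ic[p, p] + 1
  }
  ic
}

# Indicator-matrix version: build A, then A %*% t(A) via tcrossprod()
m1 <- function(r, N) {
  ic <- matrix(0, nrow = N, ncol = ncol(r))
  for (i in 1:ncol(r)) ic[r[, i], i] <- 1
  tcrossprod(ic)
}

set.seed(42)
r_toy <- replicate(50, sample(100, 10))
all.equal(m(r_toy, 100), m1(r_toy, 100))
# [1] TRUE
```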
Simple "counting/adding" operations like this can almost always be vectorized:
mvec <- function(r, N) {
  ic <- matrix(0, nrow = N, ncol = ncol(r))
  i <- rep(1:ncol(r), each = nrow(r))
  ic[cbind(as.vector(r), i)] <- 1
  tcrossprod(ic)
}