Avoid two for loops in R

Tags: loops, r

I have some R code that computes the convolution of two functions...

convolveSlow <- function(x, y) {
    nx <- length(x); ny <- length(y)
    xy <- numeric(nx + ny - 1)
    for(i in seq(length = nx)) {
        xi <- x[[i]]
        for(j in seq(length = ny)) {
            ij <- i+j-1
            xy[[ij]] <- xy[[ij]] + xi * y[[j]]
        }
    }
    xy
}

Is there a way to remove the two for loops and make the code run faster?

Thank you San


asked Feb 04 '11 04:02

user602599

2 Answers

Since R is very fast at computing vector operations, the most important thing to keep in mind when programming for performance is to vectorise as many of your operations as possible.

This means thinking hard about replacing loops with vector operations. Here is my solution for fast convolution (50 times faster with input vectors of length 1000 each):

convolveFast <- function(x, y) {
    nx <- length(x)
    ny <- length(y)
    xy <- numeric(nx + ny - 1)       # result vector, initialised to zero
    for (i in seq_len(nx)) {
        j <- seq_len(ny)             # all inner indices at once
        ij <- i + j - 1              # positions in xy that x[i] contributes to
        xy[ij] <- xy[ij] + x[i] * y  # update all of those positions in one step
    }
    xy
}

You will notice that the inner loop (for j in ...) has disappeared; I replaced it with a vector operation. j is now a vector of indices (j <- seq_len(ny)), so ij is a vector too, and a single assignment updates every position of xy that x[i] contributes to. Notice also that I refer to the entire vector y, rather than subsetting it (i.e. y instead of y[j]).

j <- seq_len(ny)
ij <- i + j - 1
xy[ij] <- xy[ij] + x[i] * y
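The same idea can be pushed one step further to remove the outer loop as well. What follows is only a sketch and not part of the original answer: convolveOuter is a hypothetical name, and the approach trades memory for loop-free code by building two full nx-by-ny matrices, so it may well be slower than the single-loop version for long inputs.

```r
# Loop-free sketch (hypothetical helper, not from the original answer).
convolveOuter <- function(x, y) {
    nx <- length(x)
    ny <- length(y)
    prods <- outer(x, y)                               # prods[i, j] = x[i] * y[j]
    pos <- outer(seq_len(nx), seq_len(ny), "+") - 1    # target index i + j - 1
    # Sum all products that share the same target index.
    as.vector(tapply(prods, pos, sum))
}

res <- convolveOuter(c(1, 2, 3), c(0, 1, 0.5))
print(res)
```

Because the partial sums are accumulated in a different order than in the loop versions, it is safer to compare results with all.equal() than with ==.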

I wrote a small function to measure performance:

measure.time <- function(fun1, fun2, ...){
    ptm <- proc.time()
    x1 <- fun1(...)
    time1 <- proc.time() - ptm

    ptm <- proc.time()
    x2 <- fun2(...)
    time2 <- proc.time() - ptm

    ident <- all(x1==x2)

    cat("Function 1\n")
    cat(time1)
    cat("\n\nFunction 2\n")
    cat(time2)
    if(ident) cat("\n\nFunctions return identical results")

}

For two vectors of length 1000 each, I get a 98% reduction in run time (user time drops from 7.07 s to 0.14 s, roughly 50 times faster):

x <- runif(1000)
y <- runif(1000)
measure.time(convolveSlow, convolveFast, x, y)

Function 1
7.07 0 7.59 NA NA

Function 2
0.14 0 0.16 NA NA

Functions return identical results
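A shorter way to run a comparable check is base R's system.time(), which wraps the proc.time() bookkeeping shown above. The sketch below restates both functions so that it runs on its own; the seed and the vector length of 200 are arbitrary assumptions chosen to keep the run short, and the timings will differ from machine to machine.

```r
# Both functions are restated from the thread so this block is self-contained.
convolveSlow <- function(x, y) {
    nx <- length(x); ny <- length(y)
    xy <- numeric(nx + ny - 1)
    for (i in seq_len(nx)) {
        for (j in seq_len(ny)) {
            ij <- i + j - 1
            xy[ij] <- xy[ij] + x[i] * y[j]
        }
    }
    xy
}

convolveFast <- function(x, y) {
    nx <- length(x); ny <- length(y)
    xy <- numeric(nx + ny - 1)
    for (i in seq_len(nx)) {
        ij <- i + seq_len(ny) - 1
        xy[ij] <- xy[ij] + x[i] * y
    }
    xy
}

set.seed(1)          # arbitrary seed, only for repeatability
x <- runif(200)      # shorter than in the answer, to keep the run quick
y <- runif(200)

# system.time() evaluates the expression and returns a proc_time object.
t_slow <- system.time(res_slow <- convolveSlow(x, y))
t_fast <- system.time(res_fast <- convolveFast(x, y))

print(t_slow)
print(t_fast)
print(all.equal(res_slow, res_fast))   # tolerance-based comparison
```

all.equal() compares with a small numeric tolerance, which is usually a more robust check for floating-point results than all(x1 == x2).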

answered Nov 11 '22 09:11

Andrie

1. For vectors, you index with [], not [[]], so use xy[ij] etc.
2. Convolution doesn't vectorise easily, but one common trick is to switch to compiled code. The Writing R Extensions manual uses convolution as a running example and shows several alternatives; we also use it a lot in the Rcpp documentation.
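One compiled route that needs no extra packages is base R's own stats::convolve(), which is FFT-based. This is a swapped-in technique and not the Writing R Extensions or Rcpp example mentioned above. The wrapper name convolveFFT is hypothetical; the rev(y) and type = "open" arguments follow the note in ?convolve on the usual definition of convolution.

```r
# Hypothetical wrapper name; stats::convolve() itself ships with base R.
convolveFFT <- function(x, y) {
    # Per ?convolve, the usual convolution of two sequences x and y is
    # convolve(x, rev(y), type = "open"). It is computed via the FFT,
    # so results carry small floating-point error.
    stats::convolve(x, rev(y), type = "open")
}

x <- c(1, 2, 3)
y <- c(0, 1, 0.5)
res <- convolveFFT(x, y)
print(round(res, 10))
```

Since the FFT introduces rounding error, results should be compared against the loop versions with all.equal(), not with ==.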

answered Nov 11 '22 08:11

Dirk Eddelbuettel
