The fastmatch package implements a much faster version of <code>match</code> for repeated matches (e.g. in a loop): <pre class="prettyprint"><code>set.seed(1) library(fastmatch) table <- 1L:100000L x <- sample(table, 10000, replace=TRUE) system.time(for(i in 1:100) a <- match(x, table)) system.time(for(i in 1:100) b <- fmatch(x, table)) identical(a, b) </code></pre> Is there a similar implementation for <code>%in%</code> I could use to speed up repeated lookups?

Look at the definition of <code>%in%</code>: <pre class="prettyprint"><code>R> `%in%` function (x, table) match(x, table, nomatch = 0L) > 0L <bytecode: 0x1fab7a8> <environment: namespace:base> </code></pre> It's easy to write your own <code>%fin%</code> function: <pre class="prettyprint"><code>`%fin%` <- function(x, table) { stopifnot(require(fastmatch)) fmatch(x, table, nomatch = 0L) > 0L } system.time(for(i in 1:100) a <- x %in% table) # user system elapsed # 1.780 0.000 1.782 system.time(for(i in 1:100) b <- x %fin% table) # user system elapsed # 0.052 0.000 0.054 identical(a, b) # [1] TRUE </code></pre>

Faster %in% operator

Tags:

The fastmatch package implements a much faster version of match for repeated matches (e.g. in a loop):

set.seed(1) library(fastmatch) table <- 1L:100000L x <- sample(table, 10000, replace=TRUE) system.time(for(i in 1:100) a <-  match(x, table)) system.time(for(i in 1:100) b <- fmatch(x, table)) identical(a, b)

Is there a similar implementation for %in% I could use to speed up repeated lookups?

597

asked Oct 04 '15 15:10

Zach

1 Answers

Look at the definition of %in%:

R> `%in%` function (x, table)  match(x, table, nomatch = 0L) > 0L <bytecode: 0x1fab7a8> <environment: namespace:base>

It's easy to write your own %fin% function:

`%fin%` <- function(x, table) {   stopifnot(require(fastmatch))   fmatch(x, table, nomatch = 0L) > 0L } system.time(for(i in 1:100) a <- x %in% table) #    user  system elapsed  #   1.780   0.000   1.782  system.time(for(i in 1:100) b <- x %fin% table) #    user  system elapsed  #   0.052   0.000   0.054 identical(a, b) # [1] TRUE

answered Sep 29 '22 18:09

Joshua Ulrich

Related questions
                            
                                Typescript module systems on momentJS behaving strangely
                            
                                Return null from a stateless component/"functional component"
                            
                                a bytes-like object is required, not 'str' JSON File opened as STR
                            
                                AWS API-Gateway communicating to SNS
                            
                                Swift generics: return type based on parameter type
                            
                                Why do I get "ImportError: cannot import name find_spec" when I start a new Django project?
                            
                                How to slice a generator object or iterator?
                            
                                In Kotlin, how do I add extension methods to another class, but only visible in a certain context?
                            
                                How to add comment inside complex excel formula
                            
                                Unable to obtain OffsetDateTime from TemporalAccessor
                            
                                Define function in unix/linux command line (e.g. BASH)
                            
                                Suppress panic output in Rust when using panic::catch_unwind

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With