R: Why is the [[ ]] approach for subsetting a list faster than using $?

Tags: performance, list, r, subset

I've been working on a few projects that required a lot of list subsetting, and while profiling the code I realised that the object[["nameHere"]] approach to subsetting lists was usually faster than the object$nameHere approach.

As an example, suppose we create a list with named components:

    a.long.list <- as.list(rep(1:1000))
    names(a.long.list) <- paste0("something", 1:1000)

Why is this:

    system.time(
        for (i in 1:10000) {
            a.long.list[["something997"]]
        }
    )
       user  system elapsed
       0.15    0.00    0.16

faster than this:

    system.time(
        for (i in 1:10000) {
            a.long.list$something997
        }
    )
       user  system elapsed
       0.23    0.00    0.23

My question is simply whether this behaviour holds universally, so that I should avoid $ subsetting wherever possible, or whether the most efficient choice depends on other factors.

asked May 18 '13 23:05

Jon M

1 Answer

The [[ function first goes through all the elements looking for an exact match, and only then tries a partial match (the latter only when exact=FALSE is given; by default [[ matches exactly). The $ function tries both an exact and a partial match on each element in turn. If you execute:

    system.time(
        for (i in 1:10000) {
            a.long.list[["something9973", exact=FALSE]]
        }
    )

i.e., a partial match for a name that has no exact match, you will find that $ is in fact ever so slightly faster.
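To see the matching rules for yourself, here is a minimal sketch. The small list x and the variable names are hypothetical and only there for illustration; the larger list is rebuilt from the question. Timings depend on your R version and machine, so no figures are shown, and the relative order may well differ from the numbers quoted above.

```r
# Small illustrative list (hypothetical names, not from the question)
x <- list(something1 = "a", other = "b")

# `$` accepts a unique partial match on the name
partial_dollar <- x$oth

# `[[` is exact by default, so the same prefix finds nothing (NULL)
exact_default <- x[["oth"]]

# With exact = FALSE, `[[` falls back to partial matching
partial_bracket <- x[["oth", exact = FALSE]]

# Rebuild the list from the question and time the three kinds of lookup
a.long.list <- as.list(1:1000)
names(a.long.list) <- paste0("something", 1:1000)

n <- 10000
t_exact   <- system.time(for (i in 1:n) a.long.list[["something997"]])
t_dollar  <- system.time(for (i in 1:n) a.long.list$something997)
t_partial <- system.time(
    for (i in 1:n) a.long.list[["something9973", exact = FALSE]]
)

# Elapsed seconds side by side; values will vary between runs
print(c(exact   = t_exact[["elapsed"]],
        dollar  = t_dollar[["elapsed"]],
        partial = t_partial[["elapsed"]]))
```

If you need stable numbers rather than a rough impression, repeat the measurement several times, since a single system.time() call on a loop this short is noisy.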

answered Oct 16 '22 09:10

Bojan Nikolic
