I was doing some performance metrics and I ran into something that seems quite odd to me. I time the following two functions: <pre class="prettyprint"><code> private static void DoOne() { List<int> A = new List<int>(); for (int i = 0; i < 200; i++) A.Add(i); int s=0; for (int j = 0; j < 100000; j++) { for (int c = 0; c < A.Count; c++) s += A[c]; } } private static void DoTwo() { List<int> A = new List<int>(); for (int i = 0; i < 200; i++) A.Add(i); IList<int> L = A; int s = 0; for (int j = 0; j < 100000; j++) { for (int c = 0; c < L.Count; c++) s += L[c]; } } </code></pre> Even when compiling in release mode, the timings results were consistently showing that DoTwo takes ~100 longer then DoOne: <pre class="prettyprint"><code> DoOne took 0.06171706 seconds. DoTwo took 8.841709 seconds. </code></pre> Given the fact the List directly implements IList I was very surprised by the results. Can anyone clarify this behavior? <h3>The gory details</h3> Responding to questions, here is the full code and an image of the project build preferences: <blockquote class="spoiler"> Dead Image Link </blockquote> <pre class="prettyprint"><code>using System; using System.Collections.Generic; using System.Text; using System.Diagnostics; using System.Collections; namespace TimingTests { class Program { static void Main(string[] args) { Stopwatch SW = new Stopwatch(); SW.Start(); DoOne(); SW.Stop(); Console.WriteLine(" DoOne took {0} seconds.", ((float)SW.ElapsedTicks) / Stopwatch.Frequency); SW.Reset(); SW.Start(); DoTwo(); SW.Stop(); Console.WriteLine(" DoTwo took {0} seconds.", ((float)SW.ElapsedTicks) / Stopwatch.Frequency); } private static void DoOne() { List<int> A = new List<int>(); for (int i = 0; i < 200; i++) A.Add(i); int s=0; for (int j = 0; j < 100000; j++) { for (int c = 0; c < A.Count; c++) s += A[c]; } } private static void DoTwo() { List<int> A = new List<int>(); for (int i = 0; i < 200; i++) A.Add(i); IList<int> L = A; int s = 0; for (int j = 0; j < 100000; j++) { for (int c = 0; c < L.Count; c++) s += L[c]; } } } } </code></pre> Thanks for all the good answers (especially @kentaromiura). I would have closed the question, though I feel we still miss an important part of the puzzle. Why would accessing a class via an interface it implements be so much slower? The only difference I can see is that accessing a function via an Interface implies using virtual tables while the normally the functions can be called directly. To see whether this is the case I made a couple of changes to the above code. First I introduced two almost identical classes: <pre class="prettyprint"><code> public class VC { virtual public int f() { return 2; } virtual public int Count { get { return 200; } } } public class C { public int f() { return 2; } public int Count { get { return 200; } } } </code></pre> As you can see VC is using virtual functions and C doesn't. Now to DoOne and DoTwo: <pre class="prettyprint"><code> private static void DoOne() { C a = new C(); int s=0; for (int j = 0; j < 100000; j++) { for (int c = 0; c < a.Count; c++) s += a.f(); } } private static void DoTwo() { VC a = new VC(); int s = 0; for (int j = 0; j < 100000; j++) { for (int c = 0; c < a.Count; c++) s += a.f(); } } </code></pre> And indeed: <pre class="prettyprint"><code>DoOne took 0.01287789 seconds. DoTwo took 8.982396 seconds. </code></pre> This is even more scary - virtual function calls 800 times slower?? so a couple of question to the community: <ol> <li>Can you reproduce? (given the fact that all had worse performance before, but not as bad as mine)</li> <li>Can you explain? </li> <li>(this may be the most important) - can you think of a way to avoid?</li> </ol> Boaz

Profiling one on one: Testing with Snippet compiler. using your code results: <blockquote> 0.043s vs 0.116s </blockquote> eliminating Temporary L <blockquote> 0.043s vs 0.116s - ininfluent </blockquote> by caching A.count in cmax on both Methods <blockquote> 0.041s vs 0.076s </blockquote> <pre class="prettyprint"><code> IList<int> A = new List<int>(); for (int i = 0; i < 200; i++) A.Add(i); int s = 0; for (int j = 0; j < 100000; j++) { for (int c = 0,cmax=A.Count;c< cmax; c++) s += A[c]; } </code></pre> Now I will try to slow down DoOne, first try, casting to IList before add: <pre class="prettyprint"><code>for (int i = 0; i < 200; i++) ((IList<int>)A).Add(i); </code></pre> <blockquote> 0,041s 0,076s - so add is ininfluent </blockquote> so it remains only one place where the slowdown can happen : <code>s += A[c];</code> so I try this: <pre class="prettyprint"><code>s += ((IList<int>)A)[c]; </code></pre> 0.075s 0.075s - TADaaan! so seems that accessing Count or a index element is slower on the interfaced version: EDIT: Just for fun take a look at this: <pre class="prettyprint"><code> for (int c = 0,cmax=A.Count;c< cmax; c++) s += ((List<int>)A)[c]; </code></pre> <blockquote> 0.041s 0.050s </blockquote> so is not a cast problem, but a reflection one!

Why does casting List<T> into IList<T> result in reduced performance?

Tags:

c#

list

generics

I was doing some performance metrics and I ran into something that seems quite odd to me. I time the following two functions:

  private static void DoOne()
      {
         List<int> A = new List<int>();
         for (int i = 0; i < 200; i++) A.Add(i);
          int s=0;
         for (int j = 0; j < 100000; j++)
         {
            for (int c = 0; c < A.Count; c++) s += A[c];
         }

      }

   private static void DoTwo()
      {
         List<int> A = new List<int>();
         for (int i = 0; i < 200; i++) A.Add(i);
         IList<int> L = A;
         int s = 0;
         for (int j = 0; j < 100000; j++)
         {
            for (int c = 0; c < L.Count; c++) s += L[c];
         }

      }

Even when compiling in release mode, the timings results were consistently showing that DoTwo takes ~100 longer then DoOne:

 DoOne took 0.06171706 seconds.
 DoTwo took 8.841709 seconds.

Given the fact the List directly implements IList I was very surprised by the results. Can anyone clarify this behavior?

The gory details

Responding to questions, here is the full code and an image of the project build preferences:

Dead Image Link

using System;
using System.Collections.Generic;
using System.Text;
using System.Diagnostics;
using System.Collections;

namespace TimingTests
{
   class Program
   {
      static void Main(string[] args)
      {
         Stopwatch SW = new Stopwatch();
         SW.Start();
         DoOne();
         SW.Stop();

         Console.WriteLine(" DoOne took {0} seconds.", ((float)SW.ElapsedTicks) / Stopwatch.Frequency);
         SW.Reset();
         SW.Start();
         DoTwo();
         SW.Stop();

         Console.WriteLine(" DoTwo took {0} seconds.", ((float)SW.ElapsedTicks) / Stopwatch.Frequency);

      }

      private static void DoOne()
      {
         List<int> A = new List<int>();
         for (int i = 0; i < 200; i++) A.Add(i);
         int s=0;
         for (int j = 0; j < 100000; j++)
         {
            for (int c = 0; c < A.Count; c++) s += A[c];
         }

      }
      private static void DoTwo()
      {
         List<int> A = new List<int>();
         for (int i = 0; i < 200; i++) A.Add(i);
         IList<int> L = A;
         int s = 0;
         for (int j = 0; j < 100000; j++)
         {
            for (int c = 0; c < L.Count; c++) s += L[c];
         }

      }
   }
}

Thanks for all the good answers (especially @kentaromiura). I would have closed the question, though I feel we still miss an important part of the puzzle. Why would accessing a class via an interface it implements be so much slower? The only difference I can see is that accessing a function via an Interface implies using virtual tables while the normally the functions can be called directly. To see whether this is the case I made a couple of changes to the above code. First I introduced two almost identical classes:

  public class VC
  {
     virtual public int f() { return 2; }
     virtual public int Count { get { return 200; } }

  }

  public class C
  {
      public int f() { return 2; }
      public int Count { get { return 200; } }

  }

As you can see VC is using virtual functions and C doesn't. Now to DoOne and DoTwo:

    private static void DoOne()
      {  C a = new C();
         int s=0;
         for (int j = 0; j < 100000; j++)
         {
            for (int c = 0; c < a.Count; c++) s += a.f();
         }

      }
      private static void DoTwo()
      {
           VC a = new VC();
         int s = 0;
         for (int j = 0; j < 100000; j++)
         {
            for (int c = 0; c < a.Count; c++) s +=  a.f();
         }

      }

And indeed:

DoOne took 0.01287789 seconds.
DoTwo took 8.982396 seconds.

This is even more scary - virtual function calls 800 times slower?? so a couple of question to the community:

Can you reproduce? (given the fact that all had worse performance before, but not as bad as mine)
Can you explain?
(this may be the most important) - can you think of a way to avoid?

Boaz

553

asked May 12 '09 21:05

Boaz

2 Answers

A note to everyone out there who is trying to benchmark stuff like this.

Do not forget that the code is not jitted until the first time it runs. That means that the first time you run a method, the cost of running that method could be dominated by the time spent loading the IL, analyzing the IL, and jitting it into machine code, particularly if it is a trivial method.

If what you're trying to do is compare the "marginal" runtime cost of two methods, it's a good idea to run both of them twice and consider only the second runs for comparison purposes.

101

answered Oct 20 '22 16:10

Eric Lippert

Profiling one on one:

Testing with Snippet compiler.

using your code results:

0.043s vs 0.116s

eliminating Temporary L

0.043s vs 0.116s - ininfluent

by caching A.count in cmax on both Methods

0.041s vs 0.076s

     IList<int> A = new List<int>();
     for (int i = 0; i < 200; i++) A.Add(i);

     int s = 0;
     for (int j = 0; j < 100000; j++)
     {
        for (int c = 0,cmax=A.Count;c< cmax;  c++) s += A[c];
     }

Now I will try to slow down DoOne, first try, casting to IList before add:

for (int i = 0; i < 200; i++) ((IList<int>)A).Add(i);

0,041s 0,076s - so add is ininfluent

so it remains only one place where the slowdown can happen : s += A[c]; so I try this:

s += ((IList<int>)A)[c];

0.075s 0.075s - TADaaan!

so seems that accessing Count or a index element is slower on the interfaced version:

EDIT: Just for fun take a look at this:

 for (int c = 0,cmax=A.Count;c< cmax;  c++) s += ((List<int>)A)[c];

0.041s 0.050s

so is not a cast problem, but a reflection one!

answered Oct 20 '22 15:10

kentaromiura

Related questions
                            
                                ++i operator difference in C# and C++
                            
                                Open file location
                            
                                Convert a number to a letter in C# for use in Microsoft Excel [duplicate]
                            
                                Get DateTime.Now for a specific TimeZone regardless of the device timezone?
                            
                                Why null == false does not result in compile error in c#? [duplicate]
                            
                                How to get JSON response from a 3.5 asmx web service
                            
                                Can I use the Unity networking HLAPI without paying for the Unity Multiplayer service?
                            
                                How to bind the values of the itemsource (array of strings) to a label in a ListView
                            
                                Switch + Enum = Not all code paths return a value
                            
                                C# 3.5 partial class String IsNullOrWhiteSpace
                            
                                Silverlight Rotate & Scale a bitmap image to fit within rectangle without cropping
                            
                                Convert Dataset to XML
                            
                                Html.ValidationMessageFor Text Color
                            
                                c# word interop find and replace everything
                            
                                Random number generator with no duplicates
                            
                                Split by '/' till '[' appears
                            
                                methods and constructors
                            
                                Cancel blocking AcceptTcpClient call
                            
                                Creating HiddenFor IEnumerable<String> in View
                            
                                What's so bad about ref parameters?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why does casting List<T> into IList<T> result in reduced performance?

Tags:

c#

list

generics

The gory details

Boaz

People also ask

2 Answers

Eric Lippert

kentaromiura

Recent Activity

Donate For Us