Edit: I apologize everybody. I used the term "jagged array" when I actually meant to say "multi-dimensional array" (as can be seen in my example below). I apologize for using the incorrect name. I actually found jagged arrays to be faster than multi-dimensional ones! I have added my measurements for jagged arrays. I was trying to use a <del>jagged</del> multi-dimensional array today, when I noticed that it's performance is not as I would have expected. Using a single-dimensional array and manually calculating indices was much faster (almost two times) than using a 2D array. I wrote a test using <code>1024*1024</code> arrays (initialized to random values), for 1000 iterations, and I got the following results on my machine: <pre class="prettyprint"><code>sum(double[], int): 2738 ms (100%) sum(double[,]): 5019 ms (183%) sum(double[][]): 2540 ms ( 93%) </code></pre> This is my test code: <pre class="prettyprint"><code>public static double sum(double[] d, int l1) { // assuming the array is rectangular double sum = 0; int l2 = d.Length / l1; for (int i = 0; i < l1; ++i) for (int j = 0; j < l2; ++j) sum += d[i * l2 + j]; return sum; } public static double sum(double[,] d) { double sum = 0; int l1 = d.GetLength(0); int l2 = d.GetLength(1); for (int i = 0; i < l1; ++i) for (int j = 0; j < l2; ++j) sum += d[i, j]; return sum; } public static double sum(double[][] d) { double sum = 0; for (int i = 0; i < d.Length; ++i) for (int j = 0; j < d[i].Length; ++j) sum += d[i][j]; return sum; } public static void Main() { Random random = new Random(); const int l1 = 1024, l2 = 1024; double[ ] d1 = new double[l1 * l2]; double[,] d2 = new double[l1 , l2]; double[][] d3 = new double[l1][]; for (int i = 0; i < l1; ++i) { d3[i] = new double[l2]; for (int j = 0; j < l2; ++j) d3[i][j] = d2[i, j] = d1[i * l2 + j] = random.NextDouble(); } // const int iterations = 1000; TestTime(sum, d1, l1, iterations); TestTime(sum, d2, iterations); TestTime(sum, d3, iterations); } </code></pre> Further investigation showed that the IL for the second method is 23% larger than that of the first method. (Code size 68 vs 52.) This is mostly due to calls to <code>System.Array::GetLength(int)</code>. The compiler also emits calls to <code>Array::Get</code> for the <del>jagged</del> multi-dimensional array, whereas it simply calls <code>ldelem</code> for the simple array. So I am wondering, why is access through multi-dimensional arrays slower than normal arrays? I would have assumed the compiler (or JIT) would do something similar to what I did in my first method, but this was not actually the case. Could you plese help me understand why this is happening the way it is? <hr> Update: Following Henk Holterman's suggestion, here is the implementation of <code>TestTime</code>: <pre class="prettyprint"><code>public static void TestTime<T, TR>(Func<T, TR> action, T obj, int iterations) { Stopwatch stopwatch = Stopwatch.StartNew(); for (int i = 0; i < iterations; ++i) action(obj); Console.WriteLine(action.Method.Name + " took " + stopwatch.Elapsed); } public static void TestTime<T1, T2, TR>(Func<T1, T2, TR> action, T1 obj1, T2 obj2, int iterations) { Stopwatch stopwatch = Stopwatch.StartNew(); for (int i = 0; i < iterations; ++i) action(obj1, obj2); Console.WriteLine(action.Method.Name + " took " + stopwatch.Elapsed); } </code></pre>

Single dimensional arrays with a lower bound of 0 are a different type to either multi-dimensional or non-0 lower bound arrays within IL (<code>vector</code> vs <code>array</code> IIRC). <code>vector</code> is simpler to work with - to get to element x, you just do <code>pointer + size * x</code>. For an <code>array</code>, you have to do <code>pointer + size * (x-lower bound)</code> for a single dimensional array, and yet more arithmetic for each dimension you add. Basically the CLR is optimised for the vastly more common case.

Why are multi-dimensional arrays in .NET slower than normal arrays?

Tags:

performance

arrays

.net

Edit: I apologize everybody. I used the term "jagged array" when I actually meant to say "multi-dimensional array" (as can be seen in my example below). I apologize for using the incorrect name. I actually found jagged arrays to be faster than multi-dimensional ones! I have added my measurements for jagged arrays.

I was trying to use a ~~jagged~~ multi-dimensional array today, when I noticed that it's performance is not as I would have expected. Using a single-dimensional array and manually calculating indices was much faster (almost two times) than using a 2D array. I wrote a test using 1024*1024 arrays (initialized to random values), for 1000 iterations, and I got the following results on my machine:

sum(double[], int): 2738 ms (100%) sum(double[,]):     5019 ms (183%) sum(double[][]):    2540 ms ( 93%)

This is my test code:

public static double sum(double[] d, int l1) {     // assuming the array is rectangular     double sum = 0;     int l2 = d.Length / l1;     for (int i = 0; i < l1; ++i)         for (int j = 0; j < l2; ++j)             sum += d[i * l2 + j];     return sum; }  public static double sum(double[,] d) {     double sum = 0;     int l1 = d.GetLength(0);     int l2 = d.GetLength(1);     for (int i = 0; i < l1; ++i)         for (int j = 0; j < l2; ++j)             sum += d[i, j];     return sum; }  public static double sum(double[][] d) {     double sum = 0;     for (int i = 0; i < d.Length; ++i)         for (int j = 0; j < d[i].Length; ++j)             sum += d[i][j];     return sum; }  public static void Main() {     Random random = new Random();     const int l1  = 1024, l2 = 1024;     double[ ] d1  = new double[l1 * l2];     double[,] d2  = new double[l1 , l2];     double[][] d3 = new double[l1][];      for (int i = 0; i < l1; ++i) {         d3[i] = new double[l2];         for (int j = 0; j < l2; ++j)             d3[i][j] = d2[i, j] = d1[i * l2 + j] = random.NextDouble();     }     //     const int iterations = 1000;     TestTime(sum, d1, l1, iterations);     TestTime(sum, d2, iterations);     TestTime(sum, d3, iterations); }

Further investigation showed that the IL for the second method is 23% larger than that of the first method. (Code size 68 vs 52.) This is mostly due to calls to System.Array::GetLength(int). The compiler also emits calls to Array::Get for the ~~jagged~~ multi-dimensional array, whereas it simply calls ldelem for the simple array.

So I am wondering, why is access through multi-dimensional arrays slower than normal arrays? I would have assumed the compiler (or JIT) would do something similar to what I did in my first method, but this was not actually the case.

Could you plese help me understand why this is happening the way it is?

Update: Following Henk Holterman's suggestion, here is the implementation of TestTime:

public static void TestTime<T, TR>(Func<T, TR> action, T obj,                                    int iterations) {     Stopwatch stopwatch = Stopwatch.StartNew();     for (int i = 0; i < iterations; ++i)         action(obj);     Console.WriteLine(action.Method.Name + " took " + stopwatch.Elapsed); }  public static void TestTime<T1, T2, TR>(Func<T1, T2, TR> action, T1 obj1,                                         T2 obj2, int iterations) {     Stopwatch stopwatch = Stopwatch.StartNew();     for (int i = 0; i < iterations; ++i)         action(obj1, obj2);     Console.WriteLine(action.Method.Name + " took " + stopwatch.Elapsed); }

618

asked Jan 22 '09 11:01

Hosam Aly

2 Answers

Single dimensional arrays with a lower bound of 0 are a different type to either multi-dimensional or non-0 lower bound arrays within IL (vector vs array IIRC). vector is simpler to work with - to get to element x, you just do pointer + size * x. For an array, you have to do pointer + size * (x-lower bound) for a single dimensional array, and yet more arithmetic for each dimension you add.

Basically the CLR is optimised for the vastly more common case.

answered Oct 21 '22 09:10

Jon Skeet

Array bounds checking?

The single-dimension array has a length member that you access directly - when compiled this is just a memory read.

The multidimensional array requires a GetLength(int dimension) method call that processes the argument to get the relevant length for that dimension. That doesn't compile down to a memory read, so you get a method call, etc.

In addition that GetLength(int dimension) will do a bounds check on the parameter.

answered Oct 21 '22 09:10

JeeBee

Related questions
                            
                                Adding new line of data to TextBox
                            
                                How to get the Index of second comma in a string
                            
                                No overload for method 'ToString" takes 1 arguments when casting date
                            
                                Why is this name with an underscore not CLS Compliant?
                            
                                Show row number in row header of a DataGridView
                            
                                How to Close a Window in WPF on a escape key
                            
                                How can I convert decimal? to decimal
                            
                                IServiceCollection does not contain a defintion for AddHttpClient
                            
                                Build error while transitioning between branches: Your project is not referencing the ".NETFramework,Version=v4.7.2" framework
                            
                                MSbuild Copy whole folder
                            
                                how do I check if an entity is the first element of a foreach loop
                            
                                Visual Studio 2013. You do not have sufficient privilege to access IIS web sites on your machine
                            
                                Matching numbers with regular expressions — only digits and commas
                            
                                WCF ExceptionShielding Error ID does not match up with handlingInstanceId passed to Handler
                            
                                Reference a .NET Core Library in a .NET 4.6 project
                            
                                When should I use out parameters?
                            
                                Best Continuous Integration Setup for a solo developer (.NET) [closed]
                            
                                Empty string as a special case?
                            
                                What is the Nuget repositories.config file for?
                            
                                guid to base64, for URL

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With