Is it better to call ToList() or ToArray() in LINQ queries?

Tags: performance, .net, linq

I often run into the case where I want to evaluate a query right where I declare it. This is usually because I need to iterate over it multiple times and it is expensive to compute. For example:

    string raw = "...";
    var lines = (from l in raw.Split('\n')
                 let ll = l.Trim()
                 where !string.IsNullOrEmpty(ll)
                 select ll).ToList();

This works fine. But if I am not going to modify the result, then I might as well call ToArray() instead of ToList().

I wonder, however, whether ToArray() is implemented by first calling ToList() and is therefore less memory-efficient than just calling ToList().

Am I crazy? Should I just call ToArray() - safe and secure in the knowledge that the memory won't be allocated twice?

421

asked Jul 09 '09 19:07

Frank Krueger

2 Answers

Unless you simply need an array to meet other constraints, you should use ToList. In the majority of scenarios, ToArray will allocate more memory than ToList.

Both use arrays for storage, but ToList has a more flexible constraint: its array only has to be at least as large as the number of elements in the collection, and a larger array is not a problem. ToArray, however, needs an array sized exactly to the number of elements.

To meet this constraint, ToArray often does one more allocation than ToList. Once it has a buffer that is big enough, it allocates an array of exactly the right size and copies the elements into it. It can skip this step only when the buffer's growth happens to land exactly on the number of elements being stored, which is definitely the minority case.
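The allocation pattern described above can be sketched with a simplified model. This is not the real BCL code: the class name `GrowthSketch`, the starting size of 4, and the doubling rule are all assumptions chosen for illustration, and actual runtimes may size their buffers differently.

```csharp
using System;
using System.Collections.Generic;

// Simplified model of the strategy described above; NOT the actual BCL code.
// Assumption: the buffer starts at 4 slots and doubles whenever it is full.
public static class GrowthSketch
{
    // Fills a growable buffer from the source and counts the arrays allocated.
    public static T[] Fill<T>(IEnumerable<T> source, out int count, out int allocations)
    {
        T[] buffer = new T[4];
        allocations = 1;
        count = 0;
        foreach (T item in source)
        {
            if (count == buffer.Length)
            {
                // Buffer is full: allocate a bigger one and copy the elements over.
                T[] bigger = new T[buffer.Length * 2];
                Array.Copy(buffer, bigger, count);
                buffer = bigger;
                allocations++;
            }
            buffer[count++] = item;
        }
        return buffer;
    }

    // ToList-style: the oversized buffer is kept as is.
    public static int AllocationsForListLike<T>(IEnumerable<T> source)
    {
        Fill(source, out _, out int allocations);
        return allocations;
    }

    // ToArray-style: one more allocation to trim to the exact size,
    // unless the buffer happens to be exactly full already.
    public static T[] ToExactArray<T>(IEnumerable<T> source, out int allocations)
    {
        T[] buffer = Fill(source, out int count, out allocations);
        if (count == buffer.Length)
        {
            return buffer;
        }
        T[] exact = new T[count];
        Array.Copy(buffer, exact, count);
        allocations++;
        return exact;
    }
}
```

Under these assumptions, 10 elements cost 3 allocations list-style and 4 array-style, while 16 elements cost 3 either way because the doubling lands exactly on the element count.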

EDIT

A couple of people have asked me about the consequence of having the extra unused memory in the List<T> value.

This is a valid concern. If the created collection is long-lived, is never modified after being created, and has a high chance of landing in the Gen2 heap, then you may be better off taking the extra allocation of ToArray up front.

In general, though, I find this to be the rarer case. It's much more common to see a lot of ToArray calls whose results are immediately passed to other short-lived uses of memory, in which case ToList is demonstrably better.
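One way to see the unused memory in question is to compare `List<T>.Capacity` with `List<T>.Count`. The sketch below is only an illustration: `CapacitySketch` and its helper names are hypothetical, and the capacity a given runtime picks for `ToList()` is an implementation detail, so no specific value is assumed.

```csharp
using System.Collections.Generic;
using System.Linq;

// Hypothetical helpers for inspecting spare capacity; names are illustrative only.
public static class CapacitySketch
{
    // Number of allocated-but-unused slots in the list's backing array.
    public static int UnusedSlots<T>(List<T> list) => list.Capacity - list.Count;

    // Shrinks the backing array to exactly Count elements.
    // This costs one more allocation and copy, much like ToArray does up front.
    public static void TrimToCount<T>(List<T> list) => list.Capacity = list.Count;

    public static string Describe<T>(List<T> list) =>
        $"Count={list.Count}, Capacity={list.Capacity}, unused={UnusedSlots(list)}";

    // Where() hides the element count, so ToList() cannot presize the list.
    public static List<int> BuildSample() =>
        Enumerable.Range(0, 1000).Where(n => n % 3 == 0).ToList();
}
```

Calling `Describe` on the result of `BuildSample()` before and after `TrimToCount` shows how many slots were spare for a long-lived list.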

The key here is to profile, profile and then profile some more.
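As a rough starting point for measuring, the sketch below compares bytes allocated by each call using `GC.GetAllocatedBytesForCurrentThread()`, which I am assuming is available in your runtime (it exists in .NET Core 3.0+ and .NET Framework 4.8). The `AllocationProbe` class and its method names are hypothetical, and this is a crude probe rather than a substitute for a real profiler or benchmark harness.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical probe; a crude measurement, not a rigorous benchmark.
public static class AllocationProbe
{
    // Bytes allocated on the current thread while the action runs.
    public static long Measure(Action action)
    {
        long before = GC.GetAllocatedBytesForCurrentThread();
        action();
        return GC.GetAllocatedBytesForCurrentThread() - before;
    }

    // The query from the question, left unevaluated (deferred).
    public static IEnumerable<string> Lines(string raw) =>
        from l in raw.Split('\n')
        let ll = l.Trim()
        where !string.IsNullOrEmpty(ll)
        select ll;

    public static string Compare(string raw)
    {
        // Warm up both paths so one-time setup costs are not counted.
        Lines(raw).ToList();
        Lines(raw).ToArray();

        long listBytes = Measure(() => Lines(raw).ToList());
        long arrayBytes = Measure(() => Lines(raw).ToArray());
        return $"ToList: {listBytes} bytes, ToArray: {arrayBytes} bytes";
    }
}
```

Both measurements include the cost of `Split` and the query itself, so only the difference between the two numbers is meaningful, and it will vary with input size and runtime version.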

answered Oct 23 '22 03:10

JaredPar

The performance difference will be insignificant, since List<T> is implemented as a dynamically sized array. Calling either ToArray() (which uses an internal Buffer<T> type to grow the array) or ToList() (which calls the List<T>(IEnumerable<T>) constructor) will end up being a matter of putting the elements into an array and growing the array until it fits them all.

If you want concrete confirmation of this, check out the implementation of the methods in question in Reflector; you'll see they boil down to almost identical code.

answered Oct 23 '22 05:10

mqp
