Boxing and unboxing with generics

Tags:

The .NET 1.0 way of creating collection of integers (for example) was:

ArrayList list = new ArrayList(); list.Add(i);          /* boxing   */ int j = (int)list[0]; /* unboxing */

The penalty of using this is the lack of type safety and performance due to boxing and unboxing.

The .NET 2.0 way is to use generics:

Click to copy

List<int> list = new List<int>(); list.Add(i); int j = list[0];

The price of boxing (to my understanding) is the need to create an object on the heap, copy the stack allocated integer to the new object and vice-versa for unboxing.

How does the use of generics overcome this? Does the stack-allocated integer stays on the stack and being pointed to from the heap (I guess this is not the case because of what will happen when it will get out of scope)? It seems like there is still a need of copying it somewhere else out of the stack.

What is really going on?

353

asked Dec 09 '10 20:12

Itay Karo

2 Answers

When it comes to collections, generics make it possible to avoid boxing/unboxing by utilizing actual T[] arrays internally. List<T> for example uses a T[] array to store its contents.

The array, of course, is a reference type and is therefore (in the current version of the CLR, yada yada) stored on the heap. But since it's a T[] and not an object[], the array's elements can be stored "directly": that is, they're still on the heap, but they're on the heap in the array instead of being boxed and having the array contain references to the boxes.

So for a List<int>, for example, what you'd have in the array would "look" like this:

Click to copy

 [ 1 2 3 ]

Compare this to an ArrayList, which uses an object[] and would therefore "look" something like this:

Click to copy

 [ *a *b *c ]

...where *a, etc. are references to objects (boxed integers):

Click to copy

 *a -> 1 *b -> 2 *c -> 3

Excuse those crude illustrations; hopefully you know what I mean.

103

answered Sep 28 '22 04:09

Dan Tao

Your confusion is a result of misunderstanding what the relationship is between the stack, the heap, and variables. Here's the correct way to think about it.

A variable is a storage location that has a type.
The lifetime of a variable can either be short or long. By "short" we mean "until the current function returns or throws" and by "long" we mean "possibly longer than that".
If the type of a variable is a reference type then the contents of the variable is a reference to a long-lived storage location. If the type of a variable is a value type then the contents of the variable is a value.

As an implementation detail, a storage location which is guaranteed to be short-lived can be allocated on the stack. A storage location which might be long-lived is allocated on the heap. Notice that this says nothing about "value types are always allocated on the stack." Value types are not always allocated on the stack:

Click to copy

int[] x = new int[10]; x[1] = 123;

x[1] is a storage location. It is long-lived; it might live longer than this method. Therefore it must be on the heap. The fact that it contains an int is irrelevant.

You correctly say why a boxed int is expensive:

The price of boxing is the need to create an object on the heap, copy the stack allocated integer to the new object and vice-versa for unboxing.

Where you go wrong is to say "the stack allocated integer". It doesn't matter where the integer was allocated. What matters was that its storage contained the integer, instead of containing a reference to a heap location. The price is the need to create the object and do the copy; that's the only cost that is relevant.

So why isn't a generic variable costly? If you have a variable of type T, and T is constructed to be int, then you have a variable of type int, period. A variable of type int is a storage location, and it contains an int. Whether that storage location is on the stack or the heap is completely irrelevant. What is relevant is that the storage location contains an int, instead of containing a reference to something on the heap. Since the storage location contains an int, you do not have to take on the costs of boxing and unboxing: allocating new storage on the heap and copying the int to the new storage.

Is that now clear?

answered Sep 28 '22 03:09

Eric Lippert

Related questions
                            
                                WPF Window with transparent background containing opaque controls [duplicate]
                            
                                Performance of calling delegates vs methods
                            
                                Difference between string and StringBuilder in C#
                            
                                Convert Word doc and docx format to PDF in .NET Core without Microsoft.Office.Interop
                            
                                Which unit testing framework? [closed]
                            
                                Calling SignalR hub clients from elsewhere in system
                            
                                Convert 20121004 (yyyyMMdd) to a valid date time?
                            
                                How to pass a null variable to a SQL Stored Procedure from C#.net code
                            
                                How to compare dates in c#
                            
                                How can I get scrollbars on Picturebox
                            
                                How do I force full post-back from a button within an UpdatePanel?
                            
                                What is the best or most interesting use of Extension Methods you've seen? [closed]
                            
                                How can I pass a runtime parameter as part of the dependency resolution?
                            
                                String.Replace(char, char) method in C#
                            
                                LINQ to SQL Where Clause Optional Criteria
                            
                                Razor Views not seeing System.Web.Mvc.HtmlHelper
                            
                                Autofac: Resolve all instances of a Type
                            
                                Using ANTLR 3.3?
                            
                                How to force C# .net app to run only one instance in Windows? [duplicate]
                            
                                how to add css class to html generic control div?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Boxing and unboxing with generics

Tags:

c#

.net

generics

boxing

unboxing

Itay Karo

People also ask

2 Answers

Dan Tao

Eric Lippert

Recent Activity

Donate For Us