Something just occurred to me earlier today that has got me scratching my head.
Any variable of type Nullable<T>
can be assigned to null
. For instance:
int? i = null;
At first I couldn't see how this would be possible without somehow defining an implicit conversion from object
to Nullable<T>
:
public static implicit operator Nullable<T>(object box);
But the above operator clearly does not exist, as if it did then the following would also have to be legal, at least at compile-time (which it isn't):
int? i = new object();
Then I realized that perhaps the Nullable<T>
type could define an implicit conversion to some arbitrary reference type that can never be instantiated, like this:
public abstract class DummyBox { private DummyBox() { } } public struct Nullable<T> where T : struct { public static implicit operator Nullable<T>(DummyBox box) { if (box == null) { return new Nullable<T>(); } // This should never be possible, as a DummyBox cannot be instantiated. throw new InvalidCastException(); } }
However, this does not explain what occurred to me next: if the HasValue
property is false
for any Nullable<T>
value, then that value will be boxed as null
:
int? i = new int?(); object x = i; // Now x is null.
Furthermore, if HasValue
is true
, then the value will be boxed as a T
rather than a T?
:
int? i = 5; object x = i; // Now x is a boxed int, NOT a boxed Nullable<int>.
But this seems to imply that there is a custom implicit conversion from Nullable<T>
to object
:
public static implicit operator object(Nullable<T> value);
This is clearly not the case as object
is a base class for all types, and user-defined implicit conversions to/from base types are illegal (as well they should be).
It seems that object x = i;
should box i
like any other value type, so that x.GetType()
would yield the same result as typeof(int?)
(rather than throw a NullReferenceException
).
So I dug around a bit and, sure enough, it turns out this behavior is specific to the Nullable<T>
type, specially defined in both the C# and VB.NET specifications, and not reproducible in any user-defined struct
(C#) or Structure
(VB.NET).
Here's why I'm still confused.
This particular boxing and unboxing behavior appears to be impossible to implement by hand. It only works because both C# and VB.NET give special treatment to the Nullable<T>
type.
Isn't it theoretically possible that a different CLI-based language could exist where Nullable<T>
weren't given this special treatment? And wouldn't the Nullable<T>
type therefore exhibit different behavior in different languages?
How do C# and VB.NET achieve this behavior? Is it supported by the CLR? (That is, does the CLR allow a type to somehow "override" the manner in which it is boxed, even though C# and VB.NET themselves prohibit it?)
Is it even possible (in C# or VB.NET) to box a Nullable<T>
as object
?
Boxing and unboxing An instance of a nullable value type T? is boxed as follows: If HasValue returns false , the null reference is produced. If HasValue returns true , the corresponding value of the underlying value type T is boxed, not the instance of Nullable<T>.
Boxing is an implicit conversion of a value type to the type object or to any interface type implemented by this value type. Boxing a value type allocates an object instance on the heap and copies the value into the new object.
C# provides a special data types, the nullable types, to which you can assign normal range of values as well as null values. For example, you can store any value from -2,147,483,648 to 2,147,483,647 or null in a Nullable<Int32> variable. Similarly, you can assign true, false, or null in a Nullable<bool> variable.
There are two things going on:
1) The compiler treats "null" not as a null reference but as a null value... the null value for whatever type it needs to convert to. In the case of a Nullable<T>
it's just the value which has False for the HasValue
field/property. So if you have a variable of type int?
, it's quite possible for the value of that variable to be null
- you just need to change your understanding of what null
means a little bit.
2) Boxing nullable types gets special treatment by the CLR itself. This is relevant in your second example:
int? i = new int?(); object x = i;
the compiler will box any nullable type value differently to non-nullable type values. If the value isn't null, the result will be the same as boxing the same value as a non-nullable type value - so an int?
with value 5 gets boxed in the same way as an int
with value 5 - the "nullability" is lost. However, the null value of a nullable type is boxed to just the null reference, rather than creating an object at all.
This was introduced late in the CLR v2 cycle, at the request of the community.
It means there's no such thing as a "boxed nullable-value-type value".
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With