As usual, <code>int?</code> means <code>System.Nullable<int></code> (or <code>System.Nullable`1[System.Int32]</code>). Suppose you have an in-memory <code>IEnumerable<int?></code> (such as a <code>List<int?></code> for example), let us call it <code>seq</code>; then you can find its sum with: <pre class="prettyprint"><code>var seqSum = seq.Sum(); </code></pre> Of course this goes to the extension method overload <code>int? IEnumerable<int?>.Sum()</code> (documentation) which is really a static method on <code>System.Linq.Enumerable</code>. However, the method never returns <code>null</code>, so why is the return type declared as <code>Nullable<></code>? Even in cases where <code>seq</code> is an empty collection or more generally a collection all of whose elements are the <code>null</code> value of type <code>int?</code>, the <code>Sum</code> method in question still returns zero, not <code>null</code>. This is evident from the documentation, but also from the System.Core.dll source code: <pre class="prettyprint"><code>public static int? Sum(this IEnumerable<int?> source) { if (source == null) throw Error.ArgumentNull("source"); int sum = 0; checked { foreach (int? v in source) { if (v != null) sum += v.GetValueOrDefault(); } } return sum; } </code></pre> Note that there is only one <code>return</code> statement and its expression <code>sum</code> has type <code>int</code> (which will then implicitly be converted to <code>int?</code> by a wrapping). It seems wasteful to always wrap the return value. (The caller could always do the wrapping implicitly on his side if desired.) Besides, this return type may lead the caller into writing code such as <code>if (!seqSum.HasValue) { /* logic to handle this */ }</code> which will in reality be unreachable (a fact which the C# compiler cannot know of). So why is this return parameter not simply declared as <code>int</code> with no nullable? I wonder if there is any benefit of having the same return type as <code>int? IQueryable<int?>.Sum()</code> (in <code>System.Linq.Queryable</code> class). This latter method may return <code>null</code> in practice if there are LINQ providers (maybe LINQ to SQL?) that implement it so.

Several comments have mentioned that this isn't really answerable (or only opinion based without official response). I won't argue that. However, one can still perform analysis on available code and form a strong enough theory. Mine is simply that this is a an existing MS pattern. If you look through the rest of <code>System.Linq.Enumerable</code>, in particular the math related functions, you start to see a pattern of having the tendency to return the same type as the input parameter, unless the return has a specific reason to be of a different type. See the following functions: <code>Max()</code>: <pre class="prettyprint"><code>public static int Max(this IEnumerable<int> source); public static int? Max(this IEnumerable<int?> source); public static long Max(this IEnumerable<long> source); public static long? Max(this IEnumerable<long?> source); </code></pre> <code>Min()</code>: <pre class="prettyprint"><code>public static int Min(this IEnumerable<int> source); public static int? Min(this IEnumerable<int?> source); public static long Min(this IEnumerable<long> source); public static long? Min(this IEnumerable<long?> source); </code></pre> <code>Sum()</code>: <pre class="prettyprint"><code>public static int Sum(this IEnumerable<int> source); public static int? Sum(this IEnumerable<int?> source); public static long Sum(this IEnumerable<long> source); public static long? Sum(this IEnumerable<long?> source); </code></pre> For the exception to the rule, take a look at <code>Average</code>... <pre class="prettyprint"><code>public static double Average(this IEnumerable<int> source); public static double? Average(this IEnumerable<int?> source); </code></pre> You can see that it still retains the <code>Nullable<T></code> type, however the return type must be altered to a suitable type to support the result that averaging integers together yields. When you look further into <code>Average</code> though, you see the following: <pre class="prettyprint"><code>public static float Average(this IEnumerable<float> source); public static float? Average(this IEnumerable<float?> source); </code></pre> Again, back to the default pattern of returning the same type as the original incoming type. Now that we see this pattern here, let's see if we see this anywhere else... let's take a look at <code>System.Math</code> since we are on that subject. Again, here we see the same pattern of using the same return type: <pre class="prettyprint"><code>public static int Abs(int value); public static long Abs(long value); public static int Max(int val1, int val2); public static long Max(long val1, long val2); </code></pre> I'll mention it again, this is what amounts to an "opinion answer". I have looked for any MS best practices or language specification information that might hint at this being a language pattern for MS to back up my analysis, but I could not find anything. That being said, if you look at various places in the .Net core libraries, especially the <code>System.Collections.Generic</code> namespace, you will see that unless there is specific reason, the return type matches the collection type. I see no reason for that rule to be deviated from when it comes to <code>Nullable<T></code> types.

Why is the Linq-to-Objects sum of a sequence of nullables itself nullable?

Tags:

c#

linq

sum

nullable

base-class-library

As usual, int? means System.Nullable<int> (or System.Nullable`1[System.Int32]).

Suppose you have an in-memory IEnumerable<int?> (such as a List<int?> for example), let us call it seq; then you can find its sum with:

var seqSum = seq.Sum();

Of course this goes to the extension method overload int? IEnumerable<int?>.Sum() (documentation) which is really a static method on System.Linq.Enumerable.

However, the method never returns null, so why is the return type declared as Nullable<>? Even in cases where seq is an empty collection or more generally a collection all of whose elements are the null value of type int?, the Sum method in question still returns zero, not null.

This is evident from the documentation, but also from the System.Core.dll source code:

public static int? Sum(this IEnumerable<int?> source) { 
    if (source == null) throw Error.ArgumentNull("source"); 
    int sum = 0; 
    checked { 
        foreach (int? v in source) { 
            if (v != null) sum += v.GetValueOrDefault(); 
        } 
    } 
    return sum; 
}

Note that there is only one return statement and its expression sum has type int (which will then implicitly be converted to int? by a wrapping).

It seems wasteful to always wrap the return value. (The caller could always do the wrapping implicitly on his side if desired.)

Besides, this return type may lead the caller into writing code such as if (!seqSum.HasValue) { /* logic to handle this */ } which will in reality be unreachable (a fact which the C# compiler cannot know of).

So why is this return parameter not simply declared as int with no nullable?

I wonder if there is any benefit of having the same return type as int? IQueryable<int?>.Sum() (in System.Linq.Queryable class). This latter method may return null in practice if there are LINQ providers (maybe LINQ to SQL?) that implement it so.

656

asked Dec 08 '16 13:12

Jeppe Stig Nielsen

1 Answers

Several comments have mentioned that this isn't really answerable (or only opinion based without official response). I won't argue that. However, one can still perform analysis on available code and form a strong enough theory. Mine is simply that this is a an existing MS pattern.

If you look through the rest of System.Linq.Enumerable, in particular the math related functions, you start to see a pattern of having the tendency to return the same type as the input parameter, unless the return has a specific reason to be of a different type.

See the following functions:

Max():

public static int Max(this IEnumerable<int> source);
public static int? Max(this IEnumerable<int?> source);
public static long Max(this IEnumerable<long> source);
public static long? Max(this IEnumerable<long?> source);

Min():

public static int Min(this IEnumerable<int> source);
public static int? Min(this IEnumerable<int?> source);
public static long Min(this IEnumerable<long> source);
public static long? Min(this IEnumerable<long?> source);

Sum():

public static int Sum(this IEnumerable<int> source);
public static int? Sum(this IEnumerable<int?> source);
public static long Sum(this IEnumerable<long> source);
public static long? Sum(this IEnumerable<long?> source);

For the exception to the rule, take a look at Average...

public static double Average(this IEnumerable<int> source);
public static double? Average(this IEnumerable<int?> source);

You can see that it still retains the Nullable<T> type, however the return type must be altered to a suitable type to support the result that averaging integers together yields.

When you look further into Average though, you see the following:

public static float Average(this IEnumerable<float> source);
public static float? Average(this IEnumerable<float?> source);

Again, back to the default pattern of returning the same type as the original incoming type.

Now that we see this pattern here, let's see if we see this anywhere else... let's take a look at System.Math since we are on that subject.

Again, here we see the same pattern of using the same return type:

public static int Abs(int value);
public static long Abs(long value);

public static int Max(int val1, int val2);
public static long Max(long val1, long val2);

I'll mention it again, this is what amounts to an "opinion answer". I have looked for any MS best practices or language specification information that might hint at this being a language pattern for MS to back up my analysis, but I could not find anything. That being said, if you look at various places in the .Net core libraries, especially the System.Collections.Generic namespace, you will see that unless there is specific reason, the return type matches the collection type.

I see no reason for that rule to be deviated from when it comes to Nullable<T> types.

answered Oct 09 '22 07:10

gmiley

Related questions
                            
                                Box2D body velocity cap?
                            
                                Run new process as admin and read standard output
                            
                                Hosting RemoteAPP session within Winform
                            
                                How to diagnose source of Handle leak
                            
                                How to avoid "Violation of UNIQUE KEY constraint" when doing LOTS of concurrent INSERTs
                            
                                Xamarin IDE Access to the Path is denied
                            
                                How do I tell Resharper that my IEnumerable method removes nulls?
                            
                                Dynamically append OWIN JWT resource server Application clients (audiences)
                            
                                Designing an F# module to be called by C# (Console/MVC/WPF)
                            
                                How do I stream a video file using ASP.NET MVC?
                            
                                MongoDB connection problems on Azure
                            
                                Extremely slow and inefficient query execution from Entity Framework
                            
                                Suppress System Overlays, Windows phone 8.1 (Silverlight)
                            
                                Strange difference between .net 3.5 and .net 4.0
                            
                                How to change stroke of Ellipse when ListBox item is selected in Windows Phone 8?
                            
                                Is it possible to update the Service Fabric Cluster Manifest?
                            
                                Can ASP.NET MVC + EF scaffolding be used after implementing EntityTypeConfiguration classes?
                            
                                Load test doesn't show more than 4GB for Working Set PerformanceCounter
                            
                                C# Image.FromStream(): Lost metadata when running in Windows 8 / 10
                            
                                How to resize Webview height based on HTML content in Windows 10 UWP?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With