Nested Generics: Why can't the compiler infer the type arguments in this case?

I was playing around with a hobby project when I came across a type-inference error I didn't understand. I have simplified it to the following trivial example.

I have the following classes and functions:

    class Foo { }
    class Bar { }
    class Baz { }

    static T2 F<T1, T2>(Func<T1, T2> f) { return default(T2); }
    static T3 G<T1, T2, T3>(Func<T1, Func<T2, T3>> f) { return default(T3); }

Now consider the following examples:

    // 1. F with explicit type arguments - Fine
    F<Foo, Bar>(x => new Bar());

    // 2. F with implicit type arguments - Also fine, compiler infers <Foo, Bar>
    F((Foo x) => new Bar());

    // 3. G with explicit type arguments - Still fine...
    G<Foo, Bar, Baz>(x => y => new Baz());

    // 4. G with implicit type arguments - Bang!
    // Compiler error: Type arguments cannot be inferred from usage
    G((Foo x) => (Bar y) => new Baz());

The last example produces a compiler error, but it seems to me that it should be able to infer the type arguments without any problems.

QUESTION: Why can't the compiler infer <Foo, Bar, Baz> in this case?

UPDATE: I have discovered that simply wrapping the second lambda in an identity function will cause the compiler to infer all the types correctly:

    static Func<T1, T2> I<T1, T2>(Func<T1, T2> f) { return f; }

    // Infers G<Foo, Bar, Baz> and I<Bar, Baz>
    G((Foo x) => I((Bar y) => new Baz()));

Why can it do all the individual steps perfectly, but not the whole inference at once? Is there some subtlety in the order that the compiler analyses implicit lambda types and implicit generic types?

613

asked Sep 04 '12 02:09

verdesmarald

1 Answer

Because the algorithm as described in the C# specification doesn’t succeed in this case. Let’s look at the specification in order to see why this is.

The algorithm description is long and complicated, so I’ll heavily abbreviate this.

The relevant types mentioned in the algorithm have the following values for you:

Eᵢ = the anonymous lambda (Foo x) => (Bar y) => new Baz()
Tᵢ = the parameter type (Func<T1, Func<T2, T3>>)
Xᵢ = the three generic type parameters (T1, T2, T3)

Firstly, there’s the first phase, which in your case does only one thing:

7.5.2.1 The first phase

For each of the method arguments Eᵢ (in your case, there’s only one, the lambda):

If Eᵢ is an anonymous function [it is], an explicit parameter type inference (§7.5.2.7) is made from Eᵢ to Tᵢ

Otherwise, [not relevant]

Otherwise, [not relevant]

Otherwise, no inference is made for this argument.

I’ll skip the details of the explicit parameter type inference here; it suffices to say that for the call G((Foo x) => (Bar y) => new Baz()), it infers that T1 = Foo.
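To see that first step in isolation, here is a minimal sketch. `FirstPhaseDemo` and `Describe` are hypothetical names I made up for illustration (they are not from the question); the helper only reports which type arguments the compiler picked.

```csharp
using System;

class Foo { }
class Bar { }

static class FirstPhaseDemo
{
    // Hypothetical helper (not from the question): returns the names of the
    // type arguments that were inferred for this call.
    public static string Describe<T1, T2>(Func<T1, T2> f)
    {
        return typeof(T1).Name + ", " + typeof(T2).Name;
    }

    public static string Run()
    {
        // The explicit parameter type in (Foo x) supplies T1 = Foo in the
        // first phase; T2 = Bar follows later from the lambda's return type.
        return Describe((Foo x) => new Bar());
    }
}
```

Calling `FirstPhaseDemo.Run()` returns `"Foo, Bar"`, which matches example 2 in the question.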

Then comes the second phase, which is effectively a loop that tries to narrow down the type of each generic type parameter until it either finds all of them or gives up. The one important bullet point is the last one:

7.5.2.2 The second phase

The second phase proceeds as follows:

[...]

Otherwise, for all arguments Eᵢ with corresponding parameter type Tᵢ where the output types (§7.5.2.4) contain unfixed type variables Xj but the input types (§7.5.2.3) do not, an output type inference (§7.5.2.6) is made from Eᵢ to Tᵢ. Then the second phase is repeated.

[Translated and applied to your case, this means:

Otherwise, if the return type of the delegate (i.e. Func<T2,T3>) contains an as yet undetermined type variable (it does) but its parameter types (i.e. T1) do not (they do not, we already know that T1 = Foo), an output type inference (§7.5.2.6) is made.]
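A familiar case where this bullet point does its job may help. The following is only a sketch; `SecondPhaseDemo` and `Apply` are hypothetical names, not part of the question. The lambda's parameter is implicitly typed, so nothing can be learned from it until `T` has been fixed from the first argument.

```csharp
using System;

static class SecondPhaseDemo
{
    // Hypothetical helper: reports the inferred type arguments.
    // T can be fixed from the first argument alone; TResult only becomes
    // inferable once the lambda's input type (T) is known.
    public static string Apply<T, TResult>(T value, Func<T, TResult> f)
    {
        return typeof(T).Name + " -> " + typeof(TResult).Name;
    }

    public static string Run()
    {
        // 42 gives T = int. With the input type known, an output type
        // inference on the lambda yields TResult = string.
        return Apply(42, x => x.ToString());
    }
}
```

Here `SecondPhaseDemo.Run()` returns `"Int32 -> String"`. The output type inference works because the body `x.ToString()` is an ordinary expression with a type.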

The output type inference now proceeds as follows; again, only one bullet point is relevant, this time it’s the first one:

7.5.2.6 Output type inferences

An output type inference is made from an expression E to a type T in the following way:

If E is an anonymous function [it is] with inferred return type U (§7.5.2.12) and T is a delegate type or expression tree type with return type Tb, then a lower-bound inference (§7.5.2.9) is made from U to Tb.

Otherwise, [rest snipped]

The “inferred return type” U would have to come from the body of the outer lambda, which is itself the anonymous lambda (Bar y) => new Baz(), and Tb is Func<T2,T3>. Cue lower-bound inference.

I don’t think I need to quote the entire lower-bound inference algorithm now (it’s long); it is enough to say that it doesn’t mention anonymous functions. It takes care of inheritance relationships, interface implementations, array covariance, interface and delegate co-/contravariance, ... but not lambdas. Therefore, its last bullet point applies:

Otherwise, no inferences are made.

Then we come back to the second phase, which gives up because no inferences have been made for T2 and T3.

Moral of the story: the type inference algorithm is not recursive with lambdas. It can only infer types from the parameter and return types of the outer lambda, not from lambdas nested inside of it. Only lower-bound inference is recursive (so that it can take nested generic constructions like List<Tuple<List<T1>, T2>> apart); neither output type inference (§7.5.2.6) nor explicit parameter type inference (§7.5.2.7) is recursive, so they are never applied to inner lambdas.
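For contrast, here is a small sketch of the recursive part that does work. `LowerBoundDemo` and `Unpack` are hypothetical names chosen for illustration. The argument is an ordinary typed expression, so its type can be matched against the parameter type level by level.

```csharp
using System;
using System.Collections.Generic;

static class LowerBoundDemo
{
    // Hypothetical helper: reports the inferred type arguments.
    public static string Unpack<T1, T2>(List<Tuple<List<T1>, T2>> items)
    {
        return typeof(T1).Name + ", " + typeof(T2).Name;
    }

    public static string Run()
    {
        var items = new List<Tuple<List<int>, string>>();

        // The argument's type is matched against List<Tuple<List<T1>, T2>>
        // one generic layer at a time, which yields T1 = int and T2 = string.
        return Unpack(items);
    }
}
```

`LowerBoundDemo.Run()` returns `"Int32, String"`. No lambda is involved at any level, which is exactly what the nested-lambda call in the question lacks.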

Addendum

When you add a call to that identity function I:

G((Foo x) => I((Bar y) => new Baz()));

then type inference is first applied to the call to I, which results in I’s return type being inferred as Func<Bar, Baz>. Then the “inferred return type” U of the outer lambda is the delegate type Func<Bar, Baz> and Tb is Func<T2, T3>. Thus lower-bound inference will succeed because it will be faced with two explicit delegate types (Func<Bar, Baz> and Func<T2, T3>) but no anonymous functions/lambdas. This is why the identity function makes it succeed.
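The same reasoning suggests that anything which gives the body of the outer lambda a delegate type should work just as well. The sketch below tries the identity function from the question alongside two alternatives I am assuming are equivalent for this purpose: a cast and a typed local. `NestedLambdaDemo` and the `Via...` methods are hypothetical names, and `G` is changed from the question's version to return the inferred type names instead of `default(T3)`.

```csharp
using System;

class Foo { }
class Bar { }
class Baz { }

static class NestedLambdaDemo
{
    // Same signature shape as G in the question, but (illustrative change)
    // it returns the names of the inferred type arguments.
    public static string G<T1, T2, T3>(Func<T1, Func<T2, T3>> f)
    {
        return typeof(T1).Name + ", " + typeof(T2).Name + ", " + typeof(T3).Name;
    }

    // The identity function from the question.
    public static Func<T1, T2> I<T1, T2>(Func<T1, T2> f) { return f; }

    public static string ViaIdentity()
    {
        // I(...) is inferred first, so the outer body has type Func<Bar, Baz>.
        return G((Foo x) => I((Bar y) => new Baz()));
    }

    public static string ViaCast()
    {
        // A cast expression also has the delegate type Func<Bar, Baz>.
        return G((Foo x) => (Func<Bar, Baz>)(y => new Baz()));
    }

    public static string ViaTypedLocal()
    {
        // A captured local with a declared delegate type.
        Func<Bar, Baz> inner = y => new Baz();
        return G((Foo x) => inner);
    }
}
```

All three methods return `"Foo, Bar, Baz"`. In each case lower-bound inference compares Func<Bar, Baz> with Func<T2, T3> and never has to look inside a lambda.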

120

answered Oct 13 '22 18:10

Timwi
