How exactly is foreach
implemented in C#?
I imagine a part of it looking like:
var enumerator = TInput.GetEnumerator(); while(enumerator.MoveNext()) { // do some stuff here }
However I'm unsure what's really going on. What methodology is used for returning enumerator.Current
for each cycle? Does it return [for each cycle] or does it take an anonymous function or something to execute the body of foreach
?
foreach isn't a function call - it's built-into the language itself, just like for loops and while loops. There's no need for it to return anything or "take" a function of any kind.
There is no foreach in C. You can use a for loop to loop through the data but the length needs to be know or the data needs to be terminated by a know value (eg. null).
A for-each loop is a loop that can only be used on a collection of items. It will loop through the collection and each time through the loop it will use the next item from the collection. It starts with the first item in the array (the one at index 0) and continues through in order to the last item in the array.
The foreach loop is used to iterate over the elements of the collection. The collection may be an array or a list. It executes for each element present in the array. It is necessary to enclose the statements of foreach loop in curly braces {}.
It doesn't use an anonymous function, no. Basically the compiler converts the code into something broadly equivalent to the while loop you've shown here.
foreach
isn't a function call - it's built-into the language itself, just like for
loops and while
loops. There's no need for it to return anything or "take" a function of any kind.
Note that foreach
has a few interesting wrinkles:
IEnumerator
foreach
will dispose of the iterator at the end; that's simple for IEnumerator<T>
which extends IDisposable
, but as IEnumerator
doesn't, the compiler inserts a check to test at execution time whether the iterator implements IDisposable
IEnumerable
or IEnumerable<T>
, so long as you have an applicable GetEnumerator()
method which returns a type with suitable Current
and MoveNext()
members. As noted in comments, a type can also implement IEnumerable
or IEnumerable<T>
explicitly, but have a public GetEnumerator()
method which returns a type other than IEnumerator
/IEnumerator<T>
. See List<T>.GetEnumerator()
for an example - this avoids creating a reference type object unnecessarily in many cases.See section 8.8.4 of the C# 4 spec for more information.
Surprised the exact implementation is not touched. While what you have posted in the question is the simplest form, the complete implementation (including enumerator disposal, casting etc) is in the 8.8.4 section of the spec.
Now there are 2 scenarios where a foreach
loop can be run on a type:
If the type has a public/non-static/non-generic/parameterless method named GetEnumerator
which returns something that has a public MoveNext
method and a public Current
property. As noted by Mr Eric Lippert in this blog article, this was designed so as to accommodate pre generic era for both type safety and boxing related performance issues in case of value types. Note that this a case of duck typing. For instance this works:
class Test { public SomethingEnumerator GetEnumerator() { } } class SomethingEnumerator { public Something Current //could return anything { get { return ... } } public bool MoveNext() { } } //now you can call foreach (Something thing in new Test()) //type safe { }
This is then translated by the compiler to:
E enumerator = (collection).GetEnumerator(); try { ElementType element; //pre C# 5 while (enumerator.MoveNext()) { ElementType element; //post C# 5 element = (ElementType)enumerator.Current; statement; } } finally { IDisposable disposable = enumerator as System.IDisposable; if (disposable != null) disposable.Dispose(); }
If the type implements IEnumerable
where theGetEnumerator
returns IEnumerator
that has a public MoveNext
method and a public Current
property. But an interesting sub case is that even if you implement IEnumerable
explicitly (ie no public GetEnumerator
method on Test
class), you can have a foreach
.
class Test : IEnumerable { IEnumerator IEnumerable.GetEnumerator() { } }
This is because in this case foreach
is implemented as (provided there is no other public GetEnumerator
method in the class):
IEnumerator enumerator = ((IEnumerable)(collection)).GetEnumerator(); try { ElementType element; //pre C# 5 while (enumerator.MoveNext()) { ElementType element; //post C# 5 element = (ElementType)enumerator.Current; statement; } } finally { IDisposable disposable = enumerator as System.IDisposable; if (disposable != null) disposable.Dispose(); }
If the type implements IEnumerable<T>
explicitly then the foreach
is converted to (provided there is no other public GetEnumerator
method in the class):
IEnumerator<T> enumerator = ((IEnumerable<T>)(collection)).GetEnumerator(); try { ElementType element; //pre C# 5 while (enumerator.MoveNext()) { ElementType element; //post C# 5 element = (ElementType)enumerator.Current; //Current is `T` which is cast statement; } } finally { enumerator.Dispose(); //Enumerator<T> implements IDisposable }
Few interesting things to note are:
In both the above cases the Enumerator
class should have a public MoveNext
method and a public Current
property. In other words, if you're implementing IEnumerator
interface it has to be implemented implicitly. For eg, foreach
wont work for this enumerator:
public class MyEnumerator : IEnumerator { void IEnumerator.Reset() { throw new NotImplementedException(); } object IEnumerator.Current { get { throw new NotImplementedException(); } } bool IEnumerator.MoveNext() { throw new NotImplementedException(); } }
(Thanks Roy Namir for pointing this out. foreach
implementation isnt as easy it seems on the surface)
Enumerator precedence - It goes like if you have a public GetEnumerator
method, then that is the default choice of foreach
irrespective of who is implementing it. For example:
class Test : IEnumerable<int> { public SomethingEnumerator GetEnumerator() { //this one is called } IEnumerator<int> IEnumerable<int>.GetEnumerator() { } }
If you don't have a public implementation (ie only explicit implementation), then precedence goes like IEnumerator<T>
> IEnumerator
.
There is a cast operator involved in the implementation of foreach
where the collection element is cast back to the type (specified in the foreach
loop itself). Which means even if you had written the SomethingEnumerator
like this:
class SomethingEnumerator { public object Current //returns object this time { get { return ... } } public bool MoveNext() { } }
You could write:
foreach (Something thing in new Test()) { }
Because Something
is type compatible with object
, going by C# rules ,or in other words, the compiler lets it if there is an explicit cast possible between the two types. Otherwise the compiler prevents it. The actual cast is performed at run time which may or may not fail.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With