I stumbled upon some "odd behaviour". I was using the F# interactive to test some code and wrote <pre class="prettyprint"><code>Seq.zip "ACT" "GGA" |> Seq.map ((<||) compare) // val it : seq<int> = seq [-1; -1; 1] </code></pre> Then I wanted to make a function out of it and wrote <pre class="prettyprint"><code>let compute xs ys = Seq.zip xs ys |> Seq.map ((<||) compare) // val compute : xs:seq<'a> -> xs:seq<'a> -> seq<int> when 'a : comparison </code></pre> That generalized the first snippet of code and I thought that was a good thing... until I tried to use it <pre class="prettyprint"><code>compute "ACT" "GGA" // val it : seq<int> = seq [-6; -4; 19] </code></pre> So somehow <code>compare</code> acts differently for the "same thing" when there is a different "point of view" (explicit type vs generics) I know how to solve it: either by making the type explicit <pre class="prettyprint"><code>let compute (xs: #seq<char>) // ... or char seq or string </code></pre> Or keeping the type generic and composing with the <code>sign</code> function <pre class="prettyprint"><code>let compute (* ... *) ((<||) compare >> sign) </code></pre> tl;dr the question is where does the difference in behavior come from exactly?

This is an intricate interplay between F# compiler optimization and .NET standard library optimization. First, F# tries hard to optimize your program. When the types are known at compile time, and the types are primitive, and comparable, then the call to <code>compare</code> gets compiled to just straight up comparison. So comparing the characters in your example would look like <code>if 'A' < 'G' then -1 elif 'A' > 'G' then 1 else 0</code>. But when you wrap the thing in a generic method, you take away the type information. The types are generic now, the compiler doesn't know that they are <code>char</code>. So the compiler is forced to fall back to calling <code>HashCompare.GenericComparisonIntrinsic</code>, which in turn calls <code>IComparable.CompareTo</code> on the arguments. And now guess how <code>IComparable</code> is implemented on the <code>char</code> type? It simply subtracts the values and returns the result. Seriously, try this in C#: <pre class="prettyprint"><code>Console.WriteLine( 'A'.CompareTo('G') ); // prints -6 </code></pre> Note that such implementation of <code>IComparable</code> is not technically a bug. According to the documentation, it doesn't have to return only <code>[-1,0,+1]</code>, it can return any value so long as its sign is correct. My best guess would be that this is also done for optimization. F# documentation for <code>compare</code> doesn't specify this at all. It just says "result of the comparison" - go figure what that's supposed to be :-) <hr> If you want your <code>compute</code> function to return only <code>[-1,0,+1]</code>, that can be easily achieved by making the function <code>inline</code>: <pre class="prettyprint"><code>let inline compute xs ys = Seq.zip xs ys |> Seq.map ((<||) compare) </code></pre> Now it will get expanded at call site, where the types are known, and the optimized code can be inserted. Keep in mind though that, since <code>[-1,0,+1]</code> behavior is not guaranteed in the docs, it may disappear in the future. So I would rather not rely on it.

compare working differently with generics involved

Tags:

comparison

generics

f#

I stumbled upon some "odd behaviour". I was using the F# interactive to test some code and wrote

Seq.zip "ACT" "GGA" |> Seq.map ((<||) compare)
// val it : seq<int> = seq [-1; -1; 1]

Then I wanted to make a function out of it and wrote

let compute xs ys = Seq.zip xs ys |> Seq.map ((<||) compare)
// val compute : xs:seq<'a> -> xs:seq<'a> -> seq<int> when 'a : comparison

That generalized the first snippet of code and I thought that was a good thing... until I tried to use it

compute "ACT" "GGA"
// val it : seq<int> = seq [-6; -4; 19]

So somehow compare acts differently for the "same thing" when there is a different "point of view" (explicit type vs generics)

I know how to solve it: either by making the type explicit

let compute (xs: #seq<char>) // ... or char seq or string

Or keeping the type generic and composing with the sign function

let compute (* ... *) ((<||) compare >> sign)

tl;dr the question is where does the difference in behavior come from exactly?

338

asked Aug 17 '16 17:08

Sehnsucht

1 Answers

This is an intricate interplay between F# compiler optimization and .NET standard library optimization.

First, F# tries hard to optimize your program. When the types are known at compile time, and the types are primitive, and comparable, then the call to compare gets compiled to just straight up comparison. So comparing the characters in your example would look like if 'A' < 'G' then -1 elif 'A' > 'G' then 1 else 0.

But when you wrap the thing in a generic method, you take away the type information. The types are generic now, the compiler doesn't know that they are char. So the compiler is forced to fall back to calling HashCompare.GenericComparisonIntrinsic, which in turn calls IComparable.CompareTo on the arguments.

And now guess how IComparable is implemented on the char type? It simply subtracts the values and returns the result. Seriously, try this in C#:

Console.WriteLine( 'A'.CompareTo('G') ); // prints -6

Note that such implementation of IComparable is not technically a bug. According to the documentation, it doesn't have to return only [-1,0,+1], it can return any value so long as its sign is correct. My best guess would be that this is also done for optimization.

F# documentation for compare doesn't specify this at all. It just says "result of the comparison" - go figure what that's supposed to be :-)

If you want your compute function to return only [-1,0,+1], that can be easily achieved by making the function inline:

let inline compute xs ys = Seq.zip xs ys |> Seq.map ((<||) compare)

Now it will get expanded at call site, where the types are known, and the optimized code can be inserted. Keep in mind though that, since [-1,0,+1] behavior is not guaranteed in the docs, it may disappear in the future. So I would rather not rely on it.

answered Nov 13 '22 07:11

Fyodor Soikin

Related questions
                            
                                Use generic to store common supertype in Java
                            
                                Create objects in GenericObjectPool
                            
                                Using lambda impedes inference of type variable
                            
                                How is it that a struct containing ValueTuple can satisfy unmanaged constraints, but ValueTuple itself cannot?
                            
                                Wildcards vs. generic methods
                            
                                C# Xml Serializing List<T> descendant with Xml Attribute
                            
                                Why is a parameter's private field visible to a generic method in Java 6 but not in Java 7? [duplicate]
                            
                                Trying to understand the Choice type in F#
                            
                                Why does Java Collector.toList() require a wildcard type placeholder in its return type?
                            
                                Conditional typing in generic method
                            
                                Generics vs. Interfaces
                            
                                implementing a cast operator in a generic abstract class
                            
                                The inherited method Object.clone() cannot hide the public abstract method
                            
                                Is there a foreach generic method in Delphi for that can be called with anonymous function
                            
                                Is there a clean way to assign the Class of a generic type to a variable?
                            
                                Can a Generic Method handle both Reference and Nullable Value types?
                            
                                C# compiler fails to recognize a class is implementing an interface
                            
                                Scala: specify a default generic type instead of Nothing
                            
                                Performance of Func<T> and inheritance
                            
                                Sorting the [Any] array

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With