According to the Haskell 2010 language report, its type checker is based on Hindley-Milner. So consider a function f
of this type,
f :: forall a. [a] -> Int
It could be the length function for instance. According to Hindley-Milner, f []
type checks to Int
. We can prove this by instantiating the type of f
to [Int] -> Int
, and the type of []
to [Int]
, then conclude that the application ([Int] -> Int) [Int]
is of type Int
.
In this proof, I chose to instantiate types forall a. [a] -> Int
and forall a. [a]
by substituting Int
to a
. I can substitute Bool
instead, the proof works too. Isn't it strange in Hindley-Milner that we can apply a polymorphic type to another, without specifying which instances we use ?
More specifically, what in Haskell prevents me from using the type a
in the implementation of f
? I could imagine that f
is a function that equals 18
on any [Bool]
, and equals the usual length function on all other types of lists. In this case, would f []
be 18
or 0
? The Haskell report says "the kernel is not formally specified", so it's hard to tell.
During type inference, such type variables can indeed instantiated to any type. This may be seen as a highly non deterministic step.
GHC, for what it is worth, uses the internal Any
type in such cases. For instance, compiling
{-# NOINLINE myLength #-}
myLength :: [a] -> Int
myLength = length
test :: Int
test = myLength []
results in the following Core:
-- RHS size: {terms: 3, types: 4, coercions: 0}
myLength [InlPrag=NOINLINE] :: forall a_aw2. [a_aw2] -> Int
[GblId, Str=DmdType]
myLength =
\ (@ a_aP5) -> length @ [] Data.Foldable.$fFoldable[] @ a_aP5
-- RHS size: {terms: 2, types: 6, coercions: 0}
test :: Int
[GblId, Str=DmdType]
test = myLength @ GHC.Prim.Any (GHC.Types.[] @ GHC.Prim.Any)
where GHC.Prim.Any
occurs in the last line.
Now, is that really not deterministic? Well, it does involve a kind of non deterministic step "in the middle" of the algorithm, but the final resulting (most general) type is Int
, and deterministically so. It does not matter what type we choose for a
, we always get type Int
at the end.
Of course, getting the same type is not enough: we also want to get the same Int
value. I conjecture that it can be proven that, given
f :: forall a. T a
g :: forall a. T a -> U
then
g @ V (f @ V) :: U
is the same value whatever type V
is. This should be a consequence of parametricity applied to those polymorphic types.
To follow-up on Chi's answer, here is the proof that f []
cannot depend on the type instances of f
and []
. According to Theorems for free (the last article here),
for any types a,a'
and any function g :: a -> a'
, then
f_a = f_a' . map g
where f_a
is the instantiation of f
on type a
, for example in Haskell
f_Bool :: [Bool] -> Int
f_Bool = f
Then if you evaluate the previous equality on []_a
, it yields f_a []_a = f_a' []_a'
. In the case of the original question, f_Int []_Int = f_Bool []_Bool
.
Some references for parametricity in Haskell would be useful too, because Haskell looks stronger than the polymorphic lambda calculus described in Walder's paper. In particular, this wiki page says parametricity can be broken in Haskell by using the seq
function.
The wiki page also says that my type-depending function exists (in other languages than Haskell), it is called ad-hoc polymorphism.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With