I'm trying to use a GADT with DataKinds, as shown below <pre class="prettyprint lang-hs prettyprint-override"><code>{-# LANGUAGE KindSignatures, DataKinds, GADTs #-} module NewGadt where data ExprType = Var | Nest data Expr (a :: ExprType) where ExprVar :: String -> Expr Var ExprNest :: Expr a -> Expr Nest data BaseExpr = BaseExprVar String | BaseExprNest BaseExpr translate :: BaseExpr -> Expr a translate (BaseExprVar id) = ExprVar id translate (BaseExprNest expr) = ExprNest $ translate expr </code></pre> Compile error: <pre class="prettyprint"><code>/home/elijah/code/monty/src/NewGadt.hs:15:32: error: • Couldn't match type ‘a’ with ‘'Var’ ‘a’ is a rigid type variable bound by the type signature for: bexprToExpr :: forall (a :: ExprType). BaseExpr -> Expr a at src/NewGadt.hs:14:1-33 Expected type: Expr a Actual type: Expr 'Var • In the expression: ExprVar id In an equation for ‘bexprToExpr’: bexprToExpr (BaseExprVar id) = ExprVar id • Relevant bindings include bexprToExpr :: BaseExpr -> Expr a (bound at src/NewGadt.hs:15:1) | 15 | bexprToExpr (BaseExprVar id) = ExprVar id | ^^^^^^^^^^ </code></pre> I would like to do this so certain functions can only work on a specific type of expr, for example: <pre class="prettyprint lang-hs prettyprint-override"><code>printVariable :: Expr Var -> IO () printVariable (ExprVar id) = putStrLn id printNested :: Expr Nest -> IO () printNested (ExprNest inner) = putStrLn "nested expression" printExpr :: Expr a -> IO () printExpr expr@ExprVar {} = printVariable expr printExpr expr@ExprNest {} = printNested expr </code></pre> And of course, calling <code>printVariable</code> with an <code>Expr Nest</code> should fail compilation. Is there a way I can have the translate function return <code>Expr a</code> like this? Or is this an inappropriate usage of DataKinds & GADTs? Edit: Solution worked! But, I had to upgrade to ghc >=8.10 and enable <code>StandaloneKindSignatures, PolyKinds</code>

You can define an existential wrapper <pre class="prettyprint"><code>{-# Language PolyKinds #-} {-# Language StandaloneKindSignatures #-} {-# Language TypeApplications #-} import Data.Kind (Type) -- -- Exists @ExprType :: (ExprType -> Type) -> Type -- type Exists :: forall k. (k -> Type) -> Type data Exists f where Exists :: f x -> Exists f </code></pre> and return <code>Exists Expr</code> <pre class="prettyprint"><code>translate :: BaseExpr -> Exists @ExprType Expr translate (BaseExprVar id) = Exists (ExprVar id) translate (BaseExprNest expr) | Exists a <- translate expr = Exists (ExprNest a) </code></pre> This uses pattern guards to unpack the existential type <blockquote> pattern guards are of the form <code>p <- e</code>, where <code>p</code> is a pattern (see Section 3.17) of type <code>t</code> and <code>e</code> is an expression type <code>t1</code>. They succeed if the expression <code>e</code> matches the pattern <code>p</code>, and introduce the bindings of the pattern to the environment. Haskell Report </blockquote> and is equivalent to these <pre class="prettyprint"><code>translate (BaseExprNest expr) = case translate expr of Exists a -> Exists (ExprNest a) </code></pre> <pre class="prettyprint"><code>{-# Language ViewPatterns #-} translate (BaseExprNest (translate -> Expr a)) = Exists (ExprNest a) </code></pre> But try it with <code>let</code> or <code>where</code> and it won't work. References <ul> <li>Stitch: The Sound Type-Indexed Type Checker</li> <li>An Existential Crisis Resolved</li> </ul>

The reason why this fails is, because you're making a promise you can't keep. The type of translate is <code>BaseExpr -> Expr a</code>, so you're really saying "If you give me a <code>BaseExpr</code>, I will give you an <code>Expr a</code> for any type <code>a</code> that you want". GHC complains, because you don't actually do this. If you call <code>translate</code> with a <code>BaseExprVar</code>, you won't actually get an <code>Expr a</code> for any type a, but you get an <code>Expr Var</code>. To solve this, you can create an existential wrapper, as explained in @iceland_jack's answer. The type <code>BaseExpr -> Exists @ExprType Expr</code> really means "If you give me a <code>BaseExpr</code>, I will give you an <code>Expr a</code> for some a, that I determine.", which is exactly what your function does.

<h3>Why doesn't your original code work?</h3> This type cannot work: <pre class="prettyprint"><code>translate :: BaseExpr -> Expr a </code></pre> Remember what type variables mean in Haskell. This type for <code>translate</code> is saying "for any type <code>a</code> the caller of <code>translate</code> chooses, it will take a <code>BaseExpr</code> and turn it into an <code>Expr a</code>". So if I want to pass <code>BaseExprVar "variableName"</code> to <code>translate</code> and receive an <code>Expr Nest</code>, I should be able to do that; I'll just choose <code>a</code> to be <code>Nest</code>. It doesn't matter that this particular <code>BaseExpr</code> is just a variable, not a nested expression, and there is no value of type <code>Expr Nest</code> that faithfully represents this. With a polymorphic GADT as a return type, that's not my problem, it's <code>translate</code>'s problem, since the type for <code>translate</code> promised to be able to come up with such a value. <code>translate</code> can't force me to choose <code>Var</code> for <code>a</code>. That isn't the type you want. You don't want the type of <code>Expr</code> that is returned to be chosen by the caller of <code>translate</code>, you want it to be chosen by the implementation of <code>translate</code> (so it can return whatever type is appropriate to the contents of the <code>BaseExpr</code>). To have the implementation of a function choose a type parameter, you need to either use an existential wrapper, or use higher-rank types and continuation passing. <h3>Existential wrapper</h3> <pre class="prettyprint"><code>data SomeExpr where SomeExpr :: Expr a -> SomeExpr translate :: BaseExpr -> SomeExpr translate (BaseExprVar id) = Some (ExprVar id) translate (BaseExprNest expr) = case translate expr of Some e -> Some (ExprNest e) </code></pre> Notice that the type parameter of <code>Expr</code> does not appear in the type for <code>translate</code>, so a caller of <code>translate</code> doesn't need to specify what it should be. To use the output of <code>translate</code>, you will need to pattern match on the <code>Some</code> constructor, which will give you a value of type <code>Expr a</code> for some unknown <code>a</code> (as opposed to the original attempt at <code>translate</code>, which gives you a value of type <code>Expr a</code> for some particular <code>a</code> that you choose). Inside the pattern match, you therefore have to handle any possible value of <code>a</code>, and the resulting value cannot have a type that depends on <code>a</code> (we're only allowed to "see" it inside the pattern match). You can see an example of this above in the recursive call; <code>translate expr</code> returned <code>Some Expr</code>, but what we needed to wrap up in the <code>ExprNest</code> constructor is <code>Expr a</code>. By pattern matching on <code>Some</code>, we get the contained <code>Expr a</code>, which we can then put inside the <code>ExprNest</code> constructor to get a value of type <code>Expr Nest</code>. Then we hide that back in a <code>Some</code> constructor, and finally have the right type to return from <code>translate</code>. You can of course also generalise the existential wrapper so that it works on more types than just <code>Expr</code>, as shown in @Iceland_jack's answer. There's nothing special to the form I've used here; it is itself just another use of GADT syntax (to embed a value with a type parameter that isn't exposed in the overall type). A general one is nice, but often you'll want some type class constraints on the hidden type so that you have more things you can do with the value when you pattern match on it, so you'll need to get more complicated to make a type general enough to reuse any time you need an existential wrapper. <h3>Higher-rank types</h3> The other way to handle the requirement for <code>translate</code> to return different types based on the data it is processing, is to use higher rank types. For this, you'll need the <code>{-# LANGUAGE RankNTypes #-}</code> extension. <pre class="prettyprint"><code>translate :: BaseExpr -> (forall a. Expr a -> r) -> r translate (BaseExprVar id) handler = handler (ExprVar id) translate (BaseExprNest expr) handler = handler (translate expr ExprNest) </code></pre> Here, instead of using an extra wrapper type to "hide" the type variable, we restructure <code>translate</code> so that it takes a "handler function" for <code>Expr</code>s. Now <code>translate</code> doesn't return any form of <code>Expr</code> at all, rather it makes one, passes it to <code>handler</code>, and returns what the handler returns. Why does this work? The trick is that <code>handler</code> is required to be passed to <code>translate</code> as a polymorphic function. The nested <code>forall a.</code> inside the type of one of <code>translate</code>'s arguments means that the caller doesn't choose <code>a</code> like it normally would, instead the caller is required to pass a function that works for any possible <code>a</code>. The code of the handler function is similar to the "inside of the pattern match" in the existential wrapper version; it has to handle any possible <code>a</code> and can't return anything whose type depends on <code>a</code>. Because the caller had to pass a <code>handler</code> that works for any <code>a</code>, the code for <code>translate</code> can use it at any type it likes, including at different types on different branches. So the first case for <code>translate</code> can pick <code>Var</code> and use <code>handler</code> as <code>Expr Var -> r</code>, while the second case can pick <code>Nest</code> and use handler as <code>Expr Nest -> r</code>. Note a subtlety in the second case. Now when <code>translate</code> needs to recursively call itself, it needs to pass a "handler function". We could try passing the handler we received in from the top, but that's not right. We're supposed to be calling that handler only once on the complete <code>Expr</code>, not calling it at every level of the <code>BaseExpr</code>. And besides, that would only give us an <code>r</code>, which is not what we need to wrap up in the <code>ExprNest</code> constructor. Instead we need to realise that <code>ExprNest</code> is itself a function of type <code>forall a. Expr a -> Expr Nest</code>, which fits with the type we need for handler functions! Translating the nested expression with the <code>ExprNest</code> constructor as the handler will translate whatever is in it, and then feed it to the constructor, returning us the complete <code>Expr Nest</code> value we need to feed to the top level handler. In this particular example the higher rank version ended up using less boilerplate than the existential wrapper version. That's often the case, but a possible downside is it makes you restructure some of your calling code into continuation passing form; you can't just call <code>translate</code> and then use the resulting value, you have to construct a function for what you would do with a translated expression if you had one, and then pass that function to <code>translate</code>. Sometimes that coding style is harder to read/write.

Using GADTs with DataKinds for type level data constructor constraints in functions

Tags:

haskell

existential-type

data-kinds

gadt

I'm trying to use a GADT with DataKinds, as shown below

{-# LANGUAGE KindSignatures, DataKinds, GADTs #-}
module NewGadt where

data ExprType = Var | Nest

data Expr (a :: ExprType) where
  ExprVar :: String -> Expr Var
  ExprNest :: Expr a -> Expr Nest

data BaseExpr
  = BaseExprVar String
  | BaseExprNest BaseExpr

translate :: BaseExpr -> Expr a
translate (BaseExprVar id) = ExprVar id
translate (BaseExprNest expr) = ExprNest $ translate expr

Compile error:

/home/elijah/code/monty/src/NewGadt.hs:15:32: error:
    • Couldn't match type ‘a’ with ‘'Var’
      ‘a’ is a rigid type variable bound by
        the type signature for:
          bexprToExpr :: forall (a :: ExprType). BaseExpr -> Expr a
        at src/NewGadt.hs:14:1-33
      Expected type: Expr a
        Actual type: Expr 'Var
    • In the expression: ExprVar id
      In an equation for ‘bexprToExpr’:
          bexprToExpr (BaseExprVar id) = ExprVar id
    • Relevant bindings include
        bexprToExpr :: BaseExpr -> Expr a (bound at src/NewGadt.hs:15:1)
   |
15 | bexprToExpr (BaseExprVar id) = ExprVar id
   |                                ^^^^^^^^^^

I would like to do this so certain functions can only work on a specific type of expr, for example:

printVariable :: Expr Var -> IO ()
printVariable (ExprVar id) = putStrLn id

printNested :: Expr Nest -> IO ()
printNested (ExprNest inner) = putStrLn "nested expression"

printExpr :: Expr a -> IO ()
printExpr expr@ExprVar {} = printVariable expr
printExpr expr@ExprNest {} = printNested expr

And of course, calling printVariable with an Expr Nest should fail compilation.

Is there a way I can have the translate function return Expr a like this? Or is this an inappropriate usage of DataKinds & GADTs?

Edit:

Solution worked! But, I had to upgrade to ghc >=8.10 and enable StandaloneKindSignatures, PolyKinds

796

asked Sep 10 '21 21:09

elimirks

3 Answers

You can define an existential wrapper

{-# Language PolyKinds                #-}
{-# Language StandaloneKindSignatures #-}
{-# Language TypeApplications         #-}

import Data.Kind (Type)

--
--   Exists @ExprType :: (ExprType -> Type) -> Type
--
type Exists :: forall k. (k -> Type) -> Type
data Exists f where
  Exists :: f x -> Exists f

and return Exists Expr

translate :: BaseExpr -> Exists @ExprType Expr
translate (BaseExprVar id)
  = Exists (ExprVar id)
translate (BaseExprNest expr)
  | Exists a <- translate expr
  = Exists (ExprNest a)

This uses pattern guards to unpack the existential type

pattern guards are of the form p <- e, where p is a pattern (see Section 3.17) of type t and e is an expression type t1. They succeed if the expression e matches the pattern p, and introduce the bindings of the pattern to the environment.

Haskell Report

and is equivalent to these

translate (BaseExprNest expr) = case translate expr of
  Exists a -> Exists (ExprNest a)

{-# Language ViewPatterns #-}

translate (BaseExprNest (translate -> Expr a)) = Exists (ExprNest a)

But try it with let or where and it won't work.

References

Stitch: The Sound Type-Indexed Type Checker
An Existential Crisis Resolved

answered Oct 23 '22 18:10

Iceland_jack

The reason why this fails is, because you're making a promise you can't keep. The type of translate is BaseExpr -> Expr a, so you're really saying "If you give me a BaseExpr, I will give you an Expr a for any type a that you want". GHC complains, because you don't actually do this. If you call translate with a BaseExprVar, you won't actually get an Expr a for any type a, but you get an Expr Var.

To solve this, you can create an existential wrapper, as explained in @iceland_jack's answer. The type BaseExpr -> Exists @ExprType Expr really means "If you give me a BaseExpr, I will give you an Expr a for some a, that I determine.", which is exactly what your function does.

answered Oct 23 '22 17:10

Prophet

Why doesn't your original code work?

This type cannot work:

translate :: BaseExpr -> Expr a

Remember what type variables mean in Haskell. This type for translate is saying "for any type a the caller of translate chooses, it will take a BaseExpr and turn it into an Expr a".

So if I want to pass BaseExprVar "variableName" to translate and receive an Expr Nest, I should be able to do that; I'll just choose a to be Nest. It doesn't matter that this particular BaseExpr is just a variable, not a nested expression, and there is no value of type Expr Nest that faithfully represents this. With a polymorphic GADT as a return type, that's not my problem, it's translate's problem, since the type for translate promised to be able to come up with such a value. translate can't force me to choose Var for a.

That isn't the type you want. You don't want the type of Expr that is returned to be chosen by the caller of translate, you want it to be chosen by the implementation of translate (so it can return whatever type is appropriate to the contents of the BaseExpr).

To have the implementation of a function choose a type parameter, you need to either use an existential wrapper, or use higher-rank types and continuation passing.

Existential wrapper

data SomeExpr
  where SomeExpr :: Expr a -> SomeExpr

translate :: BaseExpr -> SomeExpr
translate (BaseExprVar id) = Some (ExprVar id)
translate (BaseExprNest expr)
  = case translate expr of
      Some e -> Some (ExprNest e)

Notice that the type parameter of Expr does not appear in the type for translate, so a caller of translate doesn't need to specify what it should be.

To use the output of translate, you will need to pattern match on the Some constructor, which will give you a value of type Expr a for some unknown a (as opposed to the original attempt at translate, which gives you a value of type Expr a for some particular a that you choose). Inside the pattern match, you therefore have to handle any possible value of a, and the resulting value cannot have a type that depends on a (we're only allowed to "see" it inside the pattern match).

You can see an example of this above in the recursive call; translate expr returned Some Expr, but what we needed to wrap up in the ExprNest constructor is Expr a. By pattern matching on Some, we get the contained Expr a, which we can then put inside the ExprNest constructor to get a value of type Expr Nest. Then we hide that back in a Some constructor, and finally have the right type to return from translate.

You can of course also generalise the existential wrapper so that it works on more types than just Expr, as shown in @Iceland_jack's answer. There's nothing special to the form I've used here; it is itself just another use of GADT syntax (to embed a value with a type parameter that isn't exposed in the overall type). A general one is nice, but often you'll want some type class constraints on the hidden type so that you have more things you can do with the value when you pattern match on it, so you'll need to get more complicated to make a type general enough to reuse any time you need an existential wrapper.

Higher-rank types

The other way to handle the requirement for translate to return different types based on the data it is processing, is to use higher rank types. For this, you'll need the {-# LANGUAGE RankNTypes #-} extension.

translate :: BaseExpr -> (forall a. Expr a -> r) -> r
translate (BaseExprVar id) handler = handler (ExprVar id)
translate (BaseExprNest expr) handler
  = handler (translate expr ExprNest)

Here, instead of using an extra wrapper type to "hide" the type variable, we restructure translate so that it takes a "handler function" for Exprs. Now translate doesn't return any form of Expr at all, rather it makes one, passes it to handler, and returns what the handler returns.

Why does this work? The trick is that handler is required to be passed to translate as a polymorphic function. The nested forall a. inside the type of one of translate's arguments means that the caller doesn't choose a like it normally would, instead the caller is required to pass a function that works for any possible a. The code of the handler function is similar to the "inside of the pattern match" in the existential wrapper version; it has to handle any possible a and can't return anything whose type depends on a.

Because the caller had to pass a handler that works for any a, the code for translate can use it at any type it likes, including at different types on different branches. So the first case for translate can pick Var and use handler as Expr Var -> r, while the second case can pick Nest and use handler as Expr Nest -> r.

Note a subtlety in the second case. Now when translate needs to recursively call itself, it needs to pass a "handler function". We could try passing the handler we received in from the top, but that's not right. We're supposed to be calling that handler only once on the complete Expr, not calling it at every level of the BaseExpr. And besides, that would only give us an r, which is not what we need to wrap up in the ExprNest constructor. Instead we need to realise that ExprNest is itself a function of type forall a. Expr a -> Expr Nest, which fits with the type we need for handler functions! Translating the nested expression with the ExprNest constructor as the handler will translate whatever is in it, and then feed it to the constructor, returning us the complete Expr Nest value we need to feed to the top level handler.

In this particular example the higher rank version ended up using less boilerplate than the existential wrapper version. That's often the case, but a possible downside is it makes you restructure some of your calling code into continuation passing form; you can't just call translate and then use the resulting value, you have to construct a function for what you would do with a translated expression if you had one, and then pass that function to translate. Sometimes that coding style is harder to read/write.

answered Oct 23 '22 16:10

Ben

Related questions
                            
                                Why does this function fail to typecheck?
                            
                                Wadler's paper: How is a tuple a function?
                            
                                Operator section for applicative with <$> and <*>
                            
                                Monadic alteration to tuple
                            
                                Cartesian product over a list of lists in Haskell
                            
                                When is my Haskell expression evaluated?
                            
                                Dynamic stdout in Haskell
                            
                                Lists, boxed vectors, and unboxed vectors for heavy scientific computations
                            
                                What is the story behind a Getter?
                            
                                Constrained Value Types in Haskell
                            
                                Why does httpJSON fail, but httpLBS succeeds?
                            
                                No instance for (Eq a) arising from a use of ‘==’
                            
                                explain this haskell funtion "list of tuples to list"
                            
                                Should my monoid be left- or right-biased?
                            
                                Which polymorphism types are supported in Haskell?
                            
                                Why is there difference between throw and throwIO?
                            
                                How is it possible that a binding shadows the existing binding in `case of` block?
                            
                                Are there examples of functors in Haskell which fails to be monads because we cannot implement `return`?
                            
                                Does Haskell have an eager version of `foldr`?
                            
                                Why are primitives not needed for haskell data constructors?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With