I'd like to make a typed AST for a dynamic language. At present, I'm stuck on handling collections. Here's a representative code sample: <pre class="prettyprint"><code>{-# LANGUAGE GADTs #-} {-# LANGUAGE DataKinds #-} {-# LANGUAGE KindSignatures #-} {-# LANGUAGE ExistentialQuantification #-} data Box = forall s. B s data BinOp = Add | Sub | Mul | Div deriving (Eq, Show) data Flag = Empty | NonEmpty data List :: Flag -> * -> * where Nil :: List Empty a Cons :: a -> List f a -> List NonEmpty a data Expr ty where EInt :: Integer -> Expr Integer EDouble :: Double -> Expr Double -- EList :: List -> Expr List </code></pre> While I can construct instances of <code>List</code> well enough: <pre class="prettyprint"><code>*Main> :t (Cons (B (EInt 1)) (Cons (B (EDouble 2.0)) Nil)) (Cons (B (EInt 1)) (Cons (B (EDouble 2.0)) Nil)) :: List Box 'NonEmpty </code></pre> I'm not at all sure how to encode this type in <code>Expr</code> for <code>EList</code>. Am I even on the right path here?

One way to approach this problem is to tag values with run-time type representatives. I'm channelling Stephanie Weirich, here. Let's have a small example. First, give a representation to some types. That's typically done with a singleton construction. <pre class="prettyprint"><code>data Type :: * -> * where Int :: Type Int Char :: Type Char List :: Type x -> Type [x] </code></pre> So <code>Type Int</code> contains one value, which I've also called <code>Int</code>, because it acts as the run-time representative of the type <code>Int</code>. If you can see colour even in monochrome things, the <code>Int</code> left of the <code>::</code> is red, and the <code>Int</code> after <code>Type</code> is blue. Now we can do existential packaging, preserving utility. <pre class="prettyprint"><code>data Cell :: * where (:::) :: x -> Type x -> Cell </code></pre> A <code>Cell</code> is a value tagged with a run-time representative of its type. You can recover the utility of the value by reading its type tag. Indeed, as types are first-order structures, we can check them for equality in a useful way. <pre class="prettyprint"><code>data EQ :: k -> k -> * where Refl :: EQ x x typeEQ :: Type x -> Type y -> Maybe (EQ x y) typeEQ Int Int = Just Refl typeEQ Char Char = Just Refl typeEQ (List s) (List t) = case typeEQ s t of Just Refl -> Just Refl Nothing -> Nothing typeEQ _ _ = Nothing </code></pre> A Boolean equality on type representatives is no use: we need the equality test to construct the evidence that the represented types can be unified. With the evidence-producing test, we can write <pre class="prettyprint"><code>gimme :: Type x -> Cell -> Maybe x gimme t (x ::: s) = case typeEQ s t of Just Refl -> Just x Nothing -> Nothing </code></pre> Of course, writing the type tags is a nuisance. But why keep a dog and bark yourself? <pre class="prettyprint"><code>class TypeMe x where myType :: Type x instance TypeMe Int where myType = Int instance TypeMe Char where myType = Char instance TypeMe x => TypeMe [x] where myType = List myType cell :: TypeMe x => x -> Cell cell x = x ::: myType </code></pre> And now we can do things like <pre class="prettyprint"><code>myCells :: [Cell] myCells = [cell (length "foo"), cell "foo"] </code></pre> and then get <pre class="prettyprint"><code>> gimme Int (head myCells) Just 3 </code></pre> Of course, it would all be so much tidier if we didn't have to do the singleton construction and could just pattern-match on such types as we might choose to retain at run-time. I expect we'll get there when the mythical <code>pi</code> quantifier becomes less mythical.

How to specify the type for a heterogenous collection in a GADT formulated AST?

Tags:

haskell

existential-type

data-kinds

gadt

I'd like to make a typed AST for a dynamic language. At present, I'm stuck on handling collections. Here's a representative code sample:

{-# LANGUAGE GADTs #-}
{-# LANGUAGE DataKinds #-}
{-# LANGUAGE KindSignatures #-}
{-# LANGUAGE ExistentialQuantification #-}

data Box = forall s. B s

data BinOp = Add | Sub | Mul | Div
             deriving (Eq, Show)

data Flag = Empty | NonEmpty

data List :: Flag -> * -> * where
    Nil :: List Empty a
    Cons :: a -> List f a -> List NonEmpty a

data Expr ty where
    EInt :: Integer -> Expr Integer
    EDouble :: Double -> Expr Double
--    EList :: List -> Expr List

While I can construct instances of List well enough:

*Main> :t (Cons (B (EInt 1)) (Cons (B (EDouble 2.0)) Nil))
(Cons (B (EInt 1)) (Cons (B (EDouble 2.0)) Nil))
  :: List Box 'NonEmpty

I'm not at all sure how to encode this type in Expr for EList. Am I even on the right path here?

506

asked Dec 14 '15 04:12

troutwine

1 Answers

One way to approach this problem is to tag values with run-time type representatives. I'm channelling Stephanie Weirich, here. Let's have a small example. First, give a representation to some types. That's typically done with a singleton construction.

data Type :: * -> * where
  Int   :: Type Int
  Char  :: Type Char
  List  :: Type x -> Type [x]

So Type Int contains one value, which I've also called Int, because it acts as the run-time representative of the type Int. If you can see colour even in monochrome things, the Int left of the :: is red, and the Int after Type is blue.

Now we can do existential packaging, preserving utility.

data Cell :: * where
 (:::) :: x -> Type x -> Cell

A Cell is a value tagged with a run-time representative of its type. You can recover the utility of the value by reading its type tag. Indeed, as types are first-order structures, we can check them for equality in a useful way.

data EQ :: k -> k -> * where
  Refl :: EQ x x

typeEQ :: Type x -> Type y -> Maybe (EQ x y)
typeEQ Int Int = Just Refl
typeEQ Char Char = Just Refl
typeEQ (List s) (List t) = case typeEQ s t of
  Just Refl -> Just Refl
  Nothing -> Nothing
typeEQ _ _ = Nothing

A Boolean equality on type representatives is no use: we need the equality test to construct the evidence that the represented types can be unified. With the evidence-producing test, we can write

gimme :: Type x -> Cell -> Maybe x
gimme t (x ::: s) = case typeEQ s t of
  Just Refl -> Just x
  Nothing   -> Nothing

Of course, writing the type tags is a nuisance. But why keep a dog and bark yourself?

class TypeMe x where
  myType :: Type x

instance TypeMe Int where
  myType = Int

instance TypeMe Char where
  myType = Char

instance TypeMe x => TypeMe [x] where
  myType = List myType

cell :: TypeMe x => x -> Cell
cell x = x ::: myType

And now we can do things like

myCells :: [Cell]
myCells = [cell (length "foo"), cell "foo"]

and then get

> gimme Int (head myCells)
Just 3

Of course, it would all be so much tidier if we didn't have to do the singleton construction and could just pattern-match on such types as we might choose to retain at run-time. I expect we'll get there when the mythical pi quantifier becomes less mythical.

108

answered Sep 19 '22 10:09

pigworker

Related questions
                            
                                cabal sandbox install still fails with "packages are likely to be broken by the reinstalls"
                            
                                Pretty print llvm-general-pure ASTs as llvm-ir?
                            
                                Modifying the target of a Lens conditionally
                            
                                Haskell functional dependency a b -> c depending on c?
                            
                                Haskell (.) for function with multiple operands
                            
                                How to ignore HLint's arrow hints?
                            
                                How can I limit size of request body and headers in WAI?
                            
                                explain the Haskell breadth first numbering code to traverse trees
                            
                                Strange <<loop>> exception in Array generation
                            
                                Yesod - how to make addScriptRemote add the script in the head section?
                            
                                A type that's easy to do arithmetic with and is guaranteed in bounds
                            
                                Using Pipes to read and write binary data in Haskell
                            
                                Garbage collecting a list while running an IO action over it
                            
                                How do I "unpack" a list as individual arguments in Haskell? [duplicate]
                            
                                Dynamically generate Tasty `TestTree` from the file system
                            
                                Is it possible to improve the asymptotics of functional containers?
                            
                                Haskell - All functions of form A -> A -> ... -> A
                            
                                fromEnum toEnum Instance?
                            
                                How to Transform a List of Integers to a Matrix of True and False in Haskell
                            
                                Is it actually possible to remove "Pi" from Calculus of Constructions?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With