Data families vs Injective type families

Tags: haskell, ghc, type-families

Now that we have injective type families, is there any remaining use case for using data families over type families?

Looking at past Stack Overflow questions about data families, there is this question from a couple of years ago discussing the difference between type families and data families, and this answer about use cases of data families. Both say that the injectivity of data families is their greatest strength.

Looking at the docs on data families, I see no reason not to rewrite all uses of data families using injective type families.

For example, say I have a data family (I've merged some examples from the docs to try to squeeze in all the features of data families):

data family G a b
data instance G Int Bool = G11 Int | G12 Bool deriving (Eq)
newtype instance G () a = G21 a
data instance G [a] b where
   G31 :: c -> G [Int] b
   G32 :: G [a] Bool

I might as well rewrite it as

type family G a b = g | g -> a b
type instance G Int Bool = G_Int_Bool
type instance G () a = G_Unit_a a
type instance G [a] b = G_lal_b a b

data G_Int_Bool = G11 Int | G12 Bool  deriving (Eq)
newtype G_Unit_a a = G21 a
data G_lal_b a b where
   G31 :: c -> G_lal_b Int b
   G32 :: G_lal_b a Bool

It goes without saying that associated instances for data families correspond to associated instances with type families in the same way. Is the only remaining difference, then, that we have fewer things in the type namespace?

As a follow-up, is there any benefit to having fewer things in the type namespace? All I can think of is that this will become debugging hell for someone playing with this in GHCi: the types of the constructors all seem to indicate that the constructors belong to a single GADT.

218

asked Sep 26 '16 15:09

Alec

2 Answers

type family T a = r | r -> a
data family D a

An injective type family T satisfies the injectivity axiom

if T a ~ T b then a ~ b

But a data family satisfies the much stronger generativity axiom

if D a ~ g b then D ~ g and a ~ b

(If you like: Because the instances of D define new types that are different from any existing types.)

In fact D itself is a legitimate type in the type system, unlike a type family like T, which can only ever appear in a fully saturated application like T a. This means:

- D can be the argument to another type constructor, like MaybeT D. (MaybeT T is illegal.)
- You can define instances for D, like instance Functor D. (You can't define an instance for a type family, such as instance Functor T, and it would be unusable anyway, because instance selection for, e.g., fmap :: Functor f => (a -> b) -> f a -> f b relies on the fact that from the type f a you can determine both f and a; for this to work, f cannot be allowed to vary over type families, even injective ones.)
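The two points above can be sketched in a small module. This is only an illustration under stated assumptions: the family `Container`, the tags `ListTag` and `PairTag`, and the helper `unList` are hypothetical names that do not appear in the answer, and the family takes an extra index so that a meaningful `Functor` instance can be written for each partially applied form, which plays the role of `D` here.

```haskell
{-# LANGUAGE TypeFamilies #-}
{-# LANGUAGE KindSignatures #-}

import Data.Functor.Compose (Compose (..))
import Data.Kind (Type)

-- Hypothetical index tags (assumptions for this sketch).
data ListTag
data PairTag

-- A data family; every instance introduces a brand-new type.
data family Container (tag :: Type) :: Type -> Type

newtype instance Container ListTag a = AsList [a]
  deriving (Eq, Show)

data instance Container PairTag a = AsPair a a
  deriving (Eq, Show)

-- Class instances can be attached to a partially applied data family.
-- The same instance head with a type family in place of Container is rejected.
instance Functor (Container ListTag) where
  fmap f (AsList xs) = AsList (map f xs)

instance Functor (Container PairTag) where
  fmap f (AsPair x y) = AsPair (f x) (f y)

-- Generativity: from the argument type Container ListTag Int, the checker
-- decomposes f a into f ~ Container ListTag and a ~ Int, which selects the
-- Functor instance.
doubled :: Container ListTag Int
doubled = fmap (* 2) (AsList [1, 2, 3])

-- A partially applied data family is a valid argument to another type
-- constructor, here Compose from base.
nested :: Compose Maybe (Container PairTag) Int
nested = Compose (Just (AsPair 1 2))

-- Small helper to unwrap the list instance.
unList :: Container ListTag a -> [a]
unList (AsList xs) = xs
```

Replacing `Container` with an injective type family that maps the tags to existing types would make both `Functor` instance heads and the `Compose` application ill-formed, since type family applications may not appear in instance heads or unsaturated.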

166

answered Oct 07 '22 00:10

Reid Barton

You're missing one other detail - data families create new types. Type families can only refer to other types. In particular, every instance of a data family declares new constructors. And it's nicely generic. You can create a data instance with newtype instance if you want newtype semantics. Your instance can be a record. It can have multiple constructors. It can even be a GADT if you want.

It's exactly the difference between the type and data/newtype keywords. Injective type families don't give you new types, so they are no help in the cases where new types are what you need.
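As a rough sketch of the forms an instance can take, the module below declares one data family with a newtype instance, a record instance, and a GADT-syntax instance. The family `Store` and all constructor and function names are hypothetical, chosen only for this illustration.

```haskell
{-# LANGUAGE TypeFamilies #-}
{-# LANGUAGE GADTs #-}

import Data.Kind (Type)

-- Hypothetical family of key-indexed stores (an assumption for this sketch).
data family Store (key :: Type) :: Type -> Type

-- newtype semantics: there is only one possible key, so at most one value.
newtype instance Store () v = UnitStore (Maybe v)

-- A record instance: one slot per Bool key.
data instance Store Bool v = BoolStore
  { onFalse :: Maybe v
  , onTrue  :: Maybe v
  }

-- An instance written in GADT syntax.
data instance Store Int v where
  IntStore :: [(Int, v)] -> Store Int v

-- Each lookup pattern-matches on the constructors declared by its own instance.
lookupUnit :: Store () v -> Maybe v
lookupUnit (UnitStore mv) = mv

lookupBool :: Bool -> Store Bool v -> Maybe v
lookupBool False s = onFalse s
lookupBool True  s = onTrue s

lookupInt :: Int -> Store Int v -> Maybe v
lookupInt k (IntStore kvs) = lookup k kvs
```

Every constructor here (`UnitStore`, `BoolStore`, `IntStore`) is introduced by the instance itself. A type family instance could only point at types declared separately elsewhere.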

I understand where you're coming from. I had this same issue with the difference initially. Then I finally ran into a use case where they're useful, even without a type class getting involved.

I wanted to write an API for dealing with mutable cells in a few different contexts, without using classes. I knew I wanted to do it with a free monad with interpreters in IO, ST, and maybe some horrible hacks with unsafeCoerce to even go so far as shoehorning it into State. This wasn't for any practical purpose, of course; I was just exploring API designs.

So I had something like this:

data MutableEnv (s :: k) a ...

newRef :: a -> MutableEnv s (Ref s a)
readRef :: Ref s a -> MutableEnv s a
writeRef :: Ref s a -> a -> MutableEnv s ()

The definition of MutableEnv wasn't important: just standard free/operational monad stuff with constructors matching the three functions in the API.

But I was stuck on what to define Ref as. I didn't want some sort of class; I wanted it to be a concrete type as far as the type system was concerned.

Then late one night I was out for a walk and it hit me: what I essentially wanted was a type whose constructors are indexed by an argument type. But it had to be open, unlike a GADT, so that new interpreters could be added at will. That's exactly what a data family is: an open, type-indexed family of data types. I could complete the API with just the following:

data family Ref (s :: k) :: * -> *

Then, dealing with the underlying representation for a Ref was no big deal. Just create a data instance (or newtype instance, more likely) whenever an interpreter for MutableEnv is defined.
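A minimal sketch of what those per-interpreter instances might look like, assuming the interpreters are indexed by `IO` and `ST st`. The answer does not say which index types were used, and the constructor and function names below are hypothetical. `MutableEnv` is left out because its definition is elided in the answer, so the primitives run directly in each monad. `Type` from `Data.Kind` stands in for `*`.

```haskell
{-# LANGUAGE TypeFamilies #-}
{-# LANGUAGE PolyKinds #-}

import Control.Monad.ST (ST, runST)
import Data.IORef (IORef, newIORef, readIORef, writeIORef)
import Data.Kind (Type)
import Data.STRef (STRef, newSTRef, readSTRef, writeSTRef)

-- The open family from the answer, written with Type instead of *.
data family Ref (s :: k) :: Type -> Type

-- One newtype instance per interpreter (the index choice is an assumption).
newtype instance Ref IO a = IORefCell (IORef a)
newtype instance Ref (ST st) a = STRefCell (STRef st a)

-- Hypothetical primitives an IO interpreter could be built from.
newRefIO :: a -> IO (Ref IO a)
newRefIO x = IORefCell <$> newIORef x

readRefIO :: Ref IO a -> IO a
readRefIO (IORefCell r) = readIORef r

writeRefIO :: Ref IO a -> a -> IO ()
writeRefIO (IORefCell r) = writeIORef r

-- The same three primitives for an ST interpreter.
newRefST :: a -> ST st (Ref (ST st) a)
newRefST x = STRefCell <$> newSTRef x

readRefST :: Ref (ST st) a -> ST st a
readRefST (STRefCell r) = readSTRef r

writeRefST :: Ref (ST st) a -> a -> ST st ()
writeRefST (STRefCell r) = writeSTRef r

-- Usage: a pure computation that uses an ST-backed Ref internally.
counterST :: Int
counterST = runST (do
  r <- newRefST 0
  writeRefST r 41
  x <- readRefST r
  pure (x + 1))
```

Adding another interpreter later only requires one more `newtype instance Ref ...` declaration, in whichever module defines that interpreter. Nothing above has to change.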

This exact example isn't really useful. But it clearly illustrates something data families can do that injective type families can't.

answered Oct 06 '22 23:10

Carl
