Benefits of using type synonym families is clear - it is type-level functions. But it is not the case with data families - so my question is, what is use-cases for data families? Where should I use it?

One benefit is that data families are injective, unlike type families. If you have <pre class="prettyprint"><code>type family TF a data family DF a </code></pre> Then you know that <code>DF a ~ DF b</code> implies that <code>a ~ b</code>, while with TF, you don't -- for any <code>a</code> you can be sure that <code>DF a</code> is a completely new type (just like <code>[a]</code> is a different type from <code>[b]</code>, unless of course <code>a ~ b</code>), while a type family can map multiple input types onto the same existing type. A second is that data families can be partially applied, like any other type constructor, while type families can't. This is not a particularly real-world example, but for example, you can do: <pre class="prettyprint"><code>data instance DF Int = DInt Int data instance DF String = DString String class C t where foo :: t Int -> t String instance C DF where -- notice we are using DF without an argument -- notice also that you can write instances for data families at all, -- unlike type families foo (DInt i) = DString (show i) </code></pre> Basically, <code>DF</code> and <code>DF a</code> are actual, first-class, legitimate types, in themselves, like any other type you declare with <code>data</code>. <code>TF a</code> is just an intermediate form that evaluates to a type. But I suppose all of that's not very enlightening, or at least it wasn't for me, when I was wondering about data families and read similar things. Here's the rule of thumb I go by. Whenever you find yourself repeating the pattern that you have a type family, and for every input type, you declare a new <code>data</code> type for the type family to map onto, it's nicer to cut out the middleman and use a data family instead. A real-world example from the vector library. <code>vector</code> has several different kinds of Vectors: boxed vectors, unboxed vectors, primitive vectors, storable vectors. For each <code>Vector</code> type there is a corresponding, mutable <code>MVector</code> type (normal Vectors are immutable). So it looks like this: <pre class="prettyprint"><code>type family Mutable v :: * -> * -> * -- the result type has two type parameters module Data.Vector{.Mutable} where data Vector a = ... data MVector s a = ... type instance Mutable Vector = MVector module Data.Vector.Storable{.Mutable} where data Vector a = ... data MVector s a = ... type instance Mutable Vector = MVector [etc.] </code></pre> Now instead of that, I would rather have: <pre class="prettyprint"><code>data family Mutable v :: * -> * -> * module Data.Vector{.Mutable} where data Vector a = ... data instance Mutable Vector s a = ... type MVector = Mutable Vector module Data.Vector.Storable{.Mutable} where data Vector a = ... data instance Mutable Vector s a = ... type MVector = Mutable Vector [etc.] </code></pre> Which encodes the invariant that for every <code>Vector</code> type there is exactly one <code>Mutable Vector</code> type, and that there is a one-to-one correspondence between them. The mutable version of a <code>Vector</code> is always called <code>Mutable Vector</code>: that is its name, and it has no other. If you have a <code>Mutable Vector</code>, you can get the type of the corresponding immutable <code>Vector</code>, because it's right there as a type argument. With <code>type family Mutable</code>, once you apply it to an argument it evaluates to an unspecified result type (presumably called <code>MVector</code>, but you can't know), and you have no way to map backwards.

data families use cases

1 Answers

One benefit is that data families are injective, unlike type families.

If you have

type family TF a
data family DF a

Then you know that DF a ~ DF b implies that a ~ b, while with TF, you don't -- for any a you can be sure that DF a is a completely new type (just like [a] is a different type from [b], unless of course a ~ b), while a type family can map multiple input types onto the same existing type.

A second is that data families can be partially applied, like any other type constructor, while type families can't.

This is not a particularly real-world example, but for example, you can do:

data instance DF Int    = DInt    Int
data instance DF String = DString String

class C t where
    foo :: t Int -> t String

instance C DF where -- notice we are using DF without an argument
                    -- notice also that you can write instances for data families at all,
                    -- unlike type families
    foo (DInt i) = DString (show i)

Basically, DF and DF a are actual, first-class, legitimate types, in themselves, like any other type you declare with data. TF a is just an intermediate form that evaluates to a type.

But I suppose all of that's not very enlightening, or at least it wasn't for me, when I was wondering about data families and read similar things.

Here's the rule of thumb I go by. Whenever you find yourself repeating the pattern that you have a type family, and for every input type, you declare a new data type for the type family to map onto, it's nicer to cut out the middleman and use a data family instead.

A real-world example from the vector library. vector has several different kinds of Vectors: boxed vectors, unboxed vectors, primitive vectors, storable vectors. For each Vector type there is a corresponding, mutable MVector type (normal Vectors are immutable). So it looks like this:

type family Mutable v :: * -> * -> * -- the result type has two type parameters

module Data.Vector{.Mutable} where
data Vector a = ...
data MVector s a = ...
type instance Mutable Vector = MVector

module Data.Vector.Storable{.Mutable} where
data Vector a = ...
data MVector s a = ...
type instance Mutable Vector = MVector

[etc.]

Now instead of that, I would rather have:

data family Mutable v :: * -> * -> *

module Data.Vector{.Mutable} where
data Vector a = ...
data instance Mutable Vector s a = ...
type MVector = Mutable Vector

module Data.Vector.Storable{.Mutable} where
data Vector a = ...
data instance Mutable Vector s a = ...
type MVector = Mutable Vector

[etc.]

Which encodes the invariant that for every Vector type there is exactly one Mutable Vector type, and that there is a one-to-one correspondence between them. The mutable version of a Vector is always called Mutable Vector: that is its name, and it has no other. If you have a Mutable Vector, you can get the type of the corresponding immutable Vector, because it's right there as a type argument. With type family Mutable, once you apply it to an argument it evaluates to an unspecified result type (presumably called MVector, but you can't know), and you have no way to map backwards.

123

answered Oct 22 '22 06:10

glaebhoerl

Related questions
                            
                                If firewalls don't accept incoming connections by default how do p2p networks work?
                            
                                DotNetZip BadReadException on .Extract
                            
                                Ruby Object Model - ancestors of a class
                            
                                Check for framework's existence at compile time?
                            
                                Using AngularJS within Haml views of a Rails app
                            
                                How can select a column and do a TRANSFORM in Hive?
                            
                                data.table: using setkey with a column name variable
                            
                                why there are two process when i run python manage.py runserver
                            
                                How to use -thread compiler flag with ocamlbuild?
                            
                                python pandas: pivot_table silently drops indices with nans
                            
                                Which code is shared between the original Git and libgit2?
                            
                                Why do we need to use boost::asio::io_service::work?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

data families use cases

Tags:

John Rivers

People also ask

1 Answers

glaebhoerl

Recent Activity

Donate For Us