I'm working through the online LYAH book (the link will take you directly to the section that my question concerns). The author defines a binary tree data type, and shows how it can be made an instance of the type Foldable (defined in Data.Foldable) by implementing the foldMap function: <pre class="prettyprint"><code>import Data.Monoid import qualified Data.Foldable as F data Tree a = Empty | Node a (Tree a) (Tree a) deriving (Show, Read, Eq) instance F.Foldable Tree where foldMap f Empty = mempty foldMap f (Node x l r) = F.foldMap f l `mappend` f x `mappend` F.foldMap f r </code></pre> The type declaration of foldMap is as follows: <pre class="prettyprint"><code>F.foldMap :: (Monoid m, F.Foldable t) => (a -> m) -> t a -> m </code></pre> so it takes a function that takes an instance of type "a" and returns a monoid. Now as an example, the author creates a Tree instance <pre class="prettyprint"><code> testTree = Node 5 (Node 3 (Node 1 Empty Empty) (Node 6 Empty Empty) ) (Node 9 (Node 8 Empty Empty) (Node 10 Empty Empty) ) </code></pre> and performs the following fold (defined for Foldable types): <pre class="prettyprint"><code>F.foldl (+) 0 testTree -- the answer is 42 (sum of the Node Integers) </code></pre> My question is, how does Haskell figure out that addition over the Integer type - querying Haskell for the type of testTree gives Tree [Integer] - can be viewed as a monoid operation (if my terminology is correct)? (My own attempt at the answer: The author a few paragraphs before this section describes how the Num type can be interpreted as a Monoid type in two different ways; by wrapping them into the Sum and Product type with (+) and (*) as the mappend functions and 0 and 1 as the mempty element, respectively. Is the type of "a" in (Tree a) somehow being inferred as belonging to the Sum type (the way Haskell variously interprets numerical values according to the context) or is it something else entirely? ]

<blockquote> My question is, how does Haskell figure out that addition over the Integer type - querying Haskell for the type of testTree gives Tree [Integer] - can be viewed as a monoid operation (if my terminology is correct)? </blockquote> It can't! In fact, there is no <code>Monoid</code> instance for <code>Integer</code>. Now, don't get me wrong--integers are a monoid under addition. They're also a monoid under multiplication, however, and Haskell has no way to know which to use, hence the <code>newtype</code> wrappers. But... none of that is what's going on here. Moving on... <blockquote> (My own attempt at the answer: The author a few paragraphs before this section describes how the Num type can be interpreted as a Monoid type in two different ways; by wrapping them into the Sum and Product type with (+) and (*) as the mappend functions and 0 and 1 as the mempty element, respectively. Is the type of "a" in (Tree a) somehow being inferred as belonging to the Sum type (the way Haskell variously interprets numerical values according to the context) or is it something else entirely? ] </blockquote> Not a bad guess, but that sort of inference (finding the instance using <code>Sum</code> based on the arguments you gave) is beyond what Haskell can do for you. There's two key points here--first of all, the <code>Monoid</code> constraint is only used for certain functions, not folds in general. In particular, <code>foldl</code> doesn't actually need a <code>Monoid</code> instance at all, because you provide both the binary operation and initial value for it to use. The second point is what I suspect you're really after--how does it create a generic <code>foldl</code> that doesn't need a <code>Monoid</code>, when all you defined is <code>foldMap</code>, which does? To answer that, we can simply look at the default implementation of <code>foldl</code>: <pre class="prettyprint"><code>foldl :: (a -> b -> a) -> a -> t b -> a foldl f z t = appEndo (getDual (foldMap (Dual . Endo . flip f) t)) z </code></pre> Here, <code>Endo</code> is another <code>newtype</code> wrapper, specifically for functions <code>a -> a</code> giving the <code>Monoid</code> of composition, with <code>id</code> as the identity, while <code>Dual</code> is a wrapper that reverses the direction of a <code>Monoid</code>. So the <code>Monoid</code> it's actually using here is so it can glue uses of <code>(+)</code> together with function composition, then apply the result to the seed value.

Is this Haskell type inference in action, or something else?

Tags:

haskell

type-inference

I'm working through the online LYAH book (the link will take you directly to the section that my question concerns).

The author defines a binary tree data type, and shows how it can be made an instance of the type Foldable (defined in Data.Foldable) by implementing the foldMap function:

import Data.Monoid
import qualified Data.Foldable as F

data Tree a = Empty | Node a (Tree a) (Tree a) deriving (Show, Read, Eq)

instance F.Foldable Tree where  
  foldMap f Empty = mempty  
  foldMap f (Node x l r) = F.foldMap f l `mappend`  
                           f x           `mappend`  
                           F.foldMap f r

The type declaration of foldMap is as follows:

F.foldMap :: (Monoid m, F.Foldable t) => (a -> m) -> t a -> m

so it takes a function that takes an instance of type "a" and returns a monoid.

Now as an example, the author creates a Tree instance

    testTree = Node 5  
                 (Node 3  
                    (Node 1 Empty Empty)  
                    (Node 6 Empty Empty)  
                 )  
                 (Node 9  
                    (Node 8 Empty Empty)  
                    (Node 10 Empty Empty)  
                 )

and performs the following fold (defined for Foldable types):

F.foldl (+) 0 testTree -- the answer is 42 (sum of the Node Integers)

My question is, how does Haskell figure out that addition over the Integer type - querying Haskell for the type of testTree gives Tree [Integer] - can be viewed as a monoid operation (if my terminology is correct)?

(My own attempt at the answer: The author a few paragraphs before this section describes how the Num type can be interpreted as a Monoid type in two different ways; by wrapping them into the Sum and Product type with (+) and (*) as the mappend functions and 0 and 1 as the mempty element, respectively. Is the type of "a" in (Tree a) somehow being inferred as belonging to the Sum type (the way Haskell variously interprets numerical values according to the context) or is it something else entirely? ]

202

asked Sep 08 '11 01:09

Aky

1 Answers

My question is, how does Haskell figure out that addition over the Integer type - querying Haskell for the type of testTree gives Tree [Integer] - can be viewed as a monoid operation (if my terminology is correct)?

It can't! In fact, there is no Monoid instance for Integer.

Now, don't get me wrong--integers are a monoid under addition. They're also a monoid under multiplication, however, and Haskell has no way to know which to use, hence the newtype wrappers.

But... none of that is what's going on here. Moving on...

(My own attempt at the answer: The author a few paragraphs before this section describes how the Num type can be interpreted as a Monoid type in two different ways; by wrapping them into the Sum and Product type with (+) and (*) as the mappend functions and 0 and 1 as the mempty element, respectively. Is the type of "a" in (Tree a) somehow being inferred as belonging to the Sum type (the way Haskell variously interprets numerical values according to the context) or is it something else entirely? ]

Not a bad guess, but that sort of inference (finding the instance using Sum based on the arguments you gave) is beyond what Haskell can do for you.

There's two key points here--first of all, the Monoid constraint is only used for certain functions, not folds in general. In particular, foldl doesn't actually need a Monoid instance at all, because you provide both the binary operation and initial value for it to use.

The second point is what I suspect you're really after--how does it create a generic foldl that doesn't need a Monoid, when all you defined is foldMap, which does? To answer that, we can simply look at the default implementation of foldl:

foldl :: (a -> b -> a) -> a -> t b -> a
foldl f z t = appEndo (getDual (foldMap (Dual . Endo . flip f) t)) z

Here, Endo is another newtype wrapper, specifically for functions a -> a giving the Monoid of composition, with id as the identity, while Dual is a wrapper that reverses the direction of a Monoid. So the Monoid it's actually using here is so it can glue uses of (+) together with function composition, then apply the result to the seed value.

answered Sep 29 '22 05:09

C. A. McCann

Related questions
                            
                                Example of function definition in the data constructor of a new type
                            
                                Haskell: unnecessary binary growth with module imports
                            
                                Yet another newtype vs. data (stylistic issue)
                            
                                Convert Data.Sequence to a List?
                            
                                Haskell / Miranda: Find the type of the function
                            
                                Point free notation, recursion, and pattern matching
                            
                                What type to use for in-memory image data in Haskell?
                            
                                optparse-applicative: displaying help for programs invoked with no arguments
                            
                                How to enable dead code warnings in Haskell (GHC)
                            
                                Type signatures that never make sense
                            
                                How can I constraint QuickCheck parameters, e.g. only use non-negative ints?
                            
                                Haskell Error: Non type-variable argument in the constraint: Num (a -> a -> a -> a)
                            
                                What is the difference between "bracket (mallocBytes n) free" and "allocaBytes"?
                            
                                Showing that `newtype T a = T (a -> Int)` is a Type Constructor that is Not a Functor
                            
                                Haskell Stack doesn't use system Ghc
                            
                                How to create a generic Complex type in haskell?
                            
                                How does the default definition of (<*>) in Haskell work?
                            
                                Why doesn't this function terminate in Haskell?
                            
                                How to write length function for all Monoids
                            
                                Why is GHC sometimes refusing to be lazy?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With