The defined code is <pre class="prettyprint"><code>fun foldl f e l = let fun g(x, f'') = fn y => f''(f(x, y)) in foldr g (fn x => x) l e end </code></pre> I don't understand how this works; what is the purpose of <code>g(x, f'')</code>? I also find a similar example in Haskell, the definition is quite short <pre class="prettyprint"><code>myFoldl f z xs = foldr step id xs z where step x g a = g (f a x) </code></pre>

Let's dissect the Haskell implementation of <code>myFoldl</code> and then take a look at the <strike>ocaml</strike> SML code. First, we'll look at some type signatures: <pre class="prettyprint lang-hs prettyprint-override"><code>foldr :: (a -> b -> b) -- the step function -> b -- the initial value of the accumulator -> [a] -- the list to fold -> b -- the result </code></pre> It should be noted that although the <code>foldr</code> function accepts only three arguments we are applying it two four arguments: <pre class="prettyprint lang-hs prettyprint-override"><code>foldr step id xs z </code></pre> However, as you can see the second argument to <code>foldr</code> (i.e. the inital value of the accumulator) is <code>id</code> which is a function of the type <code>x -> x</code>. Therefore, the result is also of the type <code>x -> x</code>. Hence, it accepts four arguments. Similarly, the step function is now of the type <code>a -> (x -> x) -> x -> x</code>. Hence, it accepts three arguments instead of two. The accumulator is an endofunction (i.e. a function whose domain and codomain is the same). Endofunctions have a special property, they are composed from left to right instead of from right to left. For example, let's compose a bunch of <code>Int -> Int</code> functions: <pre class="prettyprint lang-hs prettyprint-override"><code>inc :: Int -> Int inc n = n + 1 dbl :: Int -> Int dbl n = n * 2 </code></pre> The normal way to compose these functions is to use the function composition operator as follows: <pre class="prettyprint lang-hs prettyprint-override"><code>incDbl :: Int -> Int incDbl = inc . dbl </code></pre> The <code>incDbl</code> function first doubles a number and then increments it. Note that this reads from right to left. Another way to compose them is to use continuations (denoted by <code>k</code>): <pre class="prettyprint lang-hs prettyprint-override"><code>inc' :: (Int -> Int) -> Int -> Int inc' k n = k (n + 1) dbl' :: (Int -> Int) -> Int -> Int dbl' k n = k (n * 2) </code></pre> Notice that the first argument is a continuation. If we want to recover the original functions then we can do: <pre class="prettyprint lang-hs prettyprint-override"><code>inc :: Int -> Int inc = inc' id dbl :: Int -> Int dbl = dbl' id </code></pre> However, if we want to compose them then we do it as follows: <pre class="prettyprint lang-hs prettyprint-override"><code>incDbl' :: (Int -> Int) -> Int -> Int incDbl' = dbl' . inc' incDbl :: Int -> Int incDbl = incDbl' id </code></pre> Notice that although we are still using the dot operator to compose the functions, it now reads from left to right. This is the key behind making <code>foldr</code> behave as <code>foldl</code>. We fold the list from right to left but instead of folding it into a value, we fold it into an endofunction which when applied to an initial accumulator value actually folds the list from left to right. Consider our <code>incDbl</code> function: <pre class="prettyprint lang-hs prettyprint-override"><code>incDbl = incDbl' id = (dbl' . inc') id = dbl' (inc' id) </code></pre> Now consider the definition of <code>foldr</code>: <pre class="prettyprint lang-hs prettyprint-override"><code>foldr :: (a -> b -> b) -> b -> [a] -> b foldr _ acc [] = acc foldr fun acc (y:ys) = fun y (foldr fun acc ys) </code></pre> In the basis case we simply return the accumulated value. However, in the inductive case we return <code>fun y (foldr fun acc ys)</code>. Our <code>step</code> function is defined as follows: <pre class="prettyprint lang-hs prettyprint-override"><code>step :: a -> (x -> x) -> x -> x step x g a = g (f a x) </code></pre> Here <code>f</code> is the reducer function of <code>foldl</code> and is of the type <code>x -> a -> x</code>. Notice that <code>step x</code> is an endofunction of the type <code>(x -> x) -> x -> x</code> which we know can be composed left to right. Hence the folding operation (i.e. <code>foldr step id</code>) on a list <code>[y1,y2..yn]</code> looks like: <pre class="prettyprint lang-hs prettyprint-override"><code>step y1 (step y2 (... (step yn id))) -- or (step y1 . step y2 . {dots} . step yn) id </code></pre> Each <code>step yx</code> is an endofunction. Hence, this is equivalent to composing the endofunctions from left to right. When this result is applied to an initial accumulator value then the list folds from left to right. Hence, <code>myFoldl f z xs = foldr step id xs z</code>. <hr> Now consider the <code>foldl</code> function (which is written in Standard ML and not OCaml). It is defined as: <pre class="prettyprint lang-ml prettyprint-override"><code>fun foldl f e l = let fun g (x, f'') = fn y => f'' (f (x, y)) in foldr g (fn x => x) l e end </code></pre> The biggest difference between the <code>foldr</code> functions of Haskell and SML are: <ol> <li>In Haskell the reducer function has the type <code>a -> b -> b</code>.</li> <li>In SML the reducer function has the type <code>(a, b) -> b</code>.</li> </ol> Both are correct. It's only a matter of preference. In SML instead of passing two separate arguments, you pass one single tuple which contains both arguments. Now, the similarities: <ol> <li>The <code>id</code> function in Haskell is the anonymous <code>fn x => x</code> function in SML.</li> <li>The <code>step</code> function in Haskell is the function <code>g</code> in SML which takes a tuple containing the first two arguments.</li> <li>The <code>step</code> function is Haskell <code>step x g a</code> has been split into two functions in SML <code>g (x, f'') = fn y => f'' (f (x, y))</code> for more clarity.</li> </ol> If we rewrite the SML function to use the same names as in Haskell then we have: <pre class="prettyprint lang-ml prettyprint-override"><code>fun myFoldl f z xs = let step (x, g) = fn a => g (f (a, x)) in foldr step (fn x => x) xs z end </code></pre> Hence, they are exactly the same function. The expression <code>g (x, f'')</code> simply applies the function <code>g</code> to the tuple <code>(x, f'')</code>. Here <code>f''</code> is a valid identifier.

<h3>Intuition</h3> The foldl function traverses the list head to tail while operating elements with an accumulator: (...(a&otimes;x1)&otimes;...&otimes;xn-1)&otimes;xn And you want to define it via a foldr: x1&oplus;(x2&oplus;...&oplus;(xn&oplus;e)...) Rather unintuitive. The trick is that your foldr will not produce a value, but rather a function. The list traversal will operate the elements as to produce a function that, when applied to the accumulator, performs the computation you desire. Lets see a simple example to illustrate how this works. Consider <code>sum foldl (+) 0 [1,2,3] = ((0+1)+2)+3</code>. We may calculate it via foldr as follows. <pre class="prettyprint"><code> foldr &oplus; [1,2,3] id -> 1&oplus;(2&oplus;(3&oplus;id)) -> 1&oplus;(2&oplus;(id.(+3)) -> 1&oplus;(id.(+3).(+2)) -> (id.(+3).(+2).(+1)) </code></pre> So when we apply this function to 0 we get <pre class="prettyprint"><code> (id.(+3).(+2).(+1)) 0 = ((0+1)+2)+3 </code></pre> We began with the identity function and successively changed it as we traversed the list, using &oplus; where, <pre class="prettyprint"><code>n &oplus; g = g . (+n) </code></pre> Using this intuition, it isn't hard to define a sum with an accumulator via foldr. We built the computation for a given list via <code>foldr &oplus; id xs</code>. Then to calculate the sum we applied it to 0, <code>foldr &oplus; id xs 0</code>. So we have, <pre class="prettyprint"><code>foldl (+) 0 xs = foldr &oplus; id xs 0 where n &oplus; g = g . (+n) </code></pre> or equivalently, denoting <code>n &oplus; g</code> in prefix form by <code>(&oplus;) n g</code> and noting that <code>(&oplus;) n g a = (g . (+n)) a = g (a+n)</code>, <pre class="prettyprint"><code>foldl (+) 0 xs = foldr &oplus; id xs 0 where (&oplus;) n g a = g (a+n) </code></pre> Note that the &oplus; is your step function, and that you can obtain the generic result you're looking for by substituting a function f for +, and accumulator a for 0. Next let us show that the above really is correct. <h3>Formal derivation</h3> Moving on to a more formal approach. It is useful, for simplicity, to be aware of the following universal property of foldr. <pre class="prettyprint"><code>h [] = e h (x:xs) = f x (h xs) iff h = foldr f e </code></pre> This means that rather than defining foldr directly, we may instead and more simply define a function h in the form above. We want to define such an h so that, <pre class="prettyprint"><code>h xs a = foldl f a xs </code></pre> or equivalently, <pre class="prettyprint"><code>h xs = \a -> foldl f a xs </code></pre> So lets determine h. The empty case is simple: <pre class="prettyprint"><code>h [] = \a -> foldl f a [] = \a -> a = id </code></pre> The non-empty case results in: <pre class="prettyprint"><code>h (x:xs) = \a -> foldl f a (x:xs) = \a -> foldl f (f a x) xs = \a -> h xs (f a x) = step x (h xs) where step x g = \a -> g (f a x) = step x (h xs) where step x g a = g (f a x) </code></pre> So we conclude that, <pre class="prettyprint"><code>h [] = id h (x:xs) = step x (h xs) where step x g a = g (f a x) </code></pre> satisfies <code>h xs a = foldl f a xs </code> And by the universal property above (noting that the f in the universal property formula corresponds to step here, and e to id) we know that <code>h = foldr step id</code>. Therefore, <pre class="prettyprint"><code>h = foldr step id h xs a = foldl f a xs ----------------------- foldl f a xs = foldr step id xs a where step x g a = g (f a x) </code></pre>

Defining foldl in terms of foldr in Standard ML

Tags:

functional-programming

haskell

fold

sml

The defined code is

fun foldl f e l = let
    fun g(x, f'') = fn y => f''(f(x, y)) 
    in foldr g (fn x => x) l e end

I don't understand how this works; what is the purpose of g(x, f'')?

I also find a similar example in Haskell, the definition is quite short

myFoldl f z xs = foldr step id xs z
    where
        step x g a = g (f a x)

894

asked Apr 18 '15 10:04

高亮节

2 Answers

Let's dissect the Haskell implementation of myFoldl and then take a look at the ~~ocaml~~ SML code. First, we'll look at some type signatures:

foldr :: (a -> b -> b) -- the step function
      -> b             -- the initial value of the accumulator
      -> [a]           -- the list to fold
      -> b             -- the result

It should be noted that although the foldr function accepts only three arguments we are applying it two four arguments:

foldr step id xs z

However, as you can see the second argument to foldr (i.e. the inital value of the accumulator) is id which is a function of the type x -> x. Therefore, the result is also of the type x -> x. Hence, it accepts four arguments.

Similarly, the step function is now of the type a -> (x -> x) -> x -> x. Hence, it accepts three arguments instead of two. The accumulator is an endofunction (i.e. a function whose domain and codomain is the same).

Endofunctions have a special property, they are composed from left to right instead of from right to left. For example, let's compose a bunch of Int -> Int functions:

inc :: Int -> Int
inc n = n + 1

dbl :: Int -> Int
dbl n = n * 2

The normal way to compose these functions is to use the function composition operator as follows:

incDbl :: Int -> Int
incDbl = inc . dbl

The incDbl function first doubles a number and then increments it. Note that this reads from right to left.

Another way to compose them is to use continuations (denoted by k):

inc' :: (Int -> Int) -> Int -> Int
inc' k n = k (n + 1)

dbl' :: (Int -> Int) -> Int -> Int
dbl' k n = k (n * 2)

Notice that the first argument is a continuation. If we want to recover the original functions then we can do:

inc :: Int -> Int
inc = inc' id

dbl :: Int -> Int
dbl = dbl' id

However, if we want to compose them then we do it as follows:

incDbl' :: (Int -> Int) -> Int -> Int
incDbl' = dbl' . inc'

incDbl :: Int -> Int
incDbl = incDbl' id

Notice that although we are still using the dot operator to compose the functions, it now reads from left to right.

This is the key behind making foldr behave as foldl. We fold the list from right to left but instead of folding it into a value, we fold it into an endofunction which when applied to an initial accumulator value actually folds the list from left to right.

Consider our incDbl function:

incDbl = incDbl' id
       = (dbl' . inc') id
       =  dbl' (inc' id)

Now consider the definition of foldr:

foldr :: (a -> b -> b) -> b -> [a] -> b
foldr _   acc []     = acc
foldr fun acc (y:ys) = fun y (foldr fun acc ys)

In the basis case we simply return the accumulated value. However, in the inductive case we return fun y (foldr fun acc ys). Our step function is defined as follows:

step :: a -> (x -> x) -> x -> x
step x g a = g (f a x)

Here f is the reducer function of foldl and is of the type x -> a -> x. Notice that step x is an endofunction of the type (x -> x) -> x -> x which we know can be composed left to right.

Hence the folding operation (i.e. foldr step id) on a list [y1,y2..yn] looks like:

step y1 (step y2 (... (step yn id)))

-- or

(step y1 . step y2 . {dots} . step yn) id

Each step yx is an endofunction. Hence, this is equivalent to composing the endofunctions from left to right.

When this result is applied to an initial accumulator value then the list folds from left to right. Hence, myFoldl f z xs = foldr step id xs z.

Now consider the foldl function (which is written in Standard ML and not OCaml). It is defined as:

fun foldl f e l = let fun g (x, f'') = fn y => f'' (f (x, y))
                  in  foldr g (fn x => x) l e end

The biggest difference between the foldr functions of Haskell and SML are:

In Haskell the reducer function has the type a -> b -> b.
In SML the reducer function has the type (a, b) -> b.

Both are correct. It's only a matter of preference. In SML instead of passing two separate arguments, you pass one single tuple which contains both arguments.

Now, the similarities:

The id function in Haskell is the anonymous fn x => x function in SML.
The step function in Haskell is the function g in SML which takes a tuple containing the first two arguments.
The step function is Haskell step x g a has been split into two functions in SML g (x, f'') = fn y => f'' (f (x, y)) for more clarity.

If we rewrite the SML function to use the same names as in Haskell then we have:

fun myFoldl f z xs = let step (x, g) = fn a => g (f (a, x))
                     in foldr step (fn x => x) xs z end

Hence, they are exactly the same function. The expression g (x, f'') simply applies the function g to the tuple (x, f''). Here f'' is a valid identifier.

140

answered Oct 18 '22 01:10

Aadit M Shah

Intuition

The foldl function traverses the list head to tail while operating elements with an accumulator:

(...(a⊗x₁)⊗...⊗x_n-1)⊗x_n

And you want to define it via a foldr:

x₁⊕(x₂⊕...⊕(x_n⊕e)...)

Rather unintuitive. The trick is that your foldr will not produce a value, but rather a function. The list traversal will operate the elements as to produce a function that, when applied to the accumulator, performs the computation you desire.

Lets see a simple example to illustrate how this works. Consider sum foldl (+) 0 [1,2,3] = ((0+1)+2)+3. We may calculate it via foldr as follows.

   foldr ⊕ [1,2,3] id 
-> 1⊕(2⊕(3⊕id))
-> 1⊕(2⊕(id.(+3))
-> 1⊕(id.(+3).(+2))
-> (id.(+3).(+2).(+1))

So when we apply this function to 0 we get

   (id.(+3).(+2).(+1)) 0
=  ((0+1)+2)+3

We began with the identity function and successively changed it as we traversed the list, using ⊕ where,

n ⊕ g = g . (+n)

Using this intuition, it isn't hard to define a sum with an accumulator via foldr. We built the computation for a given list via foldr ⊕ id xs. Then to calculate the sum we applied it to 0, foldr ⊕ id xs 0. So we have,

foldl (+) 0 xs = foldr ⊕ id xs 0 
               where n ⊕ g = g . (+n)

or equivalently, denoting n ⊕ g in prefix form by (⊕) n g and noting that (⊕) n g a = (g . (+n)) a = g (a+n),

foldl (+) 0 xs = foldr ⊕ id xs 0
               where (⊕) n g a = g (a+n)

Note that the ⊕ is your step function, and that you can obtain the generic result you're looking for by substituting a function f for +, and accumulator a for 0.

Next let us show that the above really is correct.

Formal derivation

Moving on to a more formal approach. It is useful, for simplicity, to be aware of the following universal property of foldr.

h []     = e
h (x:xs) = f x (h xs)

      iff 

h = foldr f e

This means that rather than defining foldr directly, we may instead and more simply define a function h in the form above.

We want to define such an h so that,

h xs a = foldl f a xs

or equivalently,

h xs = \a -> foldl f a xs

So lets determine h. The empty case is simple:

h [] = \a -> foldl f a []
     = \a -> a
     = id

The non-empty case results in:

h (x:xs) = \a -> foldl f a (x:xs)
         = \a -> foldl f (f a x) xs
         = \a -> h xs (f a x)
         = step x (h xs)   where step x g = \a -> g (f a x) 
         = step x (h xs)   where step x g a = g (f a x)

So we conclude that,

h []     = id
h (x:xs) = step x (h xs) where step x g a = g (f a x)

satisfies h xs a = foldl f a xs

And by the universal property above (noting that the f in the universal property formula corresponds to step here, and e to id) we know that h = foldr step id. Therefore,

h      = foldr step id
h xs a = foldl f a xs
-----------------------
foldl f a xs = foldr step id xs a  
             where step x g a = g (f a x)

answered Oct 18 '22 00:10

Jorge Adriano

Related questions
                            
                                Why haskell use a type of ReadS a to represent a function?
                            
                                Create new record with change in one value
                            
                                What is correct way to implement splitWith from "Real World Haskell?"
                            
                                Tail function for "safe list" using GADTs
                            
                                Traversable and zippers. Necessity and sufficiency
                            
                                Haskell slow to compute Ackermann 4 1?
                            
                                What is the difference between mapcat in Clojure and concatmap in Haskell?
                            
                                Haskell: binding to fast and simple SAT solver
                            
                                Understanding a case of Haskell Type-Ambiguity
                            
                                Testing Haskell traversable with a simple example
                            
                                Clojure can make alphabet list like Haskell? [duplicate]
                            
                                use of 'def' in pandoc scripts
                            
                                Types for parser combinators
                            
                                How do you efficiently find a union of a list of lists of values in haskell?
                            
                                Generating random numbers in Haskell
                            
                                `mfix` not working as expected
                            
                                How to detect end of input with pipes
                            
                                Can anybody explain GHC's definition of IO?
                            
                                Strange behaviour in GHCi
                            
                                Directly generating specific subsets of a powerset?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With