As an exercise, I'm trying to define a <code>ruler</code> value <pre class="prettyprint"><code>ruler :: (Num a, Enum a) => [a] </code></pre> which corresponds to the ruler function <pre class="prettyprint"><code>0,1,0,2,0,1,0,3,0,1,0,2,0,1,0,4,0,1,0,2... </code></pre> where the <code>n</code>'th element of the list (assuming the first element corresponds to <code>n=1</code>) is the largest power of 2 which evenly divides <code>n</code>. To make it more interesting, I'm trying to implement <code>ruler</code> without having to do any divisibility testing. Using a helper function <pre class="prettyprint"><code>interleave :: [a] -> [a] -> [a] </code></pre> which simply alternates the elements from the two given lists, I came up with this - but alas it doesn't work: <pre class="prettyprint"><code>interleave :: [a] -> [a] -> [a] interleave (x:xs) (y:ys) = x : y : interleave xs ys interleave _ _ = [] ruler :: (Num a, Enum a) => [a] ruler = foldr1 interleave . map repeat $ [0..] main :: IO () main = print (take 20 ruler) </code></pre> The program eventually uses up all stack space. Now, what's strange is that the program works just fine if I adjust the definition of <code>interleave</code> so that it reads <pre class="prettyprint"><code>interleave (x:xs) ys = x : head ys : interleave xs (tail ys) </code></pre> I.e. I no longer use pattern matching on the second argument. Why does using <code>head</code> and <code>tail</code> here make <code>ruler</code> terminate - after all, the pattern matching is rather defensive (I only evaluate the first element of the list spine, no?).

You are applying <code>foldr</code> with an strict combination function to an infinite list. Boiled down to a minimal example, you can view this behaviour here: <pre class="prettyprint"><code>*Main> :t const const :: a -> b -> a *Main> :t flip seq flip seq :: c -> a -> c *Main> foldr1 const [0..] 0 *Main> foldr1 (flip seq) [0..] ^CInterrupted. </code></pre> The fix is, as explained in other answers, to make <code>interleave</code> lazy. More concretely, here is what happens. First we resolve the <code>foldr1</code>, replacing every <code>:</code> of the outer list with <code>interleave</code>: <pre class="prettyprint"><code>foldr1 interleave [[0..], [1...], ...] = interleave [0...] (interleave [1...] (...)) </code></pre> In order to make progress, the first <code>interleave</code> wants to evaluate the second argument before producing the first value. But then the second wants to evaluate its second argument, and so on. With the lazy definition of <code>interleave</code>, the first value is produced before evaluating the second argument. In particular, <code>interleave [1...] (...)</code> will evaluate to <code>1 : ...</code> (which helps the first <code>interleave</code> to make progress) before evaluating stuff further down.

The difference is that pattern matching forces the first item in the spine, <code>head/tail</code> do not. You could use lazy patterns to achieve the same goal: <pre class="prettyprint"><code>interleave (x:xs) ~(y:ys) = x : y : interleave xs ys </code></pre> Note the <code>~</code>: this is equivalent to defining <code>y</code> and <code>ys</code> using <code>head</code> and <code>tail</code>. For example: the list below is undefined. <pre class="prettyprint"><code>fix (\ (x:xs) -> 1:x:xs) </code></pre> where <code>fix</code> is the fixed point combinator (e.g. from <code>Data.Function</code>). By comparison, this other list repeats <code>1</code> forever: <pre class="prettyprint"><code>fix (\ ~(x:xs) -> 1:x:xs) </code></pre> This is because the <code>1</code> is produced before the list is split between <code>x</code> and <code>xs</code>. <hr> <blockquote> Why forcing the first item in the spine triggers the problem? </blockquote> When reasoning about a recursive equation such as <pre class="prettyprint"><code>x = f x </code></pre> it often helps to regard <code>x</code> as the value "approached" by the sequence of values <pre class="prettyprint"><code>undefined f undefined f (f undefined) f (f (f undefined)) ... </code></pre> (The above intuition can be made precise through a bit of denotational semantics and the Kleene's fixed point theorem.) For instance, the equation <pre class="prettyprint"><code>x = 1 : x </code></pre> defines the "limit" of the sequence <pre class="prettyprint"><code>undefined 1 : undefined 1 : 1 : undefined ... </code></pre> which clearly converges to the repeated ones list. When using pattern matching to define recursive values, the equation becomes, e.g. <pre class="prettyprint"><code>(y:ys) = 1:y:ys </code></pre> which, due to pattern matching, translates to <pre class="prettyprint"><code>x = case x of (y:ys) -> 1:y:ys </code></pre> Let us consider its approximating sequence <pre class="prettyprint"><code>undefined case undefined of (y:ys) -> .... = undefined case undefined of (y:ys) -> .... = undefined ... </code></pre> At the second step, the <code>case</code> diverges, making the result still <code>undefined</code>. The sequence does not approach the intended "repeated ones" list, but is constantly <code>undefined</code>. Using lazy patterns, instead <pre class="prettyprint"><code>x = case x of ~(y:ys) -> 1:y:ys </code></pre> we obtain the sequence <pre class="prettyprint"><code>undefined case undefined of ~(y:ys) -> 1:y:ys = 1 : (case undefined of (y:_) -> y) : (case undefined of (_:ys) -> ys) = 1 : undefined : undefined -- name this L1 case L1 of ~(y:ys) -> 1:y:ys = 1 : (case L1 of (y:_) -> y) : (case L1 of (_:ys) -> ys) = 1 : 1 : undefined : undefined -- name this L2 case L2 of ~(y:ys) -> 1:y:ys = 1 : (case L2 of (y:_) -> y) : (case L2 of (_:ys) -> ys) = 1 : 1 : 1 : undefined : undefined </code></pre> which does converge to the intended list. Note how lazy patterns are "pushed forward" without evaluating the <code>case</code> argument early. This is what makes them lazy. In this way, the <code>1</code> is produced before the pattern matching is performed, making the result of the recursively defined entity non trivial.

Why would using head/tail instead of pattern matching make evaluation terminate?

Tags:

list

haskell

lazy-evaluation

As an exercise, I'm trying to define a ruler value

ruler :: (Num a, Enum a) => [a]

which corresponds to the ruler function

0,1,0,2,0,1,0,3,0,1,0,2,0,1,0,4,0,1,0,2...

where the n'th element of the list (assuming the first element corresponds to n=1) is the largest power of 2 which evenly divides n. To make it more interesting, I'm trying to implement ruler without having to do any divisibility testing.

Using a helper function

interleave :: [a] -> [a] -> [a]

which simply alternates the elements from the two given lists, I came up with this - but alas it doesn't work:

interleave :: [a] -> [a] -> [a]
interleave  (x:xs) (y:ys) = x : y : interleave xs ys
interleave  _      _      = []

ruler :: (Num a, Enum a) => [a]
ruler = foldr1 interleave . map repeat $ [0..]

main :: IO ()
main = print (take 20 ruler)

The program eventually uses up all stack space.

Now, what's strange is that the program works just fine if I adjust the definition of interleave so that it reads

interleave (x:xs) ys = x : head ys : interleave xs (tail ys)

I.e. I no longer use pattern matching on the second argument. Why does using head and tail here make ruler terminate - after all, the pattern matching is rather defensive (I only evaluate the first element of the list spine, no?).

337

asked Aug 01 '14 10:08

Frerich Raabe

Video Answer

2 Answers

You are applying foldr with an strict combination function to an infinite list.

Boiled down to a minimal example, you can view this behaviour here:

*Main> :t const
const :: a -> b -> a
*Main> :t flip seq
flip seq :: c -> a -> c
*Main> foldr1 const [0..]
0
*Main> foldr1 (flip seq) [0..]
^CInterrupted.

The fix is, as explained in other answers, to make interleave lazy.

More concretely, here is what happens. First we resolve the foldr1, replacing every : of the outer list with interleave:

foldr1 interleave [[0..], [1...], ...]
= interleave [0...] (interleave [1...] (...))

In order to make progress, the first interleave wants to evaluate the second argument before producing the first value. But then the second wants to evaluate its second argument, and so on.

With the lazy definition of interleave, the first value is produced before evaluating the second argument. In particular, interleave [1...] (...) will evaluate to 1 : ... (which helps the first interleave to make progress) before evaluating stuff further down.

160

answered Sep 25 '22 16:09

Joachim Breitner

The difference is that pattern matching forces the first item in the spine, head/tail do not.

You could use lazy patterns to achieve the same goal:

interleave  (x:xs) ~(y:ys) = x : y : interleave xs ys

Note the ~: this is equivalent to defining y and ys using head and tail.

For example: the list below is undefined.

fix (\ (x:xs) -> 1:x:xs)

where fix is the fixed point combinator (e.g. from Data.Function). By comparison, this other list repeats 1 forever:

fix (\ ~(x:xs) -> 1:x:xs)

This is because the 1 is produced before the list is split between x and xs.

Why forcing the first item in the spine triggers the problem?

When reasoning about a recursive equation such as

x = f x

it often helps to regard x as the value "approached" by the sequence of values

undefined
f undefined
f (f undefined)
f (f (f undefined))
...

(The above intuition can be made precise through a bit of denotational semantics and the Kleene's fixed point theorem.)

For instance, the equation

x = 1 : x

defines the "limit" of the sequence

undefined
1 : undefined
1 : 1 : undefined
...

which clearly converges to the repeated ones list.

When using pattern matching to define recursive values, the equation becomes, e.g.

(y:ys) = 1:y:ys

which, due to pattern matching, translates to

x = case x of (y:ys) -> 1:y:ys

Let us consider its approximating sequence

undefined
case undefined of (y:ys) -> ....   = undefined
case undefined of (y:ys) -> ....   = undefined
...

At the second step, the case diverges, making the result still undefined. The sequence does not approach the intended "repeated ones" list, but is constantly undefined.

Using lazy patterns, instead

x = case x of ~(y:ys) -> 1:y:ys

we obtain the sequence

undefined
case undefined of ~(y:ys) -> 1:y:ys 
    = 1 : (case undefined of (y:_) -> y) : (case undefined of (_:ys) -> ys)
    = 1 : undefined : undefined      -- name this L1
case L1 of ~(y:ys) -> 1:y:ys
    = 1 : (case L1 of (y:_) -> y) : (case L1 of (_:ys) -> ys)
    = 1 : 1 : undefined : undefined  -- name this L2
case L2 of ~(y:ys) -> 1:y:ys
    = 1 : (case L2 of (y:_) -> y) : (case L2 of (_:ys) -> ys)
    = 1 : 1 : 1 : undefined : undefined

which does converge to the intended list. Note how lazy patterns are "pushed forward" without evaluating the case argument early. This is what makes them lazy. In this way, the 1 is produced before the pattern matching is performed, making the result of the recursively defined entity non trivial.

answered Sep 23 '22 16:09

chi

Related questions
                            
                                Prolog List. Check if first and last element in list is similar
                            
                                How to create a generic list and populate with a Select SQL Query in c#
                            
                                How can I retrieve a set of unique arrays from a list of arrays using LINQ?
                            
                                C++ list iterator never reaches end() when iterating through
                            
                                Python, get index from list of lists
                            
                                Display Items from a Database in a JavaFX TableView
                            
                                How to store and lookup data, based on multiple xml attributes?
                            
                                How do I remove duplicate arrays in a list in Python
                            
                                C# Split time List into time ranges
                            
                                Making a list distinct in C#
                            
                                Why implement two so similar data structures like List and Tuple [duplicate]
                            
                                How to work with 2 lists at the same time in haskell
                            
                                Sort list of lists in R: sort one lists' value depending on other lists' value
                            
                                Python: keep track of elements moving within a list
                            
                                Filter Dictionary According to Existing List
                            
                                Group By & Aggregate List of Dictionaries in Python
                            
                                Naming list items via loop in R
                            
                                Java time expiring List/Set?
                            
                                Vertically align smaller bullets with larger text
                            
                                Find maximum value and index in a python list?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With