I'm learning Haskell currently (being a programmer by trade, but this is my first attempt at a functional language). I want to write a function that scans a list and returns both the minimum and maximum element of that list. Sort of what the Prelude functions <code>minimum</code> and <code>maximum</code> do, but both at the same time. I've come up with the following code: <pre class="prettyprint"><code>import Data.List -- Declaration of rand minMax :: [Int] -> Maybe (Int, Int) minMax [] = Nothing minMax (x:xs) = Just (foldl' f (x, x) xs) where f (a, b) c = (if c < a then c else a, if c > b then c else b) </code></pre> <code>rand</code> is a function that generates an infinite list of numbers. The thing is that when I append the following <code>main</code> function: <pre class="prettyprint"><code>main = print $ minMax $ take 1000000 $ rand 7666532 </code></pre> compile and run all this with profiling, it shows me it uses over 200 MB of memory, so it's definitely not a constant-space function (which I'd like it to be). The question is why and what should I change to fix it. As I understand <code>foldl'</code> folds the list from left (same way it's generated) and is not lazy, so I don't see why the memory usage is so high. I'm pretty sure it's the <code>minMax</code> function that is incorrect, as simply printing the said list, using <pre class="prettyprint"><code>main = print $ take 1000000 $ rand 7666532 </code></pre> gives me 1MB usage, something that I understand and expect.

Note that <code>foldl'</code> forces the accumulator to weak head normal form. Since the accumulator is a tuple it does not force the evaluation of the two elements of the tuple. If you explicitly force the two elements you get a constant-space function: <pre class="prettyprint"><code>f (a, b) c = a `seq` b `seq` (if c < a then c else a, if c > b then c else b) </code></pre> In your original program you are building a tuple of the kind: <code>(<thunk>, <thunk>)</code> and every time <code>f</code> is applied you simply build a tuple with bigger and bigger thunks. When finally this is consumed by <code>print</code> the call to <code>show</code> forces the full evaluation of the tuple and all the comparisons are made at that point. Using <code>seq</code> you instead force <code>f</code> to evaluate the comparison at that moment, and thus the thunks contained in the accumulator are evaluated before performing the comparison. Hence the result is that the thunks stored in the accumulator have constant size. What <code>foldl'</code> does is simply avoid building the thunk: <code>f (f (f ...) y) x</code>. An alternative solution, as suggested by Jubobs, to avoid explicitly using <code>seq</code> is to use a data type that has strict fields: <pre class="prettyprint"><code>data Pair a b = Pair !a !b deriving Show </code></pre> And so the code would become: <pre class="prettyprint"><code>minMax :: [Int] -> Maybe (Pair Int Int) minMax [] = Nothing minMax (x:xs) = Just (foldl' f (Pair x x) xs) where f (Pair a b) c = Pair (if c < a then c else a) (if c > b then c else b) </code></pre> This avoids thunks altogether.

Why is this code not constant-space?

Tags:

performance

haskell

I'm learning Haskell currently (being a programmer by trade, but this is my first attempt at a functional language).

I want to write a function that scans a list and returns both the minimum and maximum element of that list. Sort of what the Prelude functions minimum and maximum do, but both at the same time. I've come up with the following code:

import Data.List  -- Declaration of rand  minMax :: [Int] -> Maybe (Int, Int) minMax []   = Nothing minMax (x:xs) = Just (foldl' f (x, x) xs)                 where                   f (a, b) c = (if c < a then c else a, if c > b then c else b)

rand is a function that generates an infinite list of numbers. The thing is that when I append the following main function:

main = print $ minMax $ take 1000000 $ rand 7666532

compile and run all this with profiling, it shows me it uses over 200 MB of memory, so it's definitely not a constant-space function (which I'd like it to be).

The question is why and what should I change to fix it. As I understand foldl' folds the list from left (same way it's generated) and is not lazy, so I don't see why the memory usage is so high. I'm pretty sure it's the minMax function that is incorrect, as simply printing the said list, using

main = print $ take 1000000 $ rand 7666532

gives me 1MB usage, something that I understand and expect.

783

asked Sep 02 '15 12:09

Torinthiel

1 Answers

Note that foldl' forces the accumulator to weak head normal form. Since the accumulator is a tuple it does not force the evaluation of the two elements of the tuple.

If you explicitly force the two elements you get a constant-space function:

f (a, b) c = a `seq` b `seq` (if c < a then c else a, if c > b then c else b)

In your original program you are building a tuple of the kind: (<thunk>, <thunk>) and every time f is applied you simply build a tuple with bigger and bigger thunks. When finally this is consumed by print the call to show forces the full evaluation of the tuple and all the comparisons are made at that point.

Using seq you instead force f to evaluate the comparison at that moment, and thus the thunks contained in the accumulator are evaluated before performing the comparison. Hence the result is that the thunks stored in the accumulator have constant size.

What foldl' does is simply avoid building the thunk: f (f (f ...) y) x.

An alternative solution, as suggested by Jubobs, to avoid explicitly using seq is to use a data type that has strict fields:

data Pair a b = Pair !a !b     deriving Show

And so the code would become:

minMax :: [Int] -> Maybe (Pair Int Int) minMax []   = Nothing minMax (x:xs) = Just (foldl' f (Pair x x) xs)                 where                   f (Pair a b) c = Pair (if c < a then c else a) (if c > b then c else b)

This avoids thunks altogether.

answered Oct 10 '22 02:10

Bakuriu

Related questions
                            
                                What is Azul "Zing"? [closed]
                            
                                C++ 11 auto compile time or runtime?
                            
                                Performance of C++ vs Virtual Machine languages in high frequency finance
                            
                                Why does n++ execute faster than n=n+1?
                            
                                x=x+1 vs. x +=1
                            
                                Difference in performance of compiled accelerate code ran from ghci and shell
                            
                                Excessive mysterious system time use in a GHC-compiled binary
                            
                                Why is string.intern() so slow?
                            
                                AppFabric Caching - Proper use of DataCacheFactory and DataCache
                            
                                Is $(document).ready necessary if I put all my JavaScript at the bottom of the page? [duplicate]
                            
                                Why is float() faster than int()?
                            
                                How do I implement threaded comments?
                            
                                Why does DbSet.Add work so slow?
                            
                                Is it possible to force an existing Java application to use no more than x cores?
                            
                                How does .NET make use of IO Threads or IO Completion Ports?
                            
                                Why does breaking the "output dependency" of LZCNT matter?
                            
                                What is the performance impact of Scala implicit type conversions?
                            
                                Preventing performance regressions in R
                            
                                Why does the Sun JVM continue to consume ever more RSS memory even when the heap, etc sizes are stable?
                            
                                What does Intel mean by "retired"?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With