I am solving some problems of Project Euler in Haskell. I wrote a program for a riddle in it and it did not work as I expected. When I looked in the task manager when running the program I saw that it was using > 1 gigabyte of RAM on ghc. A friend of me wrote a program with the same meaning in Java and succeeded in 7 seconds. <pre class="prettyprint"><code>import Data.List opl = find vw $ map (\x-> fromDigits (x++[0,0,9]) ) $ sequence [[1],re,[2],re,[3],re,[4],re,[5],re,[6],re,[7],re,[8],re] vw x = hh^2 == x where hh = (round.sqrt.fromIntegral) x re = [0..9] fromDigits x = foldl1 (\n m->10*n+m) x </code></pre> I know this program would output the number I want given enough RAM and time, but there has to be a better-performing way.

The main problem here is that sequence has a space leak. It is defined like this: <pre class="prettyprint"><code>sequence [] = [[]] sequence (xs:xss) = [ y:ys | y <- xs, ys <- sequence xss ] </code></pre> so the problem is that the list produced by the recursive call <code>sequence xss</code> is re-used for each of the elements of <code>xs</code>, so it can't be discarded until the end. A version without the space leak is <pre class="prettyprint"><code>myseq :: [[a]] -> [[a]] myseq xs = go (reverse xs) [] where go [] acc = [acc] go (xs:xss) acc = concat [ go xss (x:acc) | x <- xs ] </code></pre> PS. the answer seems to be <code>Just 1229314359627783009</code> Edit version avoiding the concat: <pre class="prettyprint"><code>seqlists :: [[a]] -> [[a]] seqlists xss = go (reverse xss) [] [] where go [] acc rest = acc : rest go (xs:xss) acc rest = foldr (\y r -> go xss (y:acc) r) rest xs </code></pre> note that both of these versions generate the results in a different order from the standard <code>sequence</code>, so while they work for this problem we can't use one as a specialised version of <code>sequence</code>.

Following on from the answer given by Simon Marlow, here's a version of sequence that avoids the space leak while otherwise working just like the original, including preserving the order. It still uses the nice, simple list comprehension of the original sequence - the only difference is that a fake data dependency is introduced that prevents the recursive call from being shared. <pre class="prettyprint"><code>sequenceDummy d [] = d `seq` [[]] sequenceDummy _ (xs:xss) = [ y:ys | y <- xs, ys <- sequenceDummy (Just y) xss ] sequenceUnshared = sequenceDummy Nothing </code></pre> I think this is a better way of avoiding the sharing that leads to the space leak. I'd blame the excessive sharing on the "full laziness" transformation. Normally this does a great job of creating sharing that avoids recomputions, but sometimes recompution is very much more efficient than storing shared results. It'd be nice if there was a more direct way to tell the compiler not to share a specific expression - the above dummy <code>Maybe</code> argument works and is efficient, but it's basically a hack that's just complicated enough that ghc can't tell that there's no real dependency. (In a strict language you don't have these issues because you only have sharing where you explicitly bind a variable to a value.)

Space leak in list program

Tags:

performance

haskell

lazy-evaluation

I am solving some problems of Project Euler in Haskell. I wrote a program for a riddle in it and it did not work as I expected.

When I looked in the task manager when running the program I saw that it was using > 1 gigabyte of RAM on ghc. A friend of me wrote a program with the same meaning in Java and succeeded in 7 seconds.

Click to copy

import Data.List

opl = find vw $ map (\x-> fromDigits (x++[0,0,9]) ) 
        $ sequence [[1],re,[2],re,[3],re,[4],re,[5],re,[6],re,[7],re,[8],re]

vw x = hh^2 == x
    where hh = (round.sqrt.fromIntegral) x

re = [0..9]

fromDigits x = foldl1 (\n m->10*n+m) x

I know this program would output the number I want given enough RAM and time, but there has to be a better-performing way.

732

asked Jul 06 '10 20:07

Ingdas

2 Answers

The main problem here is that sequence has a space leak. It is defined like this:

Click to copy

sequence [] = [[]]
sequence (xs:xss) = [ y:ys | y <- xs, ys <- sequence xss ]

so the problem is that the list produced by the recursive call sequence xss is re-used for each of the elements of xs, so it can't be discarded until the end. A version without the space leak is

Click to copy

myseq :: [[a]] -> [[a]]
myseq xs = go (reverse xs) []
 where
  go [] acc = [acc]
  go (xs:xss) acc = concat [ go xss (x:acc) | x <- xs ]

PS. the answer seems to be Just 1229314359627783009

Edit version avoiding the concat:

Click to copy

seqlists :: [[a]] -> [[a]]
seqlists xss = go (reverse xss) [] []
 where
   go []       acc rest = acc : rest
   go (xs:xss) acc rest = foldr (\y r -> go xss (y:acc) r) rest xs

note that both of these versions generate the results in a different order from the standard sequence, so while they work for this problem we can't use one as a specialised version of sequence.

113

answered Sep 23 '22 05:09

Simon Marlow

Following on from the answer given by Simon Marlow, here's a version of sequence that avoids the space leak while otherwise working just like the original, including preserving the order.

It still uses the nice, simple list comprehension of the original sequence - the only difference is that a fake data dependency is introduced that prevents the recursive call from being shared.

Click to copy

sequenceDummy d [] = d `seq` [[]]
sequenceDummy _ (xs:xss) = [ y:ys | y <- xs, ys <- sequenceDummy (Just y) xss ]

sequenceUnshared = sequenceDummy Nothing

I think this is a better way of avoiding the sharing that leads to the space leak.

I'd blame the excessive sharing on the "full laziness" transformation. Normally this does a great job of creating sharing that avoids recomputions, but sometimes recompution is very much more efficient than storing shared results.

It'd be nice if there was a more direct way to tell the compiler not to share a specific expression - the above dummy Maybe argument works and is efficient, but it's basically a hack that's just complicated enough that ghc can't tell that there's no real dependency. (In a strict language you don't have these issues because you only have sharing where you explicitly bind a variable to a value.)

answered Sep 22 '22 05:09

RD1

Related questions
                            
                                How does memchr() work under the hood?
                            
                                Python FAQ: “How fast are exceptions?”
                            
                                two IF statements vs. one AND statement
                            
                                Why is recursion in python so slow?
                            
                                Word frequency in a large text file
                            
                                How do I reverse a UTF-8 string in place?
                            
                                Which order of nested layouts is most efficient in Android
                            
                                Why is it that bytecode might run faster than native code [closed]
                            
                                find pair of numbers whose difference is an input value 'k' in an unsorted array
                            
                                If RAM isn't a concern, is reading line by line faster or reading everything into RAM and access it? - Python
                            
                                Loop Reversal in C# Speeds Up app
                            
                                Cost of file modification time checks
                            
                                Why is processing a sorted array not faster than an unsorted array in Python?
                            
                                Why is my C++ disk write test much slower than a simply file copy using bash?
                            
                                Apache POI Java Excel Performance for Large Spreadsheets
                            
                                What to use ? time() function or $_SERVER['REQUEST_TIME'] ? Which is better?
                            
                                ng-include, ng-template or directive: which one is better for performance
                            
                                Why is the internal data of BitSet in java stored as long[] instead of int[] in Java?
                            
                                Vectorizing or Speeding up Fuzzywuzzy String Matching on PANDAS Column
                            
                                How to find CPU-intensive class in Java?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Space leak in list program

Tags:

performance

haskell

lazy-evaluation

Ingdas

People also ask

2 Answers

Simon Marlow

RD1

Recent Activity

Donate For Us