I wrote a piece of Haskell code to calculate the length of a Collatz chain. Given a number n, the next number in the sequence is n/2 if n is even or 3*n+1 if n is odd. The sequence ends when it has converged to 1. I wanted to find the length of the longest chain when starting from any number below some input number.
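For reference, a direct, unmemoized version of this computation (chainLength is just an illustrative name, not the code I actually used) would look like:

```haskell
-- Naive Collatz chain length, no memoization.
-- Counts the terms of the sequence, including the starting number and the
-- final 1, so the chain starting at 1 has length 1.
chainLength :: Int -> Int
chainLength 1 = 1
chainLength n
  | even n    = 1 + chainLength (n `div` 2)
  | otherwise = 1 + chainLength (3 * n + 1)
```

For example, the chain from 6 is 6, 3, 10, 5, 16, 8, 4, 2, 1, giving a length of 9.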
I tried implementing the length calculation as a memoized function, since I expected many chains to pass through numbers whose chain lengths had already been computed. Thus, the length of the chain starting from 726 would be just 1 + the length of the chain starting from 363, which would already have been calculated. My code is shown below.
collatz :: Int -> Int
collatz n
  | even n    = n `div` 2
  | otherwise = 3 * n + 1

collatzLength :: Int -> Int
collatzLength = (fmap len [0 ..] !!)
  where
    len 0 = 0
    len 1 = 1
    len n = 1 + (collatzLength . collatz $ n)

maxLengthBelow :: Int -> Int
maxLengthBelow = foldl1 max . fmap collatzLength . enumFromTo 1

main :: IO ()
main = print $ maxLengthBelow 10000
This code works, but it takes a huge amount of memory. Profiling a run of main with the input 10000 shows that len is only called 21664 times, as expected, but the program takes 16 seconds and 4.5Gb of memory! What is taking up all that memory? I would have expected the memoized function to produce a fast, low-memory solution.
Memoization is an optimization technique that speeds up a function by caching its previously computed results. In impure programming languages, this is typically done with a mutable map from arguments to results.
Most Haskell compilers and interpreters (all of them that I know of, actually) do not memoize polymorphic structures: a memo list with a polymorphic type is recreated every time the function is called, while a monomorphic one is built once and shared across calls.
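To illustrate the shared, monomorphic case (memoFib and table are hypothetical names, not from the question): because memoFib takes no explicit argument and its table has the monomorphic type [Integer], the list is a constant applicative form that GHC evaluates at most once, so every call shares the same partially evaluated list.

```haskell
-- Fibonacci with list-based memoization, the same shape as the
-- question's collatzLength.
memoFib :: Int -> Integer
memoFib = (table !!)
  where
    table :: [Integer]      -- monomorphic: built once, shared by all calls
    table = map f [0 ..]
    f 0 = 0
    f 1 = 1
    f n = memoFib (n - 1) + memoFib (n - 2)
```

Had table been given a polymorphic type like Num a => [a], it could not be shared between calls, and the memoization would silently disappear.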
Is memoization the same as caching? Yes, kind of: memoization is a specific type of caching. While caching can refer in general to any technique for storing results for future use (like HTTP caching), memoizing specifically means caching the return values of a function.
One of the things that makes the Collatz sequence so fun is that there are some small starting seeds that take you way, way out into the atmosphere on their way to 1. In particular, 9663 makes it all the way out to 27114424 before it collapses -- and that's a long memoization list to build!
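That peak can be checked with a small helper (trajectoryMax is a name I'm introducing here, using the same step rule as the question):

```haskell
-- Largest value reached on the Collatz trajectory of n on its way to 1.
trajectoryMax :: Int -> Int
trajectoryMax = go 1
  where
    go best 1 = max best 1
    go best n
      | even n    = go (max best n) (n `div` 2)
      | otherwise = go (max best n) (3 * n + 1)
```

For example, the trajectory of 3 is 3, 10, 5, 16, 8, 4, 2, 1, so trajectoryMax 3 is 16; trajectoryMax 9663 is the 27114424 quoted above.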
And, for what it's worth, I expect your memoization list to use three machine words per element: one for the I# constructor on the Int, one for the contained number, and one for the (:) constructor. Let's ask how much space it would take to store 27114424 elements, then:
> 27114424 * (64*3) / 1024 {-Kb-} / 1024 {-Mb-} / 1024 {-Gb-}
4.8484368324279785
So 4.5Gb sounds about right, perhaps even a little low.
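One way to keep the table small, sketched here as a suggestion rather than anything from the original answer (maxLengthBelow' and lenWith are names I'm assuming), is to cache chain lengths only for seeds up to the input bound, computing values that overshoot the bound without storing them. The table then never grows past the bound, regardless of how high a trajectory climbs:

```haskell
import qualified Data.Map.Strict as Map
import Data.List (foldl')

-- Chain length of n, threading a cache that only stores keys <= bound.
lenWith :: Int -> Map.Map Int Int -> Int -> (Map.Map Int Int, Int)
lenWith bound cache n
  | n == 1 = (cache, 1)
  | Just l <- Map.lookup n cache = (cache, l)
  | otherwise =
      let next            = if even n then n `div` 2 else 3 * n + 1
          (cache', rest)  = lenWith bound cache next
          total           = 1 + rest
          -- Only remember results for small keys; big intermediate
          -- values (like 27114424) are recomputed if ever needed again.
          cache''         = if n <= bound then Map.insert n total cache'
                                          else cache'
      in (cache'', total)

-- Same result as the question's maxLengthBelow, bounded cache.
maxLengthBelow' :: Int -> Int
maxLengthBelow' bound = snd (foldl' step (Map.empty, 0) [1 .. bound])
  where
    step (cache, best) n =
      let (cache', l) = lenWith bound cache n
      in (cache', max best l)
```

The trade-off is recomputing lengths for the rare out-of-range values, in exchange for a cache of at most bound entries instead of one proportional to the largest value any trajectory reaches.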