I've written a library called amqp-worker that provides a function called worker that polls a message queue (like RabbitMQ) for messages, calling a handler when a message is found, then going back to polling. It's leaking memory. I've profiled it, and the graph says PAP (partial function application) is the culprit. Where is the leak in my code? How can I avoid leaks when looping in IO with forever?
Here are some relevant functions. The full source is here.
Example program (this leaks):

main :: IO ()
main = do
  -- connect
  conn <- Worker.connect (fromURI "amqp://guest:guest@localhost:5672")
  -- initialize the queues
  Worker.initQueue conn queue
  Worker.initQueue conn results
  -- publish a message
  Worker.publish conn queue (TestMessage "hello world")
  -- create a worker; the program loops here
  Worker.worker def conn queue onError (onMessage conn)
worker
worker :: (FromJSON a, MonadBaseControl IO m, MonadCatch m)
       => WorkerOptions -> Connection -> Queue key a
       -> (WorkerException SomeException -> m ())
       -> (Message a -> m ())
       -> m ()
worker opts conn queue onError action =
  forever $ do
    eres <- consumeNext (pollDelay opts) conn queue
    case eres of
      Error (ParseError reason bd) ->
        onError (MessageParseError bd reason)
      Parsed msg ->
        catch
          (action msg)
          (onError . OtherException (body msg))
    liftBase $ threadDelay (loopDelay opts)
consumeNext
consumeNext :: (FromJSON msg, MonadBaseControl IO m)
            => Microseconds -> Connection -> Queue key msg -> m (ConsumeResult msg)
consumeNext pd conn queue =
  poll pd $ consume conn queue
poll
poll :: (MonadBaseControl IO m) => Int -> m (Maybe a) -> m a
poll us action = do
  ma <- action
  case ma of
    Just a  -> return a
    Nothing -> do
      liftBase $ threadDelay us
      poll us action
Here is a very simple example that demonstrates your problem:
main :: IO ()
main = worker

{-# NOINLINE worker #-}
worker :: (Monad m) => m ()
worker =
  let loop = poll >> loop
  in loop

poll :: (Monad m) => m a
poll = return () >> poll
If you remove the NOINLINE, or specialize m to IO (while compiling with -O), the leak goes away.
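Concretely, "specializing m to IO" just means fixing the monad in the type signatures. Here is a bounded sketch of the same loop with m pinned to IO (my variation, so that the program terminates; not the original code). Because the monad is concrete, GHC compiled with -O can use IO's real >> directly instead of going through the Monad dictionary, and never builds the chain of partially applied closures:

```haskell
module Main where

-- Bounded variant of the loop, with the monad fixed to IO.
-- The counter only exists so the program terminates.
worker :: Int -> IO ()
worker 0 = putStrLn "done"
worker n = poll >> worker (n - 1)

poll :: IO ()
poll = return ()

main :: IO ()
main = worker 1000000
```

Compiled with ghc -O, this runs in constant space; the interesting comparison is against the same code with a `Monad m => ...` signature and NOINLINE.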
I wrote a detailed blog post about why exactly this code leaks memory. The quick summary is, as Reid points out in his answer, that the code creates and remembers a chain of partial applications of >>s. I also filed a GHC ticket about this.
Maybe an easier example to understand is this one:

main :: IO ()
main = let c = count 0
       in c >> c

{-# NOINLINE count #-}
count :: Monad m => Int -> m ()
count 1000000 = return ()
count n = return () >> count (n+1)
Evaluating f >> g for IO actions yields some kind of closure that has references to both f and g (it's basically the composition of f and g as functions on state tokens). count 0 returns a thunk c that will evaluate to a big structure of closures of the form return () >> return () >> return () >> .... When we execute c we build up this structure, and since we have to execute c a second time, the whole structure is still live. So this program leaks memory (regardless of optimization flags).
When count is specialized to IO and optimizations are enabled, GHC has a variety of tricks available to avoid building up this data structure; but they all rely on knowing that the monad is IO.
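One way to hand GHC that knowledge while keeping the polymorphic signature is a SPECIALIZE pragma. A sketch (the pragma is the point, the code around it is the count example from above): with -O, GHC also compiles an IO-specific copy of count and uses it at IO call sites, where the >> chain can be optimized away. Note the pragma is a hint that only helps at call sites GHC can see, not a guarantee:

```haskell
module Main where

-- count stays polymorphic, but GHC is asked to also emit an
-- IO-specialized copy, compiled with knowledge of IO's >>.
{-# SPECIALIZE count :: Int -> IO () #-}
count :: Monad m => Int -> m ()
count 1000000 = return ()
count n = return () >> count (n + 1)

main :: IO ()
main = let c = count 0 in c >> c >> putStrLn "finished"
```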
Returning to the original count :: Monad m => Int -> m (), we can try to avoid building this big structure by changing the last line to

count n = return () >>= (\_ -> count (n+1))
Now the recursive call is hidden inside a lambda, so c is just a small structure return () >>= (\_ -> BODY). This does actually avoid the space leak when compiling without optimizations. However, when optimizations are enabled, GHC floats count (n+1) out of the body of the lambda (since it doesn't depend on the argument), producing

count n = return () >>= (let body = count (n+1) in \_ -> body)

and now c is a large structure again...
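Since it is the full-laziness pass that performs this floating, one blunt workaround is to switch that pass off for the module. A hedged sketch (my suggestion, not from the original thread; -fno-full-laziness is module-wide and can cost you sharing elsewhere, so it is a mitigation rather than a real fix):

```haskell
{-# OPTIONS_GHC -fno-full-laziness #-}
module Main where

-- With full laziness disabled, GHC leaves `count (n + 1)` inside
-- the lambda, so c stays the small closure return () >>= \_ -> BODY
-- instead of a floated-out, retained `let body = ...`.
count :: Monad m => Int -> m ()
count 1000000 = return ()
count n = return () >>= \_ -> count (n + 1)

main :: IO ()
main = let c = count 0 in c >> c >> putStrLn "ok"
```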