Haskell doesn't feature explicit memory management, and all objects are passed by value, so there's no obvious reference counting or garbage collection either. How does a Haskell compiler typically decide whether to generate code that allocates on the stack versus code that allocates on the heap for a given variable? Will it consistently heap or stack allocate the same variables across different call sites for the same function? And when it allocates, how does it decide when to free memory? Are stack allocations and deallocations still performed in the same function entrance/exit pattern as in C?
By default, Haskell values are represented "boxed": as a pointer to an object on the heap. Unboxed values, on the other hand, are represented as the raw data itself. In other languages these would be called reference types and value types.
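For example, GHC's Int is a boxed wrapper around the raw machine integer Int#. A minimal runnable sketch using GHC's unboxed primitives (the function name addRaw is mine, but Int#, I# and (+#) really are exported from GHC.Exts):

    {-# LANGUAGE MagicHash #-}

    import GHC.Exts (Int (I#), Int#, (+#))

    -- Int is an ordinary boxed type: a heap object wrapping a raw Int#.
    -- In GHC it is defined (roughly) as:  data Int = I# Int#

    -- Add the raw machine integers directly; no heap boxes involved.
    addRaw :: Int# -> Int# -> Int#
    addRaw x y = x +# y

    main :: IO ()
    main = case (3 :: Int, 4 :: Int) of
      (I# x, I# y) -> print (I# (addRaw x y))  -- prints 7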
As general background: the stack mainly holds call frames, i.e. return information and local variables, and is managed in LIFO order, so stack allocation and deallocation follow function entry and exit. Heap memory is allocated and freed dynamically, in any order and at any time, and a heap block stays alive until it is freed (or, under garbage collection, until it becomes unreachable) or the program terminates. Stack memory is contiguous and a frame's size is fixed once allocated; heap blocks can be requested in any size, can be resized, and can be reached from anywhere a pointer to them exists. Heap allocation is therefore the most flexible scheme, at the cost of having to decide when to reclaim each block.
When you call a function like this:

    f 42 (g x y)
then the runtime behaviour is something like the following:
    p1    = malloc(2 * sizeof(Word))
    p1[0] = &Tag_for_Int
    p1[1] = 42
    p2    = malloc(3 * sizeof(Word))
    p2[0] = &Code_for_g_x_y
    p2[1] = x
    p2[2] = y
    f(p1, p2)
That is, arguments are usually passed as pointers to objects on the heap, as in Java, but unlike Java these objects may represent suspended computations, a.k.a. thunks, such as (g x y), bound to p2, in our example. Without optimisations, this execution model is quite inefficient, but there are ways to avoid much of that overhead.
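A small runnable sketch of what a thunk buys you (the names are illustrative): f never demands its second argument, so the thunk built for expensive is passed around but never evaluated:

    -- f never looks at its second argument, so the thunk for `expensive`
    -- is allocated but never forced.
    f :: Int -> Int -> Int
    f a _ = a

    expensive :: Int
    expensive = sum [1 .. 1000000]

    main :: IO ()
    main = print (f 42 expensive)  -- prints 42; the sum is never computed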
GHC does a lot of inlining and unboxing. Inlining removes the function-call overhead and often enables further optimisations. Unboxing means changing the calling convention; in the example above we could pass 42 directly instead of creating the heap object p1.
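A sketch of where this applies; the INLINE pragma merely makes the request explicit, and GHC usually performs both transformations on its own at -O:

    -- With -O, GHC typically inlines `double` at the call site and, via the
    -- worker/wrapper transformation, passes the argument as a raw Int# in a
    -- register rather than as a pointer to a heap-allocated Int.
    {-# INLINE double #-}
    double :: Int -> Int
    double x = x + x

    main :: IO ()
    main = print (double 21)  -- prints 42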
Strictness analysis works out whether an argument is guaranteed to be evaluated. In that case we don't need to create a thunk; we can evaluate the expression up front and pass the final result as the argument.
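A sketch, assuming compilation with -O: the accumulator below is exactly the kind of argument strictness analysis unboxes. The bang patterns are optional here, but they state the same guarantee explicitly in the source:

    {-# LANGUAGE BangPatterns #-}

    -- Strictness analysis can prove that `go` always evaluates `acc`, so
    -- at -O it is passed evaluated (and unboxed) instead of as a chain of
    -- thunks.
    sumTo :: Int -> Int
    sumTo n = go 0 n
      where
        go :: Int -> Int -> Int
        go !acc 0 = acc
        go !acc k = go (acc + k) (k - 1)

    main :: IO ()
    main = print (sumTo 1000000)  -- prints 500000500000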
Small objects (currently only 8-bit Chars and small Ints) are cached. That is, instead of allocating a new heap object for each such value, a pointer to a shared, statically allocated object is returned. Even when such an object does get allocated on the heap, the garbage collector will de-duplicate it later (again, only small Ints and Chars). Since these objects are immutable, this is safe.
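This sharing can be observed (though never relied upon) with the real primop reallyUnsafePtrEquality#; the samePtr helper is my own illustrative wrapper. On a typical GHC this prints True:

    {-# LANGUAGE MagicHash #-}

    import GHC.Exts (isTrue#, reallyUnsafePtrEquality#)

    -- Debugging probe, not for production code: True iff both arguments
    -- are the very same heap object.
    samePtr :: a -> a -> Bool
    samePtr x y = isTrue# (reallyUnsafePtrEquality# x y)

    main :: IO ()
    main = do
      let c1 = 'a'
          c2 = 'a'
      -- Typically True: both point into the shared table of Char closures.
      print (samePtr c1 c2)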
Limited escape analysis: for local functions, some arguments may be passed on the stack, because they are known to be dead by the time the outer function returns (see the sketch below).
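A sketch of the kind of local function this covers; in current GHC terminology, go becomes a join point, so calling it compiles to a jump rather than to entering a heap-allocated closure:

    -- `go` never escapes `firstMatch` (it is only ever tail-called), so
    -- GHC compiles it as a join point: its arguments live in registers or
    -- on the stack, and no closure is allocated for it on the heap.
    firstMatch :: (Int -> Bool) -> [Int] -> Maybe Int
    firstMatch p xs0 = go xs0
      where
        go []       = Nothing
        go (x : xs)
          | p x       = Just x
          | otherwise = go xs

    main :: IO ()
    main = print (firstMatch even [1, 3, 4, 5])  -- prints Just 4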
Edit: For (much) more information see "Implementing Lazy Functional Languages on Stock Hardware: The Spineless Tagless G-machine". That paper uses the "push/enter" calling convention; newer versions of GHC use "eval/apply" instead. For a discussion of the trade-offs and the reasons for that switch, see "How to make a fast curry: push/enter vs eval/apply".