Compiling Tail-Call Optimization In Mutual Recursion Across C and Haskell

Tags:

I'm experimenting with the foreign-function interface in Haskell. I wanted to implement a simple test to see if I could do mutual recursion. So, I created the following Haskell code:

module MutualRecursion where
import Data.Int

foreign import ccall countdownC::Int32->IO ()
foreign export ccall countdownHaskell::Int32->IO()

countdownHaskell::Int32->IO()
countdownHaskell n = print n >> if n > 0 then countdownC (pred n) else return ()

Note that the recursive case is a call to countdownC, so this should be tail-recursive.

In my C code, I have

#include <stdio.h>

#include "MutualRecursionHaskell_stub.h"

void countdownC(int count)
{
    printf("%d\n", count);
    if(count > 0)
        return countdownHaskell(count-1);
}

int main(int argc, char* argv[])
{
    hs_init(&argc, &argv);

    countdownHaskell(10000);

    hs_exit();
    return 0;
}

Which is likewise tail recursive. So then I make a

MutualRecursion: MutualRecursionHaskell_stub
    ghc -O2 -no-hs-main MutualRecursionC.c MutualRecursionHaskell.o -o MutualRecursion
MutualRecursionHaskell_stub:
    ghc -O2 -c MutualRecursionHaskell.hs

and compile with make MutualRecursion.

And... upon running, it segfaults after printing 8991. Just as a test to make sure gcc itself can handle tco in mutual recursion, I did

void countdownC2(int);

void countdownC(int count)
{
    printf("%d\n", count);
    if(count > 0)
        return countdownC2(count-1);
}

void countdownC2(int count)
{
    printf("%d\n", count);
    if(count > 0)
        return countdownC(count-1);
}

and that worked quite fine. It also works in the single-recursion case of just in C and just in Haskell.

So my question is, is there a way to indicate to GHC that the call to the external C function is tail recursive? I'm assuming that the stack frame does come from the call from Haskell to C and not the other way around, since the C code is very clearly a return of a function call.

258

asked Nov 03 '15 18:11

Crazycolorz5

1 Answers

I believe cross-language C-Haskell tail calls are very, very hard to achieve.

I do not know the exact details, but the C runtime and the Haskell runtime are vastly different. The main factors for this difference, as far as I can see, are:

different paradigm: purely functional vs imperative
garbage collection vs manual memory management
lazy semantics vs strict one

The kinds of optimizations which are likely to survive across language boundaries given such differences are next to zero. Perhaps, in theory, one could invent an ad hoc C runtime together with a Haskell runtime so that some optimizations are feasible, but GHC and GCC were not designed in this way.

Just to show an example of the potential differences, assume we have the following Haskell code

p :: Int -> Bool
p x = x==42

main = if p 42
       then putStrLn "A"     -- A
       else putStrLn "B"     -- B

A possible implementation of the main could be the following:

push the address of A on the stack
push the address of B on the stack
push 42 on the stack
jump to p
A: print "A", jump to end
B: print "B", jump to end

while p is implemented as follows:

p: pop x from the stack
pop b from stack
pop a from stack
test x against 42
if equal, jump to a
jump to b

Note how p is invoked with two return addresses, one for each possible result. This is different from C, whose standard implementations use only one return address. When crossing boundaries the compiler must account for this difference and compensate.

Above I also did not account for the case when the argument of p is a thunk, to keep it simple. The GHC allocator can also trigger garbage collection.

Note that the above fictional implementation was actually used in the past by GHC (the so called "push/enter" STG machine). Even if now it is no longer in use, the "eval/apply" STG machine is only marginally closer to the C runtime. I'm not even sure about GHC using the regular C stack: I think it does not, using its own one.

You can check the GHC developer wiki to see the gory details.

128

answered Sep 24 '22 14:09

chi

Related questions
                            
                                Why do certain Linux x86_64 system calls require a stub?
                            
                                Why does -Wunused-variable in GCC produce an error even on static const?
                            
                                C Programming Language Example About Arrays
                            
                                Is __attribute__((constructor)) guaranteed to be called exactly once?
                            
                                How to read the relocation records of an object file
                            
                                Linux Kernel: Static Const vs #Define
                            
                                What signals shouldn't be blocked by thread in multi-threaded program in C?
                            
                                How can I accept multiple clients to a TCP server?
                            
                                how to do a relative jump/call with inline assembly in GCC (x86_64)
                            
                                Does C standard mandate that platforms must not define behaviors beyond those given in standard
                            
                                How to transpose a matrix in C? - error
                            
                                Can exit() from a library be replaced with throwing an exception? [duplicate]
                            
                                How can the offset for a JVM jump instruction be 32768?
                            
                                CMake cross-compile with specific linker doesn't pass arguments to armlink
                            
                                sending fake keypress event to a window using xlib
                            
                                Unexpected performance with global variables
                            
                                Calling Swift from C
                            
                                Is it possible to dump the core but not exit the process?
                            
                                How to reduce code bloat in C from error handling & debugs [duplicate]
                            
                                C Disk I/O - write after read at the same offset of a file will make read throughput very low

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Compiling Tail-Call Optimization In Mutual Recursion Across C and Haskell

Tags:

c

haskell

tail-call-optimization

multiple-languages

mutual-recursion

Crazycolorz5

People also ask

1 Answers

chi

Recent Activity

Donate For Us