Very simply, what is tail-call optimization? More specifically, what are some small code snippets where it could be applied, and where not, with an explanation of why?

Let's walk through a simple example: the factorial function implemented in C. We start with the obvious recursive definition <pre class="prettyprint lang-c prettyprint-override"><code>unsigned fac(unsigned n) { if (n < 2) return 1; return n * fac(n - 1); } </code></pre> A function ends with a tail call if the last operation before the function returns is another function call. If this call invokes the same function, it is tail-recursive. Even though <code>fac()</code> looks tail-recursive at first glance, it is not as what actually happens is <pre class="prettyprint lang-c prettyprint-override"><code>unsigned fac(unsigned n) { if (n < 2) return 1; unsigned acc = fac(n - 1); return n * acc; } </code></pre> ie the last operation is the multiplication and not the function call. However, it's possible to rewrite <code>fac()</code> to be tail-recursive by passing the accumulated value down the call chain as an additional argument and passing only the final result up again as the return value: <pre class="prettyprint lang-c prettyprint-override"><code>unsigned fac(unsigned n) { return fac_tailrec(1, n); } unsigned fac_tailrec(unsigned acc, unsigned n) { if (n < 2) return acc; return fac_tailrec(n * acc, n - 1); } </code></pre> Now, why is this useful? Because we immediately return after the tail call, we can discard the previous stackframe before invoking the function in tail position, or, in case of recursive functions, reuse the stackframe as-is. The tail-call optimization transforms our recursive code into <pre class="prettyprint lang-c prettyprint-override"><code>unsigned fac_tailrec(unsigned acc, unsigned n) { TOP: if (n < 2) return acc; acc = n * acc; n = n - 1; goto TOP; } </code></pre> This can be inlined into <code>fac()</code> and we arrive at <pre class="prettyprint lang-c prettyprint-override"><code>unsigned fac(unsigned n) { unsigned acc = 1; TOP: if (n < 2) return acc; acc = n * acc; n = n - 1; goto TOP; } </code></pre> which is equivalent to <pre class="prettyprint lang-c prettyprint-override"><code>unsigned fac(unsigned n) { unsigned acc = 1; for (; n > 1; --n) acc *= n; return acc; } </code></pre> As we can see here, a sufficiently advanced optimizer can replace tail-recursion with iteration, which is far more efficient as you avoid function call overhead and only use a constant amount of stack space.

What is tail call optimization?

2 Answers

Tail-call optimization is where you are able to avoid allocating a new stack frame for a function because the calling function will simply return the value that it gets from the called function. The most common use is tail-recursion, where a recursive function written to take advantage of tail-call optimization can use constant stack space.

Scheme is one of the few programming languages that guarantee in the spec that any implementation must provide this optimization, so here are two examples of the factorial function in Scheme:

(define (fact x)   (if (= x 0) 1       (* x (fact (- x 1)))))  (define (fact x)   (define (fact-tail x accum)     (if (= x 0) accum         (fact-tail (- x 1) (* x accum))))   (fact-tail x 1))

The first function is not tail recursive because when the recursive call is made, the function needs to keep track of the multiplication it needs to do with the result after the call returns. As such, the stack looks as follows:

(fact 3) (* 3 (fact 2)) (* 3 (* 2 (fact 1))) (* 3 (* 2 (* 1 (fact 0)))) (* 3 (* 2 (* 1 1))) (* 3 (* 2 1)) (* 3 2) 6

In contrast, the stack trace for the tail recursive factorial looks as follows:

(fact 3) (fact-tail 3 1) (fact-tail 2 3) (fact-tail 1 6) (fact-tail 0 6) 6

As you can see, we only need to keep track of the same amount of data for every call to fact-tail because we are simply returning the value we get right through to the top. This means that even if I were to call (fact 1000000), I need only the same amount of space as (fact 3). This is not the case with the non-tail-recursive fact, and as such large values may cause a stack overflow.

104

answered Oct 12 '22 12:10

Kyle Cronin

Let's walk through a simple example: the factorial function implemented in C.

We start with the obvious recursive definition

unsigned fac(unsigned n) {     if (n < 2) return 1;     return n * fac(n - 1); }

A function ends with a tail call if the last operation before the function returns is another function call. If this call invokes the same function, it is tail-recursive.

Even though fac() looks tail-recursive at first glance, it is not as what actually happens is

unsigned fac(unsigned n) {     if (n < 2) return 1;     unsigned acc = fac(n - 1);     return n * acc; }

ie the last operation is the multiplication and not the function call.

However, it's possible to rewrite fac() to be tail-recursive by passing the accumulated value down the call chain as an additional argument and passing only the final result up again as the return value:

unsigned fac(unsigned n) {     return fac_tailrec(1, n); }  unsigned fac_tailrec(unsigned acc, unsigned n) {     if (n < 2) return acc;     return fac_tailrec(n * acc, n - 1); }

Now, why is this useful? Because we immediately return after the tail call, we can discard the previous stackframe before invoking the function in tail position, or, in case of recursive functions, reuse the stackframe as-is.

The tail-call optimization transforms our recursive code into

unsigned fac_tailrec(unsigned acc, unsigned n) { TOP:     if (n < 2) return acc;     acc = n * acc;     n = n - 1;     goto TOP; }

This can be inlined into fac() and we arrive at

unsigned fac(unsigned n) {     unsigned acc = 1;  TOP:     if (n < 2) return acc;     acc = n * acc;     n = n - 1;     goto TOP; }

which is equivalent to

unsigned fac(unsigned n) {     unsigned acc = 1;      for (; n > 1; --n)         acc *= n;      return acc; }

As we can see here, a sufficiently advanced optimizer can replace tail-recursion with iteration, which is far more efficient as you avoid function call overhead and only use a constant amount of stack space.

answered Oct 12 '22 11:10

Christoph

Related questions
                            
                                Best way to reverse a string
                            
                                Why does Java's hashCode() in String use 31 as a multiplier?
                            
                                How to replace all occurrences of a character in string?
                            
                                What is the most efficient/elegant way to parse a flat table into a tree?
                            
                                What algorithms compute directions from point A to point B on a map?
                            
                                How do you detect Credit card type based on number?
                            
                                A simple explanation of Naive Bayes Classification [closed]
                            
                                What is the most effective way for float and double comparison?
                            
                                Algorithm to return all combinations of k elements from n
                            
                                What is the difference between a generative and a discriminative algorithm? [closed]
                            
                                How to check if a number is a power of 2
                            
                                How can building a heap be O(n) time complexity?
                            
                                How do I create a URL shortener? [closed]
                            
                                Expand a random range from 1–5 to 1–7
                            
                                Generate an integer that is not among four billion given ones
                            
                                How to generate all permutations of a list?
                            
                                Sorting 1 million 8-decimal-digit numbers with 1 MB of RAM
                            
                                How do I determine whether my calculation of pi is accurate?
                            
                                Big O, how do you calculate/approximate it?
                            
                                How to count the number of set bits in a 32-bit integer?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is tail call optimization?

Tags:

language-agnostic

algorithm

recursion

tail-recursion

tail-call-optimization

majelbstoat

People also ask

2 Answers

Kyle Cronin

Christoph

Recent Activity

Donate For Us