The implementation of <code>Integer>>#factorial</code> in Pharo is: <pre class="prettyprint"><code>factorial "Answer the factorial of the receiver." self = 0 ifTrue: [^ 1]. self > 0 ifTrue: [^ self * (self - 1) factorial]. self error: 'Not valid for negative integers' </code></pre> This a tail-recursive definition. However, I can evaluate <code>10000 factorial</code> without error in the workspace. Does Pharo perform tail-call optimisation in any circumstances, is it doing some other optimisation, or is it just using a really deep stack?

There is no mystery in the execution model of Pharo. The recursive fragment <pre class="prettyprint"><code>^ self * (self - 1) factorial </code></pre> that happens inside the second <code>ifTrue:</code> compiles to the following sequence of bytecodes: <pre class="prettyprint"><code>39 <70> self ; receiver of outer message * 40 <70> self ; receiver of inner message - 41 <76> pushConstant: 1 ; argument of self - 1 42 <B1> send: - ; subtract 43 <D0> send: factorial ; send factorial (nothing special here!) 44 <B8> send: * ; multiply 45 <7C> returnTop ; return </code></pre> Note that in line 43 nothing special happens. The code just sends <code>factorial</code> in the same way it would, had the selector been any other. In particular we can see that there is no special manipulation of the stack here. This doesn't mean that there cannot be optimizations in the underlying native code. But that is a different discussion. It is the execution model the one that matters to the programmer because any optimization underneath bytecodes is meant to support this model at the conceptual level. UPDATE Interestingly, the non-recursive version <pre class="prettyprint"><code>factorial2 | f | f := 1. 2 to: self do: [:i | f := f * i]. ^f </code></pre> is a little bit slower that the recursive one (Pharo). The reason must be that the overhead associated to increasing <code>i</code> is a little bit greater than the recursive send mechanism. Here are the expressions I tried: <pre class="prettyprint"><code>[25000 factorial] timeToRun [25000 factorial2] timeToRun </code></pre>

Does Pharo provide tail-call optimisation?

Tags:

tail-recursion

smalltalk

pharo

The implementation of Integer>>#factorial in Pharo is:

factorial
        "Answer the factorial of the receiver."

        self = 0 ifTrue: [^ 1].
        self > 0 ifTrue: [^ self * (self - 1) factorial].
        self error: 'Not valid for negative integers'

This a tail-recursive definition. However, I can evaluate 10000 factorial without error in the workspace.

Does Pharo perform tail-call optimisation in any circumstances, is it doing some other optimisation, or is it just using a really deep stack?

707

asked May 07 '16 14:05

Wilfred Hughes

1 Answers

There is no mystery in the execution model of Pharo. The recursive fragment

^ self * (self - 1) factorial

that happens inside the second ifTrue: compiles to the following sequence of bytecodes:

39 <70> self                  ; receiver of outer message *
40 <70> self                  ; receiver of inner message -
41 <76> pushConstant: 1       ; argument of self - 1
42 <B1> send: -               ; subtract
43 <D0> send: factorial       ; send factorial (nothing special here!) 
44 <B8> send: *               ; multiply
45 <7C> returnTop             ; return

Note that in line 43 nothing special happens. The code just sends factorial in the same way it would, had the selector been any other. In particular we can see that there is no special manipulation of the stack here.

This doesn't mean that there cannot be optimizations in the underlying native code. But that is a different discussion. It is the execution model the one that matters to the programmer because any optimization underneath bytecodes is meant to support this model at the conceptual level.

UPDATE

Interestingly, the non-recursive version

factorial2
  | f |
  f := 1.
  2 to: self do: [:i | f := f * i].
  ^f

is a little bit slower that the recursive one (Pharo). The reason must be that the overhead associated to increasing i is a little bit greater than the recursive send mechanism.

Here are the expressions I tried:

[25000 factorial] timeToRun
[25000 factorial2] timeToRun

140

answered Oct 15 '22 06:10

Leandro Caniglia

Related questions
                            
                                Smalltalk superclass vs metaclass?
                            
                                What's the difference of Squeak/Pharo/Newspeak Smalltalk VMs?
                            
                                Smalltalk - Compare two strings for equality
                            
                                Why does add: return the object added in Smalltalk collections?
                            
                                Could someone elaborate on Smalltalks supposedly killer toolchain?
                            
                                Porting code to Pharo 2.0
                            
                                Smalltalk Array Types
                            
                                Inconsistencies in smalltalk
                            
                                Capture string in regex replacement
                            
                                Where is the Smalltalk Archive gone?
                            
                                What is the difficulty in making Smalltalk parallel?
                            
                                Are Traits good or bad?
                            
                                How to copy a Monticello package to another repository under a different name with Gofer
                            
                                smalltalk singleton pattern: how do I initialize the instance variables?
                            
                                Which innovations (like MVC, xunit, Hotspot) did Smalltalk bring?
                            
                                Where could I find more examples of using PetitParser? [closed]
                            
                                Are continuations a key feature in Seaside?
                            
                                Is it possible to deploy a pharo image without .changes and .sources files
                            
                                What is the Smalltalk equivalent of Java's static?
                            
                                Iterate a collection backwards in Pharo Smalltalk

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With