Scheme: Procedures that return another inner procedure

Tags: functional-programming, lisp, scheme, sicp

This is from the SICP book, which I am sure many of you are familiar with. It is an early example in the book, but I feel it is an extremely important concept that I just can't get my head around yet. Here it is:

(define (cons x y)
  (define (dispatch m)
    (cond ((= m 0) x)
          ((= m 1) y)
          (else (error "Argument not 0 or 1 - CONS" m))))
  dispatch)
(define (car z) (z 0))
(define (cdr z) (z 1))

So here I understand that car and cdr are being defined within the scope of cons, and I get that they apply some argument z to 0 and 1 respectively (argument z being some cons). But say I call (cons 3 4)... how are the arguments 3 and 4 evaluated, when we immediately go into this inner procedure dispatch, which takes some argument m that we have not specified yet? And, maybe more importantly, what is the point of returning dispatch? I don't really get that part at all. Any help is appreciated, thanks!

asked Sep 19 '12 14:09

Houdini

2 Answers

This is one of the weirder (and possibly one of the more wonderful) examples of exploiting first-class functions in Scheme. Something similar is also in The Little Schemer, which is where I first saw it, and I remember scratching my head for days over it. Let me see if I can explain it in a way that makes sense, and I apologize if it's not clear.

I assume you understand the primitives cons, car, and cdr as they are implemented in Scheme already, but just to remind you: cons constructs a pair, car selects the first component of the pair and returns it, and cdr selects the second component and returns it. Here's a simple example of using these functions:

> (cons 1 2)
(1 . 2)
> (car (cons 1 2))
1
> (cdr (cons 1 2))
2

The version of cons, car, and cdr that you've pasted should behave exactly the same way. I'll try to show you how.

First of all, car and cdr are not defined within the scope of cons. In your snippet of code, all three (cons, car, and cdr) are defined at the top-level. The function dispatch is the only one that is defined inside cons.

The function cons takes two arguments and returns a function of one argument. What's important about this is that those two arguments are visible to the inner function dispatch, which is what is being returned. I'll get to that in a moment.
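To make that concrete, here is a minimal sketch of the same code under different names (my-cons, my-car, and my-cdr are names I made up so that the built-in cons, car, and cdr stay untouched). The arguments 3 and 4 are evaluated when my-cons is called and bound to x and y; dispatch is only created at that point, not called, so m does not need a value yet. Returning dispatch hands the caller a procedure that still remembers x and y.

```scheme
;; my-cons / my-car / my-cdr are hypothetical names used only in this sketch,
;; chosen so the built-in cons, car, and cdr are not redefined.
(define (my-cons x y)
  (define (dispatch m)
    (cond ((= m 0) x)
          ((= m 1) y)
          (else (error "Argument not 0 or 1 - CONS" m))))
  dispatch)   ; return the procedure itself, not the result of calling it

(define (my-car z) (z 0))
(define (my-cdr z) (z 1))

;; Each call to my-cons creates a fresh dispatch with its own x and y.
(define p (my-cons 3 4))
(define q (my-cons 'a 'b))

(my-car p)  ; 3
(my-cdr p)  ; 4
(my-car q)  ; a
(my-cdr q)  ; b
```

Note that p and q do not interfere with each other: each one is a separate procedure closed over its own pair of values.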

As I said in my reminder, cons constructs a pair. This version of cons should do the same thing, but instead it's returning a function! That's ok, we don't really care how the pair is implemented or laid out in memory, so long as we can get at the first and second components.

So with this new function-based pair, we need to be able to call car, pass the pair as an argument, and get the first component. In the definition of car, this argument is called z. If you were to execute the same REPL session I had above with these new cons, car, and cdr functions, the argument z in car would be bound to the function-based pair, which is what cons returns, which is dispatch. It's confusing, but just think it through carefully and you'll see.

Based on the implementation of car, it takes a function of one argument and applies it to the number 0. So it's applying dispatch to 0, and as you can see from the definition of dispatch, that's what we want. The cond inside there compares m with 0 and 1 and returns either x or y. In this case, it returns x, which is the first argument to cons, in other words the first component of the pair! So car selects the first component, just as the normal primitive does in Scheme.

If you follow this same logic for cdr, you'll see that it behaves almost the same way, but returns the second argument to cons, y, which is the second component of the pair.

There are a couple of things that might help you understand this better. One is to go back to the description of the substitution model of evaluation in Chapter 1. If you carefully and meticulously follow that substitution model for some very simple example of using these functions, you'll see that they work.
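As a rough sketch of what that substitution looks like for one small expression (again with the made-up names my-cons and my-car, so the built-ins are left alone), the steps are written out as comments below. The notation <dispatch with x = 3, y = 4> is just my shorthand for the procedure that my-cons returns.

```scheme
;; my-cons and my-car are hypothetical names used only in this sketch.
(define (my-cons x y)
  (define (dispatch m)
    (cond ((= m 0) x)
          ((= m 1) y)
          (else (error "Argument not 0 or 1 - CONS" m))))
  dispatch)

(define (my-car z) (z 0))

;; Substitution steps for (my-car (my-cons 3 4)):
;;
;;   (my-car (my-cons 3 4))
;;   (my-car <dispatch with x = 3, y = 4>)       ; evaluate the argument first
;;   (<dispatch with x = 3, y = 4> 0)            ; body of my-car, z replaced
;;   (cond ((= 0 0) 3) ((= 0 1) 4) (else ...))   ; body of dispatch, m, x, y replaced
;;   3                                           ; the first clause is true

(my-car (my-cons 3 4))  ; 3
```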

Another way, which is less tedious, is to try playing with the dispatch function directly at the REPL. Below, the variable p is defined to refer to the dispatch function returned by cons.

> (define p (cons 1 2))
#<function> ;; what the REPL prints here will be implementation specific
> (p 0)
1
> (p 1)
2

answered Sep 25 '22 02:09

michiakig

The code in the question shows how to redefine the primitive procedure cons that creates a cons-cell (a pair of two elements: the car and the cdr), using only closures and message-dispatching.

The dispatch procedure acts as a selector for the arguments passed to cons: x and y. If the message 0 is received, then the first argument of cons is returned (the car of the cell). Likewise, if 1 is received, then the second argument of cons is returned (the cdr of the cell). Both arguments are stored inside the closure defined implicitly for the dispatch procedure, a closure that captures x and y and is returned as the product of invoking this procedural implementation of cons.
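The messages do not have to be numbers. As a minimal sketch (make-pair, first-of, and second-of are names invented here, not taken from the book), the same idea can be written with symbols as messages and an anonymous lambda in place of the named dispatch procedure:

```scheme
;; make-pair, first-of, and second-of are hypothetical names for this sketch.
(define (make-pair x y)
  ;; The returned lambda closes over x and y, exactly as dispatch does.
  (lambda (message)
    (cond ((eq? message 'first) x)
          ((eq? message 'second) y)
          (else (error "Unknown message - MAKE-PAIR" message)))))

(define (first-of p) (p 'first))
(define (second-of p) (p 'second))

(define p (make-pair 'left 'right))

(first-of p)   ; left
(second-of p)  ; right
```

Whether the message is 0 or a symbol, the closure is what holds the data; the message only selects which captured value to hand back.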

The next redefinitions of car and cdr build on this: car is implemented as a procedure that passes 0 to a closure as returned in the above definition, and cdr is implemented as a procedure that passes 1 to the closure, in each case ultimately returning the original value that was passed as x and y respectively.

The really nice part of this example is that it shows that the cons-cell, the most basic unit of data in a Lisp system, can be defined as a procedure, therefore blurring the distinction between data and procedure.
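To illustrate, here is a small sketch that builds a list purely out of these procedural pairs and walks it. The names my-cons, my-car, my-cdr, and my-length are assumptions of this sketch, chosen to avoid redefining the built-ins; the built-in empty list '() is still used to mark the end.

```scheme
;; All my-* names are hypothetical and used only in this sketch.
(define (my-cons x y)
  (define (dispatch m)
    (cond ((= m 0) x)
          ((= m 1) y)
          (else (error "Argument not 0 or 1 - CONS" m))))
  dispatch)

(define (my-car z) (z 0))
(define (my-cdr z) (z 1))

;; A three-element list in which every cell is a procedure.
(define lst (my-cons 1 (my-cons 2 (my-cons 3 '()))))

;; Count the cells by following my-cdr until the empty list is reached.
(define (my-length z)
  (if (null? z)
      0
      (+ 1 (my-length (my-cdr z)))))

(my-length lst)        ; 3
(my-car (my-cdr lst))  ; 2
```

Code that only goes through my-car and my-cdr cannot tell that the cells are procedures rather than built-in pairs.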

answered Sep 24 '22 02:09

Óscar López
