int * int vs (int * int) in OCaml sum type

Tags:

ocaml

type foo = A of int * int | B of (int * int)

What is the difference between int * int and (int * int) there? The only difference I see is in pattern matching:

let test_foo = function
  | A (f, s) -> (f, s)
  | B b -> b

Is it just syntactic sugar? How do you select which one to use? Is there any performance difference between these two forms?

asked Feb 11 '13 18:02

Stas

2 Answers

Yes, there is a performance difference:

In memory, A (23, 42) is a single block containing a tag identifying it as an A and the two integers 23 and 42. B (23, 42) is a block containing a tag identifying it as a B and a pointer to a separate tuple containing the integers 23 and 42. Creating a B therefore costs one additional memory allocation, and accessing the individual values inside a B costs one additional level of indirection. In cases where you don't actually use the constructor arguments as a tuple, using A involves less overhead than using B.
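To see the layout described above, here is a minimal sketch that inspects both values with the Obj module. Obj is unsafe and implementation-specific, so treat this as a diagnostic only; the helper name shape is made up for illustration, and the numbers reflect the standard OCaml runtime representation.

```ocaml
(* Same type as in the question. *)
type foo = A of int * int | B of (int * int)

(* Hypothetical helper: (number of fields, tag) of the heap block behind v.
   Obj is an unsafe, implementation-specific module; inspection only. *)
let shape (v : foo) : int * int =
  let r = Obj.repr v in
  (Obj.size r, Obj.tag r)

let a_shape = shape (A (23, 42))  (* two fields: the ints are stored inline *)
let b_shape = shape (B (23, 42))  (* one field: a pointer to the tuple *)

(* The single field of a B is itself a block with two fields (the tuple). *)
let b_inner_size = Obj.size (Obj.field (Obj.repr (B (23, 42))) 0)

let () =
  Printf.printf "A: size=%d tag=%d\n" (fst a_shape) (snd a_shape);
  Printf.printf "B: size=%d tag=%d, inner tuple size=%d\n"
    (fst b_shape) (snd b_shape) b_inner_size
```

With the standard runtime, A shows up as one block of size 2, while B is a block of size 1 whose only field points at a second block of size 2.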

On the other hand, your test_foo function creates a new tuple every time it is called with an A value, but when it is called with a B value it simply returns the tuple that already exists in memory. test_foo is therefore a cheaper operation for B than it is for A. If you'll be using the constructor's arguments as a tuple, and you will do so multiple times for the same value, using B will be cheaper.
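One way to observe this is with physical equality (==), which asks whether two values are the same block in memory. This is only a sketch: the behavior of == on immutable values is implementation-dependent, so the results below describe what the standard compiler typically does rather than a guarantee. The names pair, a and b are illustrative.

```ocaml
type foo = A of int * int | B of (int * int)

let test_foo = function
  | A (f, s) -> (f, s)   (* builds a fresh tuple from the two fields *)
  | B b -> b             (* returns the tuple already stored in the value *)

(* Sys.opaque_identity keeps the optimizer from folding these constants. *)
let pair = Sys.opaque_identity (23, 42)
let b = B pair
let a = Sys.opaque_identity (A (23, 42))

(* == is physical equality: do both sides point at the same block? *)
let b_shares_tuple = test_foo b == pair
(* = is structural equality: the A branch still returns an equal pair. *)
let a_result_is_equal = test_foo a = (23, 42)
(* Two calls on the same A value typically yield two distinct blocks. *)
let a_shares_tuple = test_foo a == test_foo a

let () =
  Printf.printf "B returns the stored tuple: %b\n" b_shares_tuple;
  Printf.printf "A returns the same block twice: %b\n" a_shares_tuple
```

If the second line prints false, each call on the A value allocated its own tuple, which is the extra cost the answer refers to.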

In short: if you're going to be using the constructor arguments as a tuple, it makes sense to use a constructor taking a tuple, because you can get at the tuple using pattern matching with less code and because you avoid creating tuples from the same value multiple times. In all other cases, not using a tuple is preferable because it involves less memory allocation and less indirection.

answered Sep 27 '22 17:09

sepp2k

As already said, the constructor A takes two int arguments, whereas the constructor B takes a single argument that is an ordered pair.

So you can write

let bar = A (1, 2)

or

let bar = B (1, 2)

or

let bar = (1, 2)
let baz = B bar

but you cannot write

let bar = (1, 2)
let baz = A bar

Moreover, in your pattern matching, you can still match the contents of B as two ints, but you cannot bind the contents of A to a single variable as an ordered pair:

let test_foo = function
  | A a -> a (* wrong *)
  | B (f, s) -> (f, s) (* ok *)
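If you do have a pair and need an A, or have an A and need a pair, you can take it apart and rebuild it explicitly. A small sketch; the helper name a_of_pair is hypothetical and not part of the answers above.

```ocaml
type foo = A of int * int | B of (int * int)

(* Hypothetical helper: build an A from an existing pair by
   destructuring the pair first. *)
let a_of_pair ((x, y) : int * int) : foo = A (x, y)

(* Version of the match that type-checks: A is matched with two
   sub-patterns and the pair is rebuilt on the right-hand side. *)
let test_foo = function
  | A (f, s) -> (f, s)
  | B (f, s) -> (f, s)

let bar = (1, 2)
let baz = a_of_pair bar   (* works, unlike A bar *)
```

Note that rebuilding the pair in the A branch is exactly the extra allocation discussed in the other answer.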

answered Sep 27 '22 16:09

Benoît Guédas
