I'm currently designing a numerical algorithm which as part of its operations requires updating a vector of <code>doubles</code> many times. Due to the fact that the algorithm has to be as space and time efficient as possible, I do not want to code the traditional type of FP code which creates many versions of the data structure under the hood after each operation on it. Neither do I want to create mutable data structures and have them globally available. Consequently, I have decided to use a mutable data structure but then choose to do the the mutable operations in a <code>State</code> monad. Since this is my first stab at using a <code>State</code> monad, I want to confirm the whether I have or have not <ol> <li>preserved referential transparency</li> <li>maintained functional purity</li> </ol> The <code>update</code> function transitions the data structure state. Since the destructive update is localised within this function and a handle to the data structure cannot be got at from outside, I think this function is pure and referentially transparent. <pre class="prettyprint"><code>def update(i : Int,d : Double) = State[ArraySeq[Double], Unit]{ case xs: ArraySeq[Double] => {xs(i) = d; (xs, ())} } </code></pre> The <code>app</code> function is a toy function which will consume a sequence of <code>double</code>s and modify it's state: <pre class="prettyprint"><code>def app : State[ArraySeq[Double], Unit] = for{ _ <- update(0, 3.142) // do a heap of stuff on ArraySeq }yield() </code></pre> Call: <pre class="prettyprint"><code>app(Vector(0.0, 1.0, 2.0, 3.0, 4.0).to[ArraySeq])._1.to[Vector] </code></pre> Result: <pre class="prettyprint"><code>res0: Vector[Double] = Vector(3.142, 1.0, 2.0, 3.0, 4.0) </code></pre>

I guess you could say that your <code>update</code> itself is pure, in the sense that it only represents some mutation, but as soon as you run it all bets are off: <pre class="prettyprint"><code>scala> val xs = List(1.0, 2.0, 3.0).to[ArraySeq] xs: scala.collection.mutable.ArraySeq[Double] = ArraySeq(1.0, 2.0, 3.0) scala> update(0, 10).eval(xs) res0: scalaz.Id.Id[Unit] = () scala> xs res1: scala.collection.mutable.ArraySeq[Double] = ArraySeq(10.0, 2.0, 3.0) </code></pre> This is a bad scene, and it's the opposite of pure or referentially transparent. <code>State</code> isn't really buying you anything in your example—the fact that you're calling <code>app</code> in such a way that you have an <code>ArraySeq</code> that nobody else can mutate is. You might as well bite the bullet and work with a mutable data structure in the usual way in a scope that you control—i.e., write <code>app</code> like this: <pre class="prettyprint"><code>def app(xs: Vector[Double]): Vector[Double] = { val arr = xs.to[ArraySeq] // Perform all your updates in the usual way arr.toVector } </code></pre> This actually is pure and referentially transparent, but it's also more honest than the <code>State</code> version. If I see a value of type <code>State[Foo, Unit]</code>, my assumption is going to be that this value represents some kind of operation that changes a <code>Foo</code> into a new <code>Foo</code>, without mutating the original <code>Foo</code>. This is all the state monad is—it provides a nice way of modeling operations on immutable data structures and composing them in a way that looks kind of like mutation. If you mix it with actual mutation you're likely to confuse the hell out of anyone using your code. If you really want real mutation and purity at the same time, you can look at Scalaz's <code>STArray</code>. It's a very clever solution to this problem, and in languages like Haskell it's an approach that makes a lot of sense. My own feeling is that it's pretty much always the wrong solution in Scala, though. If you really need the performance of a mutable array, just use a local mutable array and make sure you don't leak it to the outside world. If you don't need that kind of performance (and most of the time you don't), use something like <code>State</code>.

Purity, Referential Transparency and State Monad

Tags:

functional-programming

scala

I'm currently designing a numerical algorithm which as part of its operations requires updating a vector of doubles many times. Due to the fact that the algorithm has to be as space and time efficient as possible, I do not want to code the traditional type of FP code which creates many versions of the data structure under the hood after each operation on it. Neither do I want to create mutable data structures and have them globally available. Consequently, I have decided to use a mutable data structure but then choose to do the the mutable operations in a State monad. Since this is my first stab at using a State monad, I want to confirm the whether I have or have not

preserved referential transparency
maintained functional purity

The update function transitions the data structure state. Since the destructive update is localised within this function and a handle to the data structure cannot be got at from outside, I think this function is pure and referentially transparent.

def update(i : Int,d : Double) = State[ArraySeq[Double], Unit]{
  case xs: ArraySeq[Double] => {xs(i) = d; (xs, ())}
}

The app function is a toy function which will consume a sequence of doubles and modify it's state:

def app : State[ArraySeq[Double], Unit] = for{
    _ <- update(0, 3.142)
  // do a heap of stuff on ArraySeq
}yield()

Call:

app(Vector(0.0, 1.0, 2.0, 3.0, 4.0).to[ArraySeq])._1.to[Vector]

Result:

res0: Vector[Double] = Vector(3.142, 1.0, 2.0, 3.0, 4.0)

602

asked Jun 01 '15 23:06

M.K.

1 Answers

I guess you could say that your update itself is pure, in the sense that it only represents some mutation, but as soon as you run it all bets are off:

scala> val xs = List(1.0, 2.0, 3.0).to[ArraySeq]
xs: scala.collection.mutable.ArraySeq[Double] = ArraySeq(1.0, 2.0, 3.0)

scala> update(0, 10).eval(xs)
res0: scalaz.Id.Id[Unit] = ()

scala> xs
res1: scala.collection.mutable.ArraySeq[Double] = ArraySeq(10.0, 2.0, 3.0)

This is a bad scene, and it's the opposite of pure or referentially transparent.

State isn't really buying you anything in your example—the fact that you're calling app in such a way that you have an ArraySeq that nobody else can mutate is. You might as well bite the bullet and work with a mutable data structure in the usual way in a scope that you control—i.e., write app like this:

def app(xs: Vector[Double]): Vector[Double] = {
  val arr = xs.to[ArraySeq]
  // Perform all your updates in the usual way
  arr.toVector
}

This actually is pure and referentially transparent, but it's also more honest than the State version. If I see a value of type State[Foo, Unit], my assumption is going to be that this value represents some kind of operation that changes a Foo into a new Foo, without mutating the original Foo. This is all the state monad is—it provides a nice way of modeling operations on immutable data structures and composing them in a way that looks kind of like mutation. If you mix it with actual mutation you're likely to confuse the hell out of anyone using your code.

If you really want real mutation and purity at the same time, you can look at Scalaz's STArray. It's a very clever solution to this problem, and in languages like Haskell it's an approach that makes a lot of sense. My own feeling is that it's pretty much always the wrong solution in Scala, though. If you really need the performance of a mutable array, just use a local mutable array and make sure you don't leak it to the outside world. If you don't need that kind of performance (and most of the time you don't), use something like State.

182

answered Sep 28 '22 03:09

Travis Brown

Related questions
                            
                                Read JSON Tree structure in Scala Play Framework
                            
                                Create a generic Json serialization function
                            
                                How to return None in Scala
                            
                                Catching unhandled errors in Scala futures
                            
                                What are the benefits of Reader monad?
                            
                                sbt not working on amazon ec2 micro instance
                            
                                What does the _ parameter signify in this context?
                            
                                Make method actually inline
                            
                                Failing maven-build when Gatling-test has too high fail-percentage
                            
                                Difference between isInstance and isInstanceOf
                            
                                How to define a list of functions of the same arity in Scala?
                            
                                abandon calling `get` on Option and generate compile error
                            
                                Can't use a negative number in named parameters in Scala
                            
                                How to get the product of two RDDs?
                            
                                Are Futures in Scala really functional?
                            
                                Correct way to postpone messages in Akka
                            
                                Using SORM with Play Framework 2.3.8
                            
                                How to show the scheme (including type) of a parquet file from command line or spark shell?
                            
                                Why do I get conflicting cross-version in sbt on one environment but not another?
                            
                                How to convert from Map[String,Any] to (String, String)*

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With