I've been looking at the new Scala 2.9 parallel collections and am hoping to abandon a whole lot of my crufty amateur versions of similar things. In particular, I'd like to replace the fork join pool which underlies the default implementation with something of my own (for example, something that distributes evaluation of tasks across a network, via actors). My understanding is that this is simply a matter of applying Scala's paradigm of "stackable modifications", but the collections library is intimidating enough that I'm not exactly sure which bits need modifying! Some concrete questions: <ol> <li>Is it correct that the standard parallel implementations interact with the fork join pool solely through the code in <code>ForkJoinTasks</code>?</li> <li>I see that there's an alternative trait, <code>FutureThreadPoolTasks</code>. How would I build a collection which uses this trait instead of <code>ForkJoinTasks</code>?</li> <li>Can I just write yet another alternative (and perhaps a corresponding boilerplate class that mixes in <code>AdaptiveWorkStealingTasks</code> and somehow instantiate collections instances that use this new trait?</li> </ol> (For reference, all of the traits mentioned above are defined in Tasks.scala.) Especially code examples are very welcome!

Just to provide some more information on how things fit together (which I suspect you already know): the fork-join pool is "plugged in" via the <code>parallel</code> package object's <code>tasksupport</code> value which implements the <code>scala.collection.parallel.TaskSupport</code> trait. This, in turn, inherits from <code>Tasks</code> (which you mention) and defines such operations as: <pre class="prettyprint"><code>def execute[R, Tp](fjtask: Task[R, Tp]): () => R def executeAndWaitResult[R, Tp](task: Task[R, Tp]): R </code></pre> However, it's not immediately obvious to me how you can override the behaviour which is explicitly imported by the collections themselves by supplying your own <code>TaskSupport</code> implementation. For example, in <code>ParSeqLike</code> line 47: <pre class="prettyprint"><code>import tasksupport._ </code></pre> In fact,I would go so far as saying it looks like the parallelism is definitively not overridable (unless I am very much mistaken, though I often am).

Here is a document describing how to switch <code>TaskSupport</code> objects in Scala 2.10.

How do I replace the fork join pool for a Scala 2.9 parallel collection?

Tags:

parallel-processing

scala

scala-2.9

parallel-collections

I've been looking at the new Scala 2.9 parallel collections and am hoping to abandon a whole lot of my crufty amateur versions of similar things. In particular, I'd like to replace the fork join pool which underlies the default implementation with something of my own (for example, something that distributes evaluation of tasks across a network, via actors). My understanding is that this is simply a matter of applying Scala's paradigm of "stackable modifications", but the collections library is intimidating enough that I'm not exactly sure which bits need modifying!

Some concrete questions:

Is it correct that the standard parallel implementations interact with the fork join pool solely through the code in ForkJoinTasks?
I see that there's an alternative trait, FutureThreadPoolTasks. How would I build a collection which uses this trait instead of ForkJoinTasks?
Can I just write yet another alternative (and perhaps a corresponding boilerplate class that mixes in AdaptiveWorkStealingTasks and somehow instantiate collections instances that use this new trait?

(For reference, all of the traits mentioned above are defined in Tasks.scala.)

Especially code examples are very welcome!

963

asked May 18 '11 02:05

Scott Morrison

2 Answers

Just to provide some more information on how things fit together (which I suspect you already know): the fork-join pool is "plugged in" via the parallel package object's tasksupport value which implements the scala.collection.parallel.TaskSupport trait.

This, in turn, inherits from Tasks (which you mention) and defines such operations as:

def execute[R, Tp](fjtask: Task[R, Tp]): () => R

def executeAndWaitResult[R, Tp](task: Task[R, Tp]): R

However, it's not immediately obvious to me how you can override the behaviour which is explicitly imported by the collections themselves by supplying your own TaskSupport implementation. For example, in ParSeqLike line 47:

import tasksupport._

In fact,I would go so far as saying it looks like the parallelism is definitively not overridable (unless I am very much mistaken, though I often am).

122

answered Oct 21 '22 08:10

oxbow_lakes

Here is a document describing how to switch TaskSupport objects in Scala 2.10.

answered Oct 21 '22 09:10

axel22

Related questions
                            
                                What is a Singleton Type exactly?
                            
                                Mixing Scala and Java files in an Eclipse project
                            
                                Tell SBT to collect all my dependencies together
                            
                                Play Framework & JSON Web Token
                            
                                Sample of `forSome { val `?
                            
                                foldRight on infinite lazy structure
                            
                                Good examples of idiomatic scala code
                            
                                Comparing Subcut and Scaldi
                            
                                Magnet pattern and overloaded methods
                            
                                Can you recommend a good shared hosting provider for a webapp made with Lift framework with Scala? [closed]
                            
                                Is there something like AutoMapper for Scala?
                            
                                Could not find implicit value for evidence parameter of type scala.reflect.ClassManifest[T]
                            
                                Why is Akka Streams swallowing my exceptions?
                            
                                configure ant for scala
                            
                                How to get full stacktrace in SBT 0.10.0?
                            
                                Inheriting a trait twice
                            
                                Play! framework 2.0: Validate field in forms using other fields
                            
                                Scheduling a task at a fixed time of the day with Akka
                            
                                What do you call the data wrapped inside a monad?
                            
                                Hash function in spark

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With