scala.concurrent.blocking - what does it actually do?

Tags: concurrency, scala, java.util.concurrent

I have spent a while learning about Scala execution contexts, the underlying threading models, and concurrency. Can you explain how scala.concurrent.blocking is able to "adjust the runtime behavior", and in what ways it "may improve performance or avoid deadlocks", as described in the scaladoc?

In the documentation, it is presented as a means to await an API that doesn't implement Awaitable. (Should long-running computations perhaps be wrapped in it as well?)

What is it that it actually does?

Following through the source doesn't easily betray its secrets.

asked Mar 16 '15 00:03

matanster

1 Answer

blocking is meant to act as a hint to the ExecutionContext that the contained code is blocking and could lead to thread starvation. This will give the thread pool a chance to spawn new threads in order to prevent starvation. This is what is meant by "adjust the runtime behavior". It's not magic though, and won't work with every ExecutionContext.

Consider this example:

import scala.concurrent._
val ec = scala.concurrent.ExecutionContext.Implicits.global

// Start 100 Futures that each block their thread for 3 seconds.
(1 to 100) foreach { n =>
    Future {
        println("starting Future: " + n)
        blocking { Thread.sleep(3000) } // hint to the pool that this call blocks
        println("ending Future: " + n)
    }(ec)
}

This uses the default global ExecutionContext. Running the code as-is, you will notice that the 100 Futures all start almost immediately, but if you remove blocking, they only execute a few at a time (roughly one per available processor). The default ExecutionContext reacts to blocking calls that are marked as such by spawning new threads, so the blocked Futures don't starve the rest of the pool.

Now look at this example with a fixed pool of 4 threads:

import java.util.concurrent.Executors
import scala.concurrent._

// A plain fixed pool: its 4 threads are not BlockContexts.
val executorService = Executors.newFixedThreadPool(4)
val ec = ExecutionContext.fromExecutorService(executorService)

(1 to 100) foreach { n =>
    Future {
        println("starting Future: " + n)
        blocking { Thread.sleep(3000) } // the hint has no effect on this pool
        println("ending Future: " + n)
    }(ec)
}

This ExecutionContext isn't built to spawn new threads, so even with the blocking code wrapped in blocking, you can see that it still executes at most 4 Futures at a time. That is why the scaladoc says it "may improve performance or avoid deadlocks": whether it helps depends entirely on the ExecutionContext, and with this one it doesn't help at all.
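To make the difference between the two contexts measurable rather than eyeballing the println output, here is a minimal sketch that records the highest number of Futures running at once. The names PeakDemo, peakConcurrency, fixedPoolPeak and globalPeak are made up for this illustration, and the sleep time and task count are arbitrary.

```scala
import java.util.concurrent.Executors
import java.util.concurrent.atomic.AtomicInteger
import java.util.function.IntBinaryOperator
import scala.concurrent.{Await, ExecutionContext, Future, blocking}
import scala.concurrent.duration._

// Hypothetical helper object, not part of the Scala library.
object PeakDemo {
  private val maxOp = new IntBinaryOperator {
    def applyAsInt(a: Int, b: Int): Int = math.max(a, b)
  }

  // Runs `tasks` sleeping Futures on `ec` and returns the largest number
  // that were observed running at the same time.
  def peakConcurrency(tasks: Int)(implicit ec: ExecutionContext): Int = {
    val running = new AtomicInteger(0)
    val peak = new AtomicInteger(0)
    val futures = (1 to tasks).map { _ =>
      Future {
        peak.accumulateAndGet(running.incrementAndGet(), maxOp)
        try blocking { Thread.sleep(200) }
        finally running.decrementAndGet()
      }
    }
    Await.result(Future.sequence(futures), 30.seconds)
    peak.get()
  }

  // Fixed pool of 4 threads: the peak can never exceed 4.
  def fixedPoolPeak(tasks: Int = 16): Int = {
    val pool = Executors.newFixedThreadPool(4)
    try peakConcurrency(tasks)(ExecutionContext.fromExecutorService(pool))
    finally pool.shutdown()
  }

  // Global context: the blocking hint lets the pool grow past its core size.
  def globalPeak(tasks: Int = 16): Int =
    peakConcurrency(tasks)(ExecutionContext.global)

  def main(args: Array[String]): Unit =
    println(s"fixed pool peak: ${fixedPoolPeak()}, global peak: ${globalPeak()}")
}
```

On the fixed pool the reported peak is bounded by the pool size. On the global context it will typically be higher than the number of cores, though the exact figure depends on timing and on the machine.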

How does it work? In the library source, blocking executes this code:

BlockContext.current.blockOn(body)(scala.concurrent.AwaitPermission)

BlockContext.current retrieves the BlockContext for the current thread. A BlockContext is usually just a Thread with the BlockContext trait mixed in. As seen in the source, it is either stored in a ThreadLocal or, if it's not found there, pattern matched out of the current thread. If the current thread is not a BlockContext, then the DefaultBlockContext is used instead.

Next, blockOn is called on the current BlockContext. blockOn is an abstract method in BlockContext, so its implementation depends on how the ExecutionContext handles it. If we look at the implementation for DefaultBlockContext (used when the current thread is not a BlockContext), we see that blockOn does nothing special there: it simply evaluates the body. So using blocking on a non-BlockContext thread means the code is run as-is, with no side effects.

What about threads that are BlockContexts? In the global context, for instance, blockOn does quite a bit more. Digging deeper, you can see that it uses a ForkJoinPool under the hood, with the DefaultThreadFactory defined alongside it being used to spawn new threads in the ForkJoinPool. Without the implementation of blockOn from the BlockContext (the thread), the ForkJoinPool doesn't know you're blocking, and won't try to spawn more threads in response.
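The lookup and dispatch described above can be observed directly. The following is a sketch, with BlockHookDemo, countingContext, plain and hooked being names invented for the example: it installs a custom BlockContext via BlockContext.withBlockContext (which sets the ThreadLocal) and counts how often blockOn is invoked.

```scala
import java.util.concurrent.atomic.AtomicInteger
import scala.concurrent.{BlockContext, CanAwait, blocking}

// Hypothetical demo object, not part of the Scala library.
object BlockHookDemo {
  // Number of times our custom blockOn has been called.
  val hints = new AtomicInteger(0)

  // A BlockContext that only records the hint and then runs the body.
  val countingContext: BlockContext = new BlockContext {
    override def blockOn[T](thunk: => T)(implicit permission: CanAwait): T = {
      hints.incrementAndGet()
      thunk
    }
  }

  // No custom context installed: on an ordinary thread the default
  // context just evaluates the body.
  def plain(): Int = blocking { 21 * 2 }

  // With the custom context installed for this thread, blocking
  // is routed through countingContext.blockOn.
  def hooked(): Int =
    BlockContext.withBlockContext(countingContext) { blocking { 21 * 2 } }

  def main(args: Array[String]): Unit = {
    println(s"plain: ${plain()}, hints so far: ${hints.get()}")
    println(s"hooked: ${hooked()}, hints so far: ${hints.get()}")
  }
}
```

Both calls return the same value; the only difference is whether a context-specific blockOn gets a chance to react before the body runs.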
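As far as I can tell from the library versions I have read, the global context's blockOn ultimately delegates to ForkJoinPool.managedBlock, which is the standard JDK mechanism for telling a ForkJoinPool that a worker is about to block. A rough sketch of that idea, independent of Scala's ExecutionContext (ManagedBlockDemo and managed are hypothetical names, and this is not the library's actual code):

```scala
import java.util.concurrent.{Callable, ForkJoinPool}
import java.util.concurrent.ForkJoinPool.ManagedBlocker

// Hypothetical demo object illustrating the JDK mechanism only.
object ManagedBlockDemo {
  // Runs `body` inside a ManagedBlocker so that a ForkJoinPool worker
  // can be compensated for while this thread is blocked.
  def managed[T](body: => T): T = {
    var result: Option[T] = None
    ForkJoinPool.managedBlock(new ManagedBlocker {
      // Performs the blocking work; returning true means we are done.
      override def block(): Boolean = { result = Some(body); true }
      // Tells the pool whether blocking is still necessary.
      override def isReleasable(): Boolean = result.isDefined
    })
    result.get
  }

  def main(args: Array[String]): Unit = {
    val pool = new ForkJoinPool(2)
    try {
      val task = pool.submit(new Callable[Int] {
        def call(): Int = managed { Thread.sleep(100); 42 }
      })
      println(task.get())
    } finally pool.shutdown()
  }
}
```

When managed is called from a thread that is not a ForkJoinPool worker, managedBlock just runs the blocker in place, which mirrors the "nothing special happens" case described earlier.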

Scala's Await also uses blocking in its implementation.
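One way to check this without reading the source is to install a counting BlockContext and then await a Future that is not yet complete. This is only a sketch: AwaitHintDemo and hintsDuringAwait are invented names, and the promise is deliberately completed from another thread after a short delay, on the assumption that some library versions skip the hint for Futures that are already completed.

```scala
import java.util.concurrent.atomic.AtomicInteger
import scala.concurrent.{Await, BlockContext, CanAwait, Promise}
import scala.concurrent.duration._

// Hypothetical demo object, not part of the Scala library.
object AwaitHintDemo {
  // Returns the awaited value and how many times blockOn was invoked.
  def hintsDuringAwait(): (Int, Int) = {
    val hints = new AtomicInteger(0)
    val ctx = new BlockContext {
      override def blockOn[T](thunk: => T)(implicit permission: CanAwait): T = {
        hints.incrementAndGet()
        thunk
      }
    }

    // Complete the promise a little later, so Await really has to wait.
    val promise = Promise[Int]()
    val completer = new Thread(new Runnable {
      def run(): Unit = { Thread.sleep(100); promise.success(7) }
    })
    completer.start()

    val value = BlockContext.withBlockContext(ctx) {
      Await.result(promise.future, 5.seconds)
    }
    (value, hints.get())
  }

  def main(args: Array[String]): Unit = {
    val (value, hints) = hintsDuringAwait()
    println(s"value: $value, blockOn calls: $hints")
  }
}
```

A non-zero count here means Await.result went through the current BlockContext, which is why awaiting inside a Future on the global context gets the same thread-spawning treatment as an explicit blocking call.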

answered Oct 17 '22 06:10

Michael Zajac
