I understand that with threadless async there are more threads available to service inputs (e.g. a HTTP request), but I don't understand how that doesn't potentially cause cause thread starvation when the async operations complete and a thread is needed to run their continuation. Let's say we only have 3 threads <pre class="prettyprint"><code>Thread 1 | Thread 2 | Thread 3 | </code></pre> and they get blocked on long-running operations that require threads (e.g. make database query on separate db server) <pre class="prettyprint"><code>Thread 1 | --- | Start servicing request 1 | Long-running operation .................. | Thread 2 | ------------ | Start servicing request 2 | Long-running operation ......... | Thread 3 | ------------------- | Start servicing request 3 | Long-running operation ...| | request 1 | request 2 | request 3 | request 4 - BOOM!!!! </code></pre> With async-await you can make this like <pre class="prettyprint"><code>Thread 1 | --- | Start servicing request 1 | --- | Start servicing request 4 | ----- | Thread 2 | ------------ | Start servicing request 2 | ------------------------------ | Thread 3 | ------------------- | Start servicing request 3 | ----------------------- | | request 1 | request 2 | request 3 | request 4 - OK </code></pre> However, this seems to me like it could result in a surplus of async operations that are "in-flight" and if too many finish at the same time then there are no threads available to run their continuation. <pre class="prettyprint"><code>Thread 1 | --- | Start servicing request 1 | --- | Start servicing request 4 | ----- | Thread 2 | ------------ | Start servicing request 2 | ------------------------------ | Thread 3 | ------------------- | Start servicing request 3 | ----------------------- | | request 1 | request 2 | request 3 | request 4 - OK | longer-running operation 1 completes - BOOM!!!! </code></pre>

Suppose you have web application which handles a request with a very common flow: <ul> <li>Preprocess request parameters</li> <li>Perform some IO</li> <li>Post process IO results and return back to client</li> </ul> IO in this case can be database query, socket read\write, file read\write and so on. For an example of IO let's take file reading and some arbitrary but realistic timings: <ol> <li>Preprocessing of request parameters (validation etc) takes 1ms</li> <li>File reading (IO) takes 300 ms</li> <li>Post processing takes 1ms</li> </ol> Now suppose 100 requests come in with interval of 1ms. How many threads you will need to handle those requests without delay with synchronous processing like this? <pre class="prettyprint"><code>public IActionResult GetSomeFile(RequestParameters p) { string filePath = Preprocess(p); var data = System.IO.File.ReadAllBytes(filePath); return PostProcess(data); } </code></pre> Well, 100 threads obviously. Since file read takes 300ms in our example, when 100th request comes in - previous 99 are busy blocked by file reading. Now let's "use async await": <pre class="prettyprint"><code>public async Task<IActionResult> GetSomeFileAsync(RequestParameters p) { string filePath = Preprocess(p); byte[] data; using (var fs = System.IO.File.OpenRead(filePath)) { data = new byte[fs.Length]; await fs.ReadAsync(data, 0, data.Length); } return PostProcess(data); } </code></pre> How many threads are needed now to handle 100 requests without delay? Still 100. That's because file can be opened in "synchornous" and "asynchronous" modes, and by default it opens in "synchronous". That means even though you are using <code>ReadAsync</code> - underlying IO is not asynchronous and some thread from a thread pool is blocked waiting for result. Did we achieve anything useful by doing that? In context of web applicaiton - not at all. Now let's open file in "asynchronous" mode: <pre class="prettyprint"><code>public async Task<IActionResult> GetSomeFileReallyAsync(RequestParameters p) { string filePath = Preprocess(p); byte[] data; using (var fs = new FileStream(filePath, FileMode.Open, FileAccess.Read, FileShare.Read, 4096, FileOptions.Asynchronous)) { data = new byte[fs.Length]; await fs.ReadAsync(data, 0, data.Length); } return PostProcess(data); } </code></pre> How many threads we need now? Now 1 thread is enough, in theory. When you open file in "asynchronous" mode - reads and writes will utilize (on windows) windows overlapped IO. In simplified terms it works like this: there is a queue-like object (IO completion port) where OS can post notifications about completions of certain IO operations. .NET thread pool registers one such IO completion port. There is only one thread pool per .NET application, so there is one IO completion port. When file is opened in "asynchronous" mode - it binds its file handle to this IO completion port. Now when you do <code>ReadAsync</code>, while actual read is performed - no dedicated (for this specific operation) thread is blocked waiting for that read to complete. When OS notify .NET completion port that IO for this file handle has completed - .NET thread pool executes continuation on thread pool thread. Now let's see how processing of 100 requests with 1ms interval can go in our scenario: <ul> <li>Request 1 goes in, we grab thread from a pool to execute 1ms pre-processing step. Then thread performs asynchronous read. It doesn't need to block waiting for completion, so it returns to the pool.</li> <li>Request 2 goes in. We have a thread in a pool already which just completed pre-processing of request 1. We don't need an additional thread - we can use that one again.</li> <li>Same is true for all 100 requests.</li> <li>After handling pre-processing of 100 requests, there are 200ms until first IO completion will arrive, in which our 1 thread can do even more useful work.</li> <li>IO completion events start to arrive - but our post-processing step is also very short (1ms). Only one thread again can handle them all.</li> </ul> This is an idealized scenario of course, but it shows how not "async await" but specifically asynchronous IO can help you to "save threads". What if our post-processing step is not short but we instead decided to do heavy CPU bound work in it? Well, that will cause thread pool starvation. Thread pool will create new threads without delay, until it reaches configurable "low watermark" (which you can obtain via <code>ThreadPool.GetMinThreads()</code> and change via <code>ThreadPool.SetMinThreads()</code>). After that amount of threads is reached - thread pool will try to wait for one of the busy threads to become free. It will not wait forever of course, usually it will wait for 0.5-1 seconds and if no thread become free - it will create a new one. Still, that delay might slow your web application quite a bit in heavy load scenarios. So don't violate thread pool assumptions - don't run long CPU-bound work on thread pool threads.

How does async-await "save threads"?

Tags:

c#

.net

asynchronous

multithreading

async-await

I understand that with threadless async there are more threads available to service inputs (e.g. a HTTP request), but I don't understand how that doesn't potentially cause cause thread starvation when the async operations complete and a thread is needed to run their continuation.

Let's say we only have 3 threads

Click to copy

Thread 1 | 
Thread 2 |
Thread 3 |

and they get blocked on long-running operations that require threads (e.g. make database query on separate db server)

Click to copy

Thread 1 | --- | Start servicing request 1 | Long-running operation .................. |
Thread 2 | ------------ | Start servicing request 2 | Long-running operation ......... |
Thread 3 | ------------------- | Start servicing request 3 | Long-running operation ...|
               |
              request 1
                        |
                      request 2
                                |
                              request 3
                                               |
                                           request 4 - BOOM!!!!

With async-await you can make this like

Click to copy

Thread 1 | --- | Start servicing request 1 | --- | Start servicing request 4 | ----- |
Thread 2 | ------------ | Start servicing request 2 | ------------------------------ |
Thread 3 | ------------------- | Start servicing request 3 | ----------------------- |
               |
              request 1
                        |
                      request 2
                                |
                              request 3
                                                 |
                                           request 4 - OK

However, this seems to me like it could result in a surplus of async operations that are "in-flight" and if too many finish at the same time then there are no threads available to run their continuation.

Click to copy

Thread 1 | --- | Start servicing request 1 | --- | Start servicing request 4 | ----- |
Thread 2 | ------------ | Start servicing request 2 | ------------------------------ |
Thread 3 | ------------------- | Start servicing request 3 | ----------------------- |
               |
              request 1
                        |
                      request 2
                                |
                              request 3
                                                 |
                                           request 4 - OK   
                                                      | longer-running operation 1 completes - BOOM!!!!

966

asked Apr 15 '18 01:04

user7127000

1 Answers

Suppose you have web application which handles a request with a very common flow:

Preprocess request parameters
Perform some IO
Post process IO results and return back to client

IO in this case can be database query, socket read\write, file read\write and so on.

For an example of IO let's take file reading and some arbitrary but realistic timings:

Preprocessing of request parameters (validation etc) takes 1ms
File reading (IO) takes 300 ms
Post processing takes 1ms

Now suppose 100 requests come in with interval of 1ms. How many threads you will need to handle those requests without delay with synchronous processing like this?

Click to copy

public IActionResult GetSomeFile(RequestParameters p) {
    string filePath = Preprocess(p);
    var data = System.IO.File.ReadAllBytes(filePath);
    return PostProcess(data);
}

Well, 100 threads obviously. Since file read takes 300ms in our example, when 100th request comes in - previous 99 are busy blocked by file reading.

Now let's "use async await":

Click to copy

public async Task<IActionResult> GetSomeFileAsync(RequestParameters p) {
    string filePath = Preprocess(p);
    byte[] data;
    using (var fs = System.IO.File.OpenRead(filePath)) {
        data = new byte[fs.Length];
        await fs.ReadAsync(data, 0, data.Length);
    }
    return PostProcess(data);
}

How many threads are needed now to handle 100 requests without delay? Still 100. That's because file can be opened in "synchornous" and "asynchronous" modes, and by default it opens in "synchronous". That means even though you are using ReadAsync - underlying IO is not asynchronous and some thread from a thread pool is blocked waiting for result. Did we achieve anything useful by doing that? In context of web applicaiton - not at all.

Now let's open file in "asynchronous" mode:

Click to copy

public async Task<IActionResult> GetSomeFileReallyAsync(RequestParameters p) {
    string filePath = Preprocess(p);
    byte[] data;
    using (var fs = new FileStream(filePath, FileMode.Open, FileAccess.Read, FileShare.Read, 4096, FileOptions.Asynchronous)) {
        data = new byte[fs.Length];
        await fs.ReadAsync(data, 0, data.Length);
    }

    return PostProcess(data);
}

How many threads we need now? Now 1 thread is enough, in theory. When you open file in "asynchronous" mode - reads and writes will utilize (on windows) windows overlapped IO.

In simplified terms it works like this: there is a queue-like object (IO completion port) where OS can post notifications about completions of certain IO operations. .NET thread pool registers one such IO completion port. There is only one thread pool per .NET application, so there is one IO completion port.

When file is opened in "asynchronous" mode - it binds its file handle to this IO completion port. Now when you do ReadAsync, while actual read is performed - no dedicated (for this specific operation) thread is blocked waiting for that read to complete. When OS notify .NET completion port that IO for this file handle has completed - .NET thread pool executes continuation on thread pool thread.

Now let's see how processing of 100 requests with 1ms interval can go in our scenario:

Request 1 goes in, we grab thread from a pool to execute 1ms pre-processing step. Then thread performs asynchronous read. It doesn't need to block waiting for completion, so it returns to the pool.
Request 2 goes in. We have a thread in a pool already which just completed pre-processing of request 1. We don't need an additional thread - we can use that one again.
Same is true for all 100 requests.
After handling pre-processing of 100 requests, there are 200ms until first IO completion will arrive, in which our 1 thread can do even more useful work.
IO completion events start to arrive - but our post-processing step is also very short (1ms). Only one thread again can handle them all.

This is an idealized scenario of course, but it shows how not "async await" but specifically asynchronous IO can help you to "save threads".

What if our post-processing step is not short but we instead decided to do heavy CPU bound work in it? Well, that will cause thread pool starvation. Thread pool will create new threads without delay, until it reaches configurable "low watermark" (which you can obtain via ThreadPool.GetMinThreads() and change via ThreadPool.SetMinThreads()). After that amount of threads is reached - thread pool will try to wait for one of the busy threads to become free. It will not wait forever of course, usually it will wait for 0.5-1 seconds and if no thread become free - it will create a new one. Still, that delay might slow your web application quite a bit in heavy load scenarios. So don't violate thread pool assumptions - don't run long CPU-bound work on thread pool threads.

answered Oct 13 '22 02:10

Evk

Related questions
                            
                                Caveats if go package name doesn't start with github.com?
                            
                                What is the time complexity of accessing an element of a List by index in Dart?
                            
                                How to mock chain of method calls using Mockito
                            
                                Permissions for creating OAuth credentials in Google Cloud
                            
                                NPM Install Error Woes: "npm ERR! code EINVAL"
                            
                                leaflet marker cluster icons not displaying
                            
                                Tentative definition of struct with incomplete type
                            
                                How to change nginx's default homepage via a Dockerfile and then launch it by running a container
                            
                                subsetting world map to northern temperate latitudes ggplot2
                            
                                Log4Net Setup in a WPF app
                            
                                How to add active class/state for li element in ReactJS?
                            
                                How to get jenkins to use current npm version?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does async-await "save threads"?

Tags:

c#

.net

asynchronous

multithreading

async-await

user7127000

People also ask

1 Answers

Evk

Recent Activity

Donate For Us