Does async(launch::async) in C++11 make thread pools obsolete for avoiding expensive thread creation?

Performance Test

So, I tested the performance of various methods of calling things and came up with these numbers on an 8 core (AMD Ryzen 7 2700X) system running Fedora 29 compiled with clang version 7.0.1 and libc++ (not libstdc++):

   Do nothing calls per second:   35365257                                      
        Empty calls per second:   35210682                                      
   New thread calls per second:      62356                                      
 Async launch calls per second:      68869                                      
Worker thread calls per second:     970415

And native, on my MacBook Pro 15" (Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz) with Apple LLVM version 10.0.0 (clang-1000.10.44.4) under OSX 10.13.6, I get this:

   Do nothing calls per second:   22078079
        Empty calls per second:   21847547
   New thread calls per second:      43326
 Async launch calls per second:      58684
Worker thread calls per second:    2053775

For the worker thread, I started up a thread, then used a lockless queue to send requests to another thread and then wait for a "It's done" reply to be sent back.

The "Do nothing" is just to test the overhead of the test harness.

It's clear that the overhead of launching a thread is enormous. And even the worker thread with the inter-thread queue slows things down by a factor of 20 or so on Fedora 25 in a VM, and by about 8 on native OS X.

I created a Bitbucket project holding the code I used for the performance test. It can be found here: https://bitbucket.org/omnifarious/launch_thread_performance

Related questions
                            
                                Why does cout print "2 + 3 = 15" in this snippet of code?
                            
                                When is a C++ destructor called?
                            
                                How to forward declare a C++ template class?
                            
                                Why isn't `int pow(int base, int exponent)` in the standard C++ libraries?
                            
                                Override compile flags for single files
                            
                                Lambda returning itself: is this legal?
                            
                                When to make a type non-movable in C++11?
                            
                                Flags to enable thorough and verbose g++ warnings
                            
                                Why does Clang optimize away x * 1.0 but NOT x + 0.0?
                            
                                What is uint_fast32_t and why should it be used instead of the regular int and uint32_t?
                            
                                Is sizeof(bool) defined in the C++ language standard?
                            
                                How do I explicitly instantiate a template function?
                            
                                Segmentation fault on large array sizes
                            
                                Reading and writing binary file
                            
                                How to clear stringstream? [duplicate]
                            
                                What does static_assert do, and what would you use it for?
                            
                                Is there a simple way to convert C++ enum to string?
                            
                                Colorizing text in the console with C++
                            
                                Modern way to filter STL container?
                            
                                What does template <unsigned int N> mean?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Does async(launch::async) in C++11 make thread pools obsolete for avoiding expensive thread creation?

Tags:

c++

asynchronous

multithreading

c++11

threadpool

People also ask

Performance Test

Recent Activity

Donate For Us