Today I wrote some code to test the performance of mutexes.
This is the Boost (1.54) version, compiled on VS2010 with O2 optimization:
#include <boost/thread.hpp>
#include <boost/chrono.hpp>
#include <iostream>

int main() {
    boost::mutex m;
    auto start = boost::chrono::system_clock::now();
    for (size_t i = 0; i < 50000000; ++i) {
        boost::lock_guard<boost::mutex> lock(m);
    }
    auto end = boost::chrono::system_clock::now();
    boost::chrono::duration<double> elapsed_seconds = end - start;
    std::cout << elapsed_seconds.count() << std::endl;
}
And this is the std version, compiled on VS2013, also with O2 optimization:
#include <mutex>
#include <chrono>
#include <iostream>

int main() {
    std::mutex m;
    auto start = std::chrono::system_clock::now();
    for (size_t i = 0; i < 50000000; ++i) {
        std::lock_guard<std::mutex> lock(m);
    }
    auto end = std::chrono::system_clock::now();
    std::chrono::duration<double> elapsed_seconds = end - start;
    std::cout << elapsed_seconds.count() << std::endl;
}
A bit different, but doing exactly the same thing. My CPU is an Intel Core i7-2600K, my OS is Windows 7 64-bit, and the results are 0.7020 s vs. 2.1684 s: the std version is about 3.08 times slower.
boost::mutex tries _interlockedbittestandset first, and only if that fails does it fall back to the heavyweight WaitForSingleObject. That is simple to understand.
The std::mutex in VS2013 seems much more complex. I have tried to read through its implementation but could not see the point. Why is it so complex? Is there a faster way?
Taking a std::unique_lock of a std::mutex is significantly slower than taking a std::unique_lock of a std::shared_mutex, even though they offer exactly the same lock constraints and, under the hood, both end up calling RtlAcquireSRWLockExclusive().
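If you want to check that claim yourself, here is a minimal benchmark sketch. It assumes a C++17 toolset (std::shared_mutex is not available in VS2013) and uses std::steady_clock for timing; time_uncontended_locks is just a hypothetical helper name:

#include <chrono>
#include <cstddef>
#include <iostream>
#include <mutex>
#include <shared_mutex>

// Time N uncontended lock/unlock cycles through std::unique_lock<Mutex>.
template <class Mutex>
double time_uncontended_locks(std::size_t iterations) {
    Mutex m;
    auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < iterations; ++i) {
        std::unique_lock<Mutex> lock(m);  // exclusive lock in both cases
    }
    auto end = std::chrono::steady_clock::now();
    return std::chrono::duration<double>(end - start).count();
}

int main() {
    const std::size_t n = 50000000;
    std::cout << "std::mutex:        " << time_uncontended_locks<std::mutex>(n) << " s\n";
    std::cout << "std::shared_mutex: " << time_uncontended_locks<std::shared_mutex>(n) << " s\n";
}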
A fast mutex is fast because the acquisition and release steps are optimized for the usual case when there's no contention for the mutex. The critical step in acquiring the mutex is to atomically decrement and test an integer counter that indicates how many threads either own or are waiting for the mutex.
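To make that concrete, here is a minimal Win32 sketch of such a counter-based fast mutex (sometimes called a "benaphore"). FastMutex is a hypothetical name, and the sketch deliberately skips spinning, recursion support, and error handling that a production mutex would need:

#include <windows.h>

// Counter-based fast mutex: count_ == 1 means free, 0 means owned with no
// waiters, negative means there are waiters. The kernel semaphore is only
// touched on the contended path.
class FastMutex {
public:
    FastMutex() : count_(1), sem_(CreateSemaphoreW(nullptr, 0, MAXLONG, nullptr)) {}
    ~FastMutex() { CloseHandle(sem_); }

    void lock() {
        // Fast path: the counter drops from 1 to 0 and we own the mutex
        // without any system call.
        if (InterlockedDecrement(&count_) < 0)
            WaitForSingleObject(sem_, INFINITE);  // contended: block in the kernel
    }

    void unlock() {
        // If the counter is still at or below zero after the increment,
        // at least one thread is waiting, so wake exactly one of them.
        if (InterlockedIncrement(&count_) <= 0)
            ReleaseSemaphore(sem_, 1, nullptr);
    }

private:
    volatile LONG count_;
    HANDLE sem_;
};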
std::mutex: The mutex class is a synchronization primitive that can be used to protect shared data from being simultaneously accessed by multiple threads.
It seems that std::mutex might only use system calls, which carry a LOT of overhead, whereas boost::mutex implements at least some of its functionality in user space, i.e. it tries to avoid system calls whenever possible. That would be the reason for the _interlockedbittestandset check before WaitForSingleObject.
I don't know the actual internals of MS's STL, but I've seen performance differences like this in examples from an operating systems class.
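For illustration, here is a minimal sketch of that user-mode fast path: a single _interlockedbittestandset tries to grab the lock without entering the kernel, and only on contention does the code make a system call. The fallback here is just Sleep(0) rather than boost::mutex's real event-based wait, so treat it as an illustration of the idea, not a drop-in mutex replacement:

#include <windows.h>
#include <intrin.h>

// Illustrative spin-then-yield lock: the uncontended path is one interlocked
// instruction and no kernel transition at all.
class SpinThenYieldLock {
public:
    void lock() {
        // _interlockedbittestandset returns the previous value of bit 0;
        // 0 means the bit was clear and we have just acquired the lock.
        while (_interlockedbittestandset(&state_, 0) != 0) {
            Sleep(0);  // contended: yield the rest of the time slice (a system call)
        }
    }

    void unlock() {
        _interlockedbittestandreset(&state_, 0);  // clear the lock bit
    }

private:
    volatile long state_ = 0;
};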