What's are practical example where acquire release memory order differs from sequential consistency?

Tags:

Clearly, sequential consistent atomic operations differ in their valid observable behavior from acquire-release only operations in a valid C++ program. Definitions are given in the C++ standard (since C++11) or here.

However, I have never come across a real-world example of an algorithm or a data structure that where acquire-release semantics is insufficient and sequential consistency is needed.

What would be an practical example of a real world algorithm or data structure, where sequential consistency is needed and acquire-release memory order is not enough?

Note, that even std::mutex does not guarantee sequential consistency.

816

asked Jan 25 '17 18:01

Ralph Tandetzky

1 Answers

Peterson's algorithm is an example of something that requires sequential consistency.

In the days before mutexes, the algoritm was used to give a single thread access to a protected area. The algoritm only works with 2 threads, each managing a flag that expresses the intention to access the protected area. If both set the flag at (about) the same time, both will backoff and try again. The real algorithm is more advanced since it uses a 'turn' flag to manage fair access, but to show the difference between seq/cst and acq/rel this is not necessary.

Below is a ready-to-compile, simplified version of the Peterson algorithm that actually shows that the algoritms is broken if something weaker than sequential consistency is used. Interestingly enough, this is broken even on X86 since that platform allows store-loads to be reordered.
The problem with store-load reordering is that both threads may express their intention to access the protected area by setting their me flag to true, while both are reading false from the him flag (since the value was not propagated yet to both threads) and enter the protected area. This is not possible with sequential consistency.

With gcc, I had to compile with -O3 optimizations to have the assert fire, with clang that was not necessary. Both compilers use a different approach to implement sequential consistency.

Click to copy

#include <thread>
#include <atomic>
#include <assert.h>

std::atomic<bool> flag1{false};
std::atomic<bool> flag2{false};

std::atomic<int> counter{0};

// Change these to memory_order_seq_cst to fix the algorithm
static const auto store_ordering = std::memory_order_release;
static const auto load_ordering  = std::memory_order_acquire;

void busy(int n)
{
    auto &me  = (n==1) ? flag1 : flag2;
    auto &him = (n==1) ? flag2 : flag1;

    for (;;)
    {
        for (;;)
        {
            me.store(true, store_ordering);
            if (him.load(load_ordering) == false)
            {
                // got the 'lock'
                break;
            }

            // retention, no wait period -> busy loop
            me.store(false, store_ordering);
        }

        int tmp = counter.fetch_add(1, std::memory_order_relaxed);
        assert(tmp == 0);

        /*
         * critical area
         */

        tmp = counter.fetch_sub(1, std::memory_order_relaxed);
        assert(tmp == 1);

        me.store(false, store_ordering);
    }
}


int main()
{
    std::thread t1{busy, 1};
    std::thread t2{busy, 2};

    t1.join();
    t2.join();
}

answered Oct 19 '22 03:10

LWimsey

Related questions
                            
                                C++ non-type template parameter const char*
                            
                                Do unrestricted unions require placement new and a constructor definition?
                            
                                Does multiplying unsigned short cause undefined behaviour?
                            
                                Can you use a subexpression within fold expressions?
                            
                                What is the difference between Callable concept and the std::is_function type traits?
                            
                                SDL2 - Why does SDL_CreateTextureFromSurface() need a renderer*?
                            
                                Compiler optimizations and temporary assignments in C and C++
                            
                                Why do both struct and class exist in C++?
                            
                                HPX minimal two node example set-up?
                            
                                How do I remove a C-style cast from a #define so I can use it in a preprocessor #if?
                            
                                #elseif vs #elif (C/C++ preprocessor)
                            
                                Nesting Lambda functions - Performance implications
                            
                                C++: rvalue reference memory
                            
                                Typedef of anonymous class
                            
                                What is a "virtual thunk" to a virtual function that inherits from a virtual base class?
                            
                                What is the purpose of 61 in tm_sec field from the tm structure
                            
                                Reducing the footprint of debug symbols (executable is bloated to 4 GB)
                            
                                How to make [std::operator""s] visible in a namespace?
                            
                                How to setup Visual Studio Code with OpenGL?
                            
                                Does adding an arguments with default values to functions break ABI?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What's are practical example where acquire release memory order differs from sequential consistency?

Tags:

c++

multithreading

c++11

memory-model

atomic

Ralph Tandetzky

People also ask

1 Answers

LWimsey

Recent Activity

Donate For Us