In C++20, we got the capability to sleep on atomic variables, waiting for their value to change. We do so by using the <code>std::atomic::wait</code> method. Unfortunately, while <code>wait</code> has been standardized, <code>wait_for</code> and <code>wait_until</code> are not. Meaning that we cannot sleep on an atomic variable with a timeout. Sleeping on an atomic variable is anyway implemented behind the scenes with WaitOnAddress on Windows and the futex system call on Linux. Working around the above problem (no way to sleep on an atomic variable with a timeout), I could pass the memory address of an <code>std::atomic</code> to <code>WaitOnAddress</code> on Windows and it will (kinda) work with no UB, as the function gets <code>void*</code> as a parameter, and it's valid to cast <code>std::atomic<type></code> to <code>void*</code> On Linux, it is unclear whether it's ok to mix <code>std::atomic</code> with <code>futex</code>. <code>futex</code> gets either a <code>uint32_t*</code> or a <code>int32_t*</code> (depending which manual you read), and casting <code>std::atomic<u/int></code> to <code>u/int*</code> is UB. On the other hand, the manual says <blockquote> The uaddr argument points to the futex word. On all platforms, futexes are four-byte integers that must be aligned on a four- byte boundary. The operation to perform on the futex is specified in the futex_op argument; val is a value whose meaning and purpose depends on futex_op. </blockquote> Hinting that <code>alignas(4) std::atomic<int></code> should work, and it doesn't matter which integer type is it is as long as the type has the size of 4 bytes and the alignment of 4. Also, I have seen many places where this trick of combining atomics and futexes is implemented, including boost and TBB. So what is the best way to sleep on an atomic variable with a timeout in a non UB way? Do we have to implement our own atomic class with OS primitives to achieve it correctly? (Solutions like mixing atomics and condition variables exist, but sub-optimal)

You shouldn't necessarily have to implement a full custom <code>atomic</code> API, it should actually be safe to simply pull out a pointer to the underlying data from the <code>atomic<T></code> and pass it to the system. Since <code>std::atomic</code> does not offer some equivalent of <code>native_handle</code> like other synchronization primitives offer, you're going to be stuck doing some implementation-specific hacks to try to get it to interface with the native API. For the most part, it's reasonably safe to assume that first member of these types in implementations will be the same as the <code>T</code> type -- at least for integral values [1]. This is an assurance that will make it possible to extract out this value. <blockquote> ... and casting <code>std::atomic<u/int></code> to <code>u/int*</code> is UB </blockquote> This isn't actually the case. <code>std::atomic</code> is guaranteed by the standard to be Standard-Layout Type. One helpful but often esoteric properties of standard layout types is that it is safe to <code>reinterpret_cast</code> a <code>T</code> to a value or reference of the first sub-object (e.g. the first member of the <code>std::atomic</code>). As long as we can guarantee that the <code>std::atomic<u/int></code> contains only the <code>u/int</code> as a member (or at least, as its first member), then it's completely safe to extract out the type in this manner: <pre class="prettyprint lang-cpp prettyprint-override"><code>auto* r = reinterpret_cast<std::uint32_t*>(&atomic); // Pass to futex API... </code></pre> This approach should also hold on windows to cast the <code>atomic</code> to the underlying type before passing it to the <code>void*</code> API. Note: Passing a <code>T*</code> pointer to a <code>void*</code> that gets reinterpreted as a <code>U*</code> (such as an <code>atomic<T>*</code> to <code>void*</code> when it expects a <code>T*</code>) is undefined behavior -- even with standard-layout guarantees (as far as I'm aware). It will still likely work because the compiler can't see into the system APIs -- but that doesn't make the code well-formed. Note 2: I can't speak on the <code>WaitOnAddress</code> API as I haven't actually used this -- but any atomics API that depends on the address of a properly aligned integral value (<code>void*</code> or otherwise) should work properly by extracting out a pointer to the underlying value. <hr> [1] Since this is tagged <code>C++20</code>, you can verify this with <code>std::is_layout_compatible</code> with a <code>static_assert</code>: <pre class="prettyprint lang-cpp prettyprint-override"><code>static_assert(std::is_layout_compatible_v<int,std::atomic<int>>); </code></pre> (Thanks to @apmccartney for this suggestion in the comments). I can confirm that this will be layout compatible for Microsoft's STL, libc++, and libstdc++; however if you don't have access to <code>is_layout_compatible</code> and you're using a different system, you might want to check your compiler's headers to ensure this assumption holds.

Using std::atomic with futex system call

Tags:

c++

linux

c++20

stdatomic

futex

In C++20, we got the capability to sleep on atomic variables, waiting for their value to change. We do so by using the std::atomic::wait method.

Unfortunately, while wait has been standardized, wait_for and wait_until are not. Meaning that we cannot sleep on an atomic variable with a timeout.

Sleeping on an atomic variable is anyway implemented behind the scenes with WaitOnAddress on Windows and the futex system call on Linux.

Working around the above problem (no way to sleep on an atomic variable with a timeout), I could pass the memory address of an std::atomic to WaitOnAddress on Windows and it will (kinda) work with no UB, as the function gets void* as a parameter, and it's valid to cast std::atomic<type> to void*

On Linux, it is unclear whether it's ok to mix std::atomic with futex. futex gets either a uint32_t* or a int32_t* (depending which manual you read), and casting std::atomic<u/int> to u/int* is UB. On the other hand, the manual says

The uaddr argument points to the futex word. On all platforms, futexes are four-byte integers that must be aligned on a four- byte boundary. The operation to perform on the futex is specified in the futex_op argument; val is a value whose meaning and purpose depends on futex_op.

Hinting that alignas(4) std::atomic<int> should work, and it doesn't matter which integer type is it is as long as the type has the size of 4 bytes and the alignment of 4.

Also, I have seen many places where this trick of combining atomics and futexes is implemented, including boost and TBB.

So what is the best way to sleep on an atomic variable with a timeout in a non UB way? Do we have to implement our own atomic class with OS primitives to achieve it correctly?

(Solutions like mixing atomics and condition variables exist, but sub-optimal)

655

asked Apr 10 '21 11:04

David Haim

1 Answers

You shouldn't necessarily have to implement a full custom atomic API, it should actually be safe to simply pull out a pointer to the underlying data from the atomic<T> and pass it to the system.

Since std::atomic does not offer some equivalent of native_handle like other synchronization primitives offer, you're going to be stuck doing some implementation-specific hacks to try to get it to interface with the native API.

For the most part, it's reasonably safe to assume that first member of these types in implementations will be the same as the T type -- at least for integral values ^[1]. This is an assurance that will make it possible to extract out this value.

... and casting std::atomic<u/int> to u/int* is UB

This isn't actually the case.

std::atomic is guaranteed by the standard to be Standard-Layout Type. One helpful but often esoteric properties of standard layout types is that it is safe to reinterpret_cast a T to a value or reference of the first sub-object (e.g. the first member of the std::atomic).

As long as we can guarantee that the std::atomic<u/int> contains only the u/int as a member (or at least, as its first member), then it's completely safe to extract out the type in this manner:

auto* r = reinterpret_cast<std::uint32_t*>(&atomic);
// Pass to futex API...

This approach should also hold on windows to cast the atomic to the underlying type before passing it to the void* API.

Note: Passing a T* pointer to a void* that gets reinterpreted as a U* (such as an atomic<T>* to void* when it expects a T*) is undefined behavior -- even with standard-layout guarantees (as far as I'm aware). It will still likely work because the compiler can't see into the system APIs -- but that doesn't make the code well-formed.

Note 2: I can't speak on the WaitOnAddress API as I haven't actually used this -- but any atomics API that depends on the address of a properly aligned integral value (void* or otherwise) should work properly by extracting out a pointer to the underlying value.

^[1] Since this is tagged C++20, you can verify this with std::is_layout_compatible with a static_assert:

static_assert(std::is_layout_compatible_v<int,std::atomic<int>>);

(Thanks to @apmccartney for this suggestion in the comments).

I can confirm that this will be layout compatible for Microsoft's STL, libc++, and libstdc++; however if you don't have access to is_layout_compatible and you're using a different system, you might want to check your compiler's headers to ensure this assumption holds.

172

answered Oct 23 '22 08:10

Human-Compiler

Related questions
                            
                                Template variables with template argument deduction and default template parameters
                            
                                Nested class explicit specilization: different compiler behavior
                            
                                How to define a Hash class for custom std::basic_string<> specialization class just like std::string?
                            
                                Fibers use cases
                            
                                Include Pistache in C++ project
                            
                                Is the compiler allowed to optimize out dynamic_cast of a volatile pointer when the compiler doesn't see a possible type which can fulfill the cast?
                            
                                Strict aliasing rules broken with templates and inheritance
                            
                                Is getting the decltype of a deduced member function inside the trailing return type of another member function well-formed?
                            
                                C++17: Generic (multiple-inheritance based?) check for template in parameter pack
                            
                                What is the meaning of "if the context from which the specialization is referenced depends on a template parameter"?
                            
                                Variation on the type punning theme: in-place trivial construction
                            
                                unique_ptr < 0 OR what does less than operator do?
                            
                                Conversion to void** on different compilers
                            
                                What is guaranteed with C++ std::atomic at the programmer level?
                            
                                Valid syntax of calling pseudo-destructor for a floating constant
                            
                                Counting parameters of a template template type
                            
                                Why is this friend method not found as expected?
                            
                                Is the transformation of fetch_add(0, memory_order_relaxed/release) to mfence + mov legal?
                            
                                Why does Clang generate different code for reference and non-null pointer arguments?
                            
                                Why is there not a no-throw guarantee in the standard for std::set::extract() and std::set::insert(nh)?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With