
What is bus-locking in the context of atomic variables?

I have been using C++ for a long time, and now I'm starting to learn assembly and how processors work (not just for fun; I have to as part of a test program). While learning assembly, I started hearing some of the terms that come up here and there when discussing multithreading, given that I do a lot of multithreading in scientific computing. I'm struggling to get the full picture, and I'd appreciate help widening it.

I learned that a bus, in its simplest form, is something like a multiplexer followed by a demultiplexer. Each of the ends takes an address as input, in order to connect the two ends with some external component. The two ends can, based on the address, point to memory, graphics card, RAM, CPU registers, or anything else.

Now getting to my question: I keep hearing people arguing about whether to use a mutex or an atomic for thread safety (I know there's no ultimate answer; that's not what my question is about, but my question is about the comparison). Here, for example, the claim was made that atomics are so bad that they prevent a processor from doing a decent job because of bus-locking.

Could someone please explain, in a little detail, what bus-locking is and how it differs from what a mutex does, given that, AFAIK, a mutex needs at least two atomic operations just to lock and unlock?
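To make the comparison concrete, here is a rough sketch of the two variants I have in mind (just a plain shared counter; the names are made up for illustration):

    #include <atomic>
    #include <mutex>

    std::mutex counter_mutex;          // illustrative name
    long counter_with_mutex = 0;

    void increment_with_mutex() {
        // Locking and unlocking each involve at least one atomic
        // read-modify-write on the mutex's internal state.
        std::lock_guard<std::mutex> lock(counter_mutex);
        ++counter_with_mutex;
    }

    std::atomic<long> counter_atomic{0};

    void increment_with_atomic() {
        // A single atomic read-modify-write; on x86-64 this typically
        // compiles to a LOCK-prefixed add/xadd instruction.
        counter_atomic.fetch_add(1, std::memory_order_relaxed);
    }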

asked Apr 12 '17 09:04 by The Quantum Physicist


People also ask

What do modern processors lock the bus for?

Intel's choice was to lock the whole memory bus to solve the coherency problem; the processor locks the bus for the duration of the operation, meaning that no other CPUs or devices can access it. A split lock (a locked operation whose operand straddles two cache lines) blocks not only the CPU performing the access, but all others in the system as well.

Is atomic int lock free?

atomic<T> variables don't use locks (at least where T is natively atomic on your platform), but that by itself doesn't make an algorithm lock-free. You might use them in the implementation of a lock-free container, but they're not sufficient on their own.
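A quick way to check what you get on a given platform is a minimal sketch like this (results depend on the compiler and target; with GCC the non-lock-free case may need linking against libatomic):

    #include <atomic>
    #include <cstdio>

    struct Big { char data[64]; };   // too large to be natively atomic

    int main() {
        std::atomic<int> a{0};
        std::atomic<Big> b{};

        // int is natively atomic on mainstream platforms, so no lock is
        // used; the large struct typically falls back to a hidden lock.
        std::printf("atomic<int> lock-free: %d\n", a.is_lock_free());
        std::printf("atomic<Big> lock-free: %d\n", b.is_lock_free());
    }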

How are Atomics implemented in hardware?

User-level locks are built on the processor's atomic instructions, which atomically update a memory location. On x86, these instructions carry a LOCK prefix and have a memory address as their destination operand.
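As an illustration of that description, a hand-rolled fetch-and-add might look like the sketch below (GCC/Clang extended asm for x86-64; in real code you would just use std::atomic or the compiler builtins):

    // Sketch: the LOCK prefix applied to an instruction whose destination
    // operand is a memory address (x86-64, GCC/Clang extended asm).
    int locked_fetch_add(int* addr, int value) {
        int old = value;
        // XADD swaps the register with the memory operand and adds;
        // the LOCK prefix makes the whole read-modify-write atomic.
        asm volatile("lock; xaddl %0, %1"
                     : "+r"(old), "+m"(*addr)
                     :
                     : "memory");
        return old;   // previous value stored at addr
    }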

What is atomic memory access?

An atomic access is a term for a sequence of accesses to a memory region that is carried out as a single, indivisible operation. Atomic accesses are used by bus managers (masters) when they want to perform a series of accesses to a particular memory region while being sure that the original data in the region are not corrupted by writes from other managers.
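In C++ the same idea is usually expressed as a compare-and-swap loop: read the location, compute a new value, and only write it back if nobody else has modified the location in the meantime. A minimal sketch:

    #include <atomic>

    // Atomically double a shared value, retrying if another thread
    // wrote to it between our read and our write.
    int atomic_double(std::atomic<int>& value) {
        int expected = value.load(std::memory_order_relaxed);
        int desired;
        do {
            desired = expected * 2;
            // compare_exchange_weak stores `desired` only if `value` still
            // holds `expected`; otherwise it reloads `expected` and we retry.
        } while (!value.compare_exchange_weak(expected, desired,
                                              std::memory_order_acq_rel,
                                              std::memory_order_relaxed));
        return desired;
    }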


2 Answers

From Intel® 64 and IA-32 Architectures Software Developer’s Manual:

Beginning with the P6 family processors, when the LOCK prefix is prefixed to an instruction and the memory area being accessed is cached internally in the processor, the LOCK# signal is generally not asserted. Instead, only the processor’s cache is locked. Here, the processor’s cache coherency mechanism ensures that the operation is carried out atomically with regards to memory.

There are special non-temporal store instructions to bypass the cache. All other loads and stores normally go through the cache, unless the memory page is marked as non-cacheable (like GPU or PCIe device memory).
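For completeness, this is roughly what a non-temporal store looks like, sketched with the SSE2 intrinsic _mm_stream_si32 (whether bypassing the cache actually helps depends heavily on the access pattern):

    #include <cstddef>
    #include <emmintrin.h>   // SSE2 intrinsics

    // Fill a buffer with non-temporal stores that bypass the cache,
    // useful when the data will not be read again soon.
    void fill_streaming(int* dst, int value, std::size_t count) {
        for (std::size_t i = 0; i < count; ++i)
            _mm_stream_si32(dst + i, value);   // write-combining store, no cache fill
        _mm_sfence();   // make the streamed stores visible to other cores
    }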

answered Sep 18 '22 00:09 by Maxim Egorushkin


"I learned that a bus, in its simplest form, is something like a multiplexer followed by a demultiplexer. Each of the ends"

Well, that's not correct. In its simplest form there's nothing to multiplex or demultiplex: it's just two things talking directly to each other. And in the not-so-simple case, a bus may have three or more devices connected. In that case, you start needing bus addresses, because you can no longer talk about "the other end".

Now if you've got multiple devices on a single bus, they generally can't all talk at the same time, so there must be some mechanism that prevents them from doing so. Yet for all devices to be able to share that bus, they must take turns in who is talking to whom. Bus locking, as a broad term, means any deviation from that usual take-turns pattern, with two devices reserving the bus for their mutual conversation.

In the particular context of the x86 memory bus, this means keeping the bus locked for the duration of a read-modify-write cycle (as Kerrek SB pointed out in the comments). Now this may sound like a simple bus with two devices (memory and CPU), but DMA and multi-core chips mean it isn't that simple.
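One concrete case where modern x86 still has to lock the bus is a "split lock": a locked read-modify-write whose operand straddles two cache lines, so the cache-lock fast path from the Intel quote above can't be used. The sketch below provokes one; deliberately misaligning the counter like this is undefined behavior in standard C++ and is only meant to illustrate the mechanism (the buffer layout and 64-byte line size are assumptions):

    #include <cstdint>
    #include <cstdio>

    alignas(64) static unsigned char buffer[128];

    int main() {
        // Place a 4-byte counter 2 bytes before a 64-byte boundary,
        // so it spans two cache lines.
        auto* counter = reinterpret_cast<std::uint32_t*>(buffer + 62);

        // GCC/Clang still emit a LOCK-prefixed add here; because the
        // operand crosses a cache-line boundary, the CPU falls back to a
        // real bus lock (recent CPUs/kernels can detect and warn about it).
        __atomic_fetch_add(counter, 1, __ATOMIC_SEQ_CST);

        std::printf("counter = %u\n", *counter);
        return 0;
    }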

answered Sep 18 '22 00:09 by MSalters