
How do threaded systems cope with shared data being cached by different CPUs?

I'm coming largely from a C++ background, but I think this question applies to threading in any language. Here's the scenario:

  1. We have two threads (ThreadA and ThreadB), and a value x in shared memory

  2. Assume that access to x is appropriately controlled by a mutex (or other suitable synchronization control)

  3. If the threads happen to run on different processors, what happens if ThreadA performs a write, but its processor places the result in its L2 cache rather than in main memory? If ThreadB then tries to read the value, won't it just look in its own L1/L2 cache or in main memory and work with whatever stale value it finds there?

If that's not the case, then how is this issue managed?

If that is the case, then what can be done about it?

asked Jul 09 '09 by csj


2 Answers

Your example would work just fine.

Multiple processors use a coherency protocol such as MESI to ensure that data remains in sync between the caches. With MESI, each cache line is considered to be either modified, exclusively held, shared between CPUs, or invalid. Writing a cache line that is shared between processors forces it to become invalid in the other CPUs' caches, keeping them in sync.
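One visible consequence of this protocol is false sharing: if two threads write to different variables that happen to live on the same cache line, each write invalidates the other core's copy and the line ping-pongs between caches. Here is a small C++ sketch of the effect (illustrative, not from the answer; the 64-byte line size is an assumption that matches most current x86 parts):

#include <atomic>
#include <chrono>
#include <cstdio>
#include <thread>

// Two counters on the same cache line vs. padded onto separate lines.
// Under MESI, every write to a shared line invalidates the other core's
// copy, so the unpadded version is typically several times slower.
struct SameLine {
    std::atomic<long> a{0};
    std::atomic<long> b{0};              // shares a cache line with 'a'
};

struct SeparateLines {
    alignas(64) std::atomic<long> a{0};  // 64 bytes is a common line size
    alignas(64) std::atomic<long> b{0};  // forced onto its own line
};

template <typename Counters>
double time_increments(Counters& c) {
    auto start = std::chrono::steady_clock::now();
    std::thread t1([&] { for (long i = 0; i < 10000000; ++i) c.a.fetch_add(1, std::memory_order_relaxed); });
    std::thread t2([&] { for (long i = 0; i < 10000000; ++i) c.b.fetch_add(1, std::memory_order_relaxed); });
    t1.join();
    t2.join();
    return std::chrono::duration<double>(std::chrono::steady_clock::now() - start).count();
}

int main() {
    SameLine s;
    SeparateLines p;
    std::printf("same cache line: %.3f s\n", time_increments(s));
    std::printf("separate lines:  %.3f s\n", time_increments(p));
}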

However, this is not quite enough. Different processors have different memory models, and most modern processors support some reordering of memory accesses. In these cases, memory barriers are needed.

For instance if you have Thread A:

DoWork();
workDone = true;

And Thread B:

while (!workDone) {}
DoSomethingWithResults();

With both running on separate processors, there is no guarantee that the writes done within DoWork() will be visible to Thread B before the write to workDone, and DoSomethingWithResults() would proceed with potentially inconsistent state. Memory barriers guarantee some ordering of the reads and writes - adding a memory barrier after DoWork() in Thread A would force all reads/writes done by DoWork() to complete before the write to workDone, so that Thread B would get a consistent view. Mutexes inherently provide a memory barrier, so that reads/writes cannot pass a call to lock and unlock.
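In C++11 and later, the barrier in this example can be expressed with std::atomic. A minimal sketch, assuming trivial stand-ins for DoWork() and DoSomethingWithResults(): the release store pairs with the acquire load, so everything DoWork() wrote is visible to Thread B once it observes workDone == true.

#include <atomic>
#include <cstdio>
#include <thread>

int result = 0;                      // written by DoWork(), read by Thread B
std::atomic<bool> workDone{false};

void DoWork() { result = 42; }       // placeholder for the real work
void DoSomethingWithResults() { std::printf("result = %d\n", result); }

int main() {
    std::thread a([] {
        DoWork();
        // Release barrier: all prior writes complete before the flag is set.
        workDone.store(true, std::memory_order_release);
    });
    std::thread b([] {
        // Acquire barrier: no later reads can move before seeing the flag.
        while (!workDone.load(std::memory_order_acquire)) {}
        DoSomethingWithResults();    // guaranteed to see result == 42
    });
    a.join();
    b.join();
}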

In your case, one processor would signal to the others that it dirtied a cache line and force the other processors to reload from memory. Acquiring the mutex to read and write the value guarantees that the change to memory is visible to the other processor in the order expected.

answered Oct 26 '22 by Michael


Most locking primitives, like mutexes, imply memory barriers. In effect, these force the locking CPU's pending writes to become visible to other processors, and stale locally cached values to be re-read.

For example,

ThreadA {
    x = 5;         // probably writes to cache
    unlock mutex;  // forcibly writes local CPU cache to global memory
}
ThreadB {
    lock mutex;    // discards data in local cache
    y = x;         // x must read from global memory
}
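A runnable C++ rendering of that sketch, with illustrative names (the join between the two threads merely guarantees ThreadA's critical section runs first for the demo; the mutex is what makes the write visible):

#include <cstdio>
#include <mutex>
#include <thread>

int x = 0;
std::mutex m;

int main() {
    std::thread threadA([] {
        std::lock_guard<std::mutex> guard(m);
        x = 5;                       // published when the mutex is released
    });
    threadA.join();                  // order the critical sections for the demo

    std::thread threadB([] {
        std::lock_guard<std::mutex> guard(m);
        int y = x;                   // guaranteed to observe x == 5
        std::printf("y = %d\n", y);
    });
    threadB.join();
}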
answered Oct 26 '22 by ephemient