 

False Sharing and Atomic Variables

When different variables reside in the same cache line, you can experience false sharing: even if two threads (running on different cores) access two different variables, you take a performance hit whenever those variables share a cache line, because every write triggers cache-coherence traffic.

Now say those variables are atomic (by atomic I mean variables that introduce a memory fence, such as C++'s std::atomic<T>). Does false sharing still matter, or is it irrelevant whether atomic variables share a cache line, since they supposedly trigger cache coherence anyway? In other words, will putting atomic variables on the same cache line make the application slower than putting them on separate lines?
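To make the question concrete, here is a minimal sketch (the struct and function names are illustrative, not from any particular codebase) of the two layouts being compared: two atomics packed next to each other, almost certainly on one 64-byte cache line, versus the same two atomics padded onto separate lines. 64 is a common line size; C++17 also offers std::hardware_destructive_interference_size for this.

```cpp
#include <atomic>
#include <thread>

// Both atomics very likely land on the same 64-byte cache line.
struct Packed {
    std::atomic<long> a{0};
    std::atomic<long> b{0};
};

// Same variables, but each aligned to its own cache line.
struct Padded {
    alignas(64) std::atomic<long> a{0};
    alignas(64) std::atomic<long> b{0};
};

// Two threads hammer two *different* variables. With Packed they still
// contend for one cache line (false sharing); with Padded they do not.
// The results are identical either way -- only the speed differs.
template <typename Layout>
void hammer(Layout& v, long iters) {
    std::thread t1([&] {
        for (long i = 0; i < iters; ++i)
            v.a.fetch_add(1, std::memory_order_relaxed);
    });
    std::thread t2([&] {
        for (long i = 0; i < iters; ++i)
            v.b.fetch_add(1, std::memory_order_relaxed);
    });
    t1.join();
    t2.join();
}
```

Timing hammer() on Packed versus Padded under a profiler is one way to measure how much the shared line actually costs on a given machine.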

asked Apr 13 '12 by pythonic


People also ask

What does happen during false sharing?

False sharing degrades performance when all of the following conditions occur: shared data is modified by multiple threads; multiple threads modify data within the same cache line; and the data is modified very frequently (as in a tight loop).

How do you fix false sharing issues?

In general, false sharing can be reduced using the following techniques: Make use of private or threadprivate data as much as possible. Use the compiler's optimization features to eliminate memory loads and stores. Pad data structures so that each thread's data resides on a different cache line.
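The first technique listed above (prefer private data) can be sketched as follows; the names here are illustrative, not from any particular library. Instead of every thread updating a shared counter in a tight loop, each thread accumulates in a thread-private local and publishes once at the end, so there is only one contended write per thread.

```cpp
#include <atomic>
#include <thread>
#include <vector>

// Each worker counts in a private local variable (no sharing at all),
// then performs a single atomic add to the shared total when done.
void count_in_private(std::atomic<long>& shared_total,
                      int nthreads, long per_thread) {
    std::vector<std::thread> workers;
    for (int t = 0; t < nthreads; ++t) {
        workers.emplace_back([&shared_total, per_thread] {
            long local = 0;                 // thread-private: no coherence traffic
            for (long i = 0; i < per_thread; ++i)
                ++local;
            shared_total.fetch_add(local);  // one contended write per thread
        });
    }
    for (auto& w : workers)
        w.join();
}
```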

What is false sharing and true sharing?

As I understand it, true sharing refers to the problem of multiple cores frequently writing to the same shared variable, while false sharing refers to multiple cores writing to different variables that are on the same cache line.

What is false sharing in the context of multi threading?

"false sharing" is something that happens in (some) cache systems when two threads (or rather two cores) writes to two different variables that belongs to the same cache line.


2 Answers

A clarification: for false sharing to have negative consequences, at least some of the accesses to the "falsely shared" variables must be writes. If writes are rare, the performance impact of false sharing is rather negligible; the more writes (and thus cache-line invalidation messages), the worse the performance.

Even with atomics, cache line sharing (either false or true) still matters. Look for some evidence here: http://www.1024cores.net/home/lock-free-algorithms/first-things-first. Thus the answer is: yes, placing atomic variables used by different threads on the same cache line may make the application slower compared to placing them on two different lines. However, I think the effect will mostly go unnoticed unless the application spends a significant portion of its time updating these atomic variables.

answered Oct 03 '22 by Alexey Kukanov


If you use atomic variables with the strongest consistency requirement (sequential consistency, which implies a full memory barrier), the effect of false sharing will probably not be noticeable. For such accesses, the performance of an atomic operation is basically limited by memory access latency. Things are slow anyhow, so I don't think they would get much slower in the presence of false sharing.

If you use other, less intrusive memory orderings, the cost of the atomic operation itself may be lower, so the impact of false sharing might be significant.
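The two extremes discussed above can be sketched in C++ like this (function names are illustrative): a sequentially consistent increment, which is already expensive on its own, versus a relaxed one, which is cheap enough that false-sharing costs would stand out more clearly.

```cpp
#include <atomic>

std::atomic<long> counter{0};

// Strongest ordering: full sequential consistency. Already pays for
// heavyweight synchronization, so false sharing adds relatively little.
void bump_seq_cst() {
    counter.fetch_add(1, std::memory_order_seq_cst);
}

// Relaxed ordering: atomicity only, no ordering guarantees. Much cheaper
// per operation, so cache-line contention becomes the dominant cost.
void bump_relaxed() {
    counter.fetch_add(1, std::memory_order_relaxed);
}
```

Both functions produce the same final count; they differ only in the ordering guarantees and, consequently, in how much of their cost is synchronization versus cache traffic.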

Overall, I would first look at the performance of the atomic operation itself before worrying about false sharing for such operations.

answered Oct 03 '22 by Jens Gustedt