I've been looking at Martin Thompson's article, which is an explanation of false sharing.
http://mechanical-sympathy.blogspot.co.uk/2011/07/false-sharing.html
public final class FalseSharing
    implements Runnable
{
    public final static int NUM_THREADS = 4; // change
    public final static long ITERATIONS = 500L * 1000L * 1000L;
    private final int arrayIndex;

    private static VolatileLong[] longs = new VolatileLong[NUM_THREADS];
    static
    {
        for (int i = 0; i < longs.length; i++)
        {
            longs[i] = new VolatileLong();
        }
    }

    public FalseSharing(final int arrayIndex)
    {
        this.arrayIndex = arrayIndex;
    }

    public static void main(final String[] args) throws Exception
    {
        final long start = System.nanoTime();
        runTest();
        System.out.println("duration = " + (System.nanoTime() - start));
    }

    private static void runTest() throws InterruptedException
    {
        Thread[] threads = new Thread[NUM_THREADS];
        for (int i = 0; i < threads.length; i++)
        {
            threads[i] = new Thread(new FalseSharing(i));
        }
        for (Thread t : threads)
        {
            t.start();
        }
        for (Thread t : threads)
        {
            t.join();
        }
    }

    public void run()
    {
        long i = ITERATIONS + 1;
        while (0 != --i)
        {
            longs[arrayIndex].value = i;
        }
    }

    public final static class VolatileLong
    {
        public volatile long value = 0L;
        public long p1, p2, p3, p4, p5, p6; // comment out
    }
}
The example demonstrates the slowdown experienced when multiple threads invalidate each other's cache lines, even though each thread is exclusively updating a single variable.
Figure 1 above illustrates the issue of false sharing. A thread running on core 1 wants to update variable X while a thread on core 2 wants to update variable Y. Unfortunately these two hot variables reside in the same cache line. Each thread will race for ownership of the cache line so they can update it. If core 1 gets ownership then the cache sub-system will need to invalidate the corresponding cache line for core 2. When core 2 gets ownership and performs its update, then core 1 will be told to invalidate its copy of the cache line. This will ping-pong back and forth via the L3 cache, greatly impacting performance. The issue would be further exacerbated if competing cores are on different sockets and additionally have to cross the socket interconnect.
My question is the following. If all the variables being updated are volatile, why does this padding cause a performance increase? My understanding is that a volatile variable always writes and reads through to main memory. Therefore I'd assume that every write and read to any variable in this example will result in a flush of the current core's cache line.
So, according to my understanding, if thread one invalidates thread two's cache line, this will not become apparent to thread two until it goes to read a value from its own cache line. The value it's reading is volatile, so this effectively renders the cache dirty anyway, resulting in a read from main memory.
Where have I gone wrong in my understanding?
Thanks
False sharing occurs when threads on different processors modify variables that reside on the same cache line.
In general, false sharing can be reduced using the following techniques (a sketch of the first one follows below):
- Make use of private or threadprivate data as much as possible.
- Use the compiler's optimization features to eliminate memory loads and stores.
- Pad data structures so that each thread's data resides on a different cache line.
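As an illustration of the first technique (a sketch of my own, not from the article; the class and field names are made up), each thread can accumulate into a thread-private local variable and write to shared memory only once when it finishes, so nothing is contended inside the hot loop:

// Sketch (hypothetical names): keep the hot data thread-private so the
// inner loop never touches a shared cache line.
public class PrivateAccumulation implements Runnable
{
    private static final int NUM_THREADS = 4;
    private static final long ITERATIONS = 500L * 1000L * 1000L;
    private static final long[] results = new long[NUM_THREADS];

    private final int index;

    public PrivateAccumulation(final int index)
    {
        this.index = index;
    }

    public void run()
    {
        long local = 0L;                 // thread-private, typically kept in a register
        for (long i = 0; i < ITERATIONS; i++)
        {
            local += i;                  // no shared memory written inside the loop
        }
        results[index] = local;          // one write to shared memory at the end
    }

    public static void main(final String[] args) throws InterruptedException
    {
        final Thread[] threads = new Thread[NUM_THREADS];
        for (int i = 0; i < threads.length; i++)
        {
            threads[i] = new Thread(new PrivateAccumulation(i));
            threads[i].start();
        }
        for (final Thread t : threads)
        {
            t.join();
        }
        System.out.println("results[0] = " + results[0]);
    }
}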
"false sharing" is something that happens in (some) cache systems when two threads (or rather two cores) writes to two different variables that belongs to the same cache line.
Unlike synchronized methods or blocks, volatile does not make other threads wait while one thread is working on a critical section. Therefore, the volatile keyword does not provide thread safety when non-atomic or composite operations are performed on shared variables.
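To illustrate that point (a minimal sketch of my own, not from the article), a volatile counter incremented from two threads will usually lose updates, because ++ is a read-modify-write and volatile only guarantees visibility, not atomicity:

// Sketch (hypothetical class): volatile guarantees visibility, not atomicity,
// so a concurrent ++ on a volatile field can lose updates.
public class VolatileNotAtomic
{
    private static volatile long counter = 0L;

    public static void main(final String[] args) throws InterruptedException
    {
        final Runnable increment = () ->
        {
            for (int i = 0; i < 1_000_000; i++)
            {
                counter++;               // read-modify-write; two threads can interleave here
            }
        };
        final Thread t1 = new Thread(increment);
        final Thread t2 = new Thread(increment);
        t1.start();
        t2.start();
        t1.join();
        t2.join();
        // Usually prints a value well below 2,000,000.
        System.out.println("counter = " + counter);
    }
}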
If all the variables being updated are volatile, why does this padding cause a performance increase?
So there are two things going on here:

1. There are multiple VolatileLong objects, with each thread working on its own VolatileLong. (See private final int arrayIndex.)
2. Each VolatileLong object has a single volatile field.

The volatile access means that the threads have to both invalidate the cache "line" that holds their volatile long value and lock that cache line to update it. As the article states, a cache line is typically ~64 bytes or so.
The article is saying that by adding padding to the VolatileLong object, it moves the object that each of the threads is locking into a different cache line. So even though the different threads are still crossing memory barriers as they assign their volatile long value, they are in different cache lines and so won't consume excessive L2 cache bandwidth.
In summary, the performance increase happens because even though the threads are still locking their cache line to update the volatile
field, these locks are now on different memory blocks and so they are not clashing with the other threads' locks and causing cache invalidations.
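As a rough back-of-the-envelope check (my own numbers, assuming a 64-bit HotSpot JVM with a ~16-byte object header and 64-byte cache lines; exact layout and sizes vary by JVM, flags and hardware):

public final static class VolatileLong
{
    public volatile long value = 0L;     //  8 bytes
    public long p1, p2, p3, p4, p5, p6;  // 48 bytes of padding
    // ~16-byte object header + 8 + 48 ≈ 72 bytes, which is larger than one
    // 64-byte cache line, so two VolatileLong instances cannot land on the
    // same line. Without p1..p6 the object is only ~24 bytes, so two or three
    // instances (and their hot 'value' fields) can share a single line and
    // falsely share.
}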