I could find the answer if I read a complete chapter/book about multithreading, but I'd like a quicker answer. (I know this stackoverflow question is similar, but not sufficiently.) Assume there is this class: <pre class="prettyprint"><code>public class TestClass { private int someValue; public int getSomeValue() { return someValue; } public void setSomeValue(int value) { someValue = value; } } </code></pre> There are two threads (A and B) that access the instance of this class. Consider the following sequence: <ol> <li>A: getSomeValue()</li> <li>B: setSomeValue()</li> <li>A: getSomeValue()</li> </ol> If I'm right, someValue must be volatile, otherwise the 3rd step might not return the up-to-date value (because A may have a cached value). Is this correct? Second scenario: <ol> <li>B: setSomeValue()</li> <li>A: getSomeValue()</li> </ol> In this case, A will always get the correct value, because this is its first access so he can't have a cached value yet. Is this right? If a class is accessed only in the second way, there is no need for volatile/synchronization, or is it? Note that this example was simplified, and actually I'm wondering about particular member variables and methods in a complex class, and not about whole classes (i.e. which variables should be volatile or have synced access). The main point is: if more threads access certain data, is synchronized access needed by all means, or does it depend on the way (e.g. order) they access it? <hr> After reading the comments, I try to present the source of my confusion with another example: <ol> <li>From UI thread: <code>threadA.start()</code> </li> <li>threadA calls <code>getSomeValue()</code>, and informs the UI thread</li> <li>UI thread gets the message (in its message queue), so it calls: <code>threadB.start()</code> </li> <li>threadB calls <code>setSomeValue()</code>, and informs the UI thread</li> <li>UI thread gets the message, and informs threadA (in some way, e.g. message queue)</li> <li>threadA calls <code>getSomeValue()</code> </li> </ol> This is a totally synchronized structure, but why does this imply that threadA will get the most up-to-date value in step 6? (if <code>someValue</code> is not volatile, or not put into a monitor when accessed from anywhere)

As we all know, that its the crucial state of the data that we need to protect, and the atomic statements which govern the crucial state of the data must be Synchronized. I had this example, where is used volatile, and then i used 2 threads which used to increment the value of a counter by 1 each time till 10000. So it must be a total of 20000. but to my surprise it didnt happened always. Then i used synchronized keyword to make it work. Synchronization makes sure that when a thread is accessing the synchronized method, no other thread is allowed to access this or any other synchronized method of that object, making sure that data corruption is not done. Thread-Safe class means that it will maintain its correctness in the presence of the scheduling and interleaving of the underlining Runtime environment, without any thread-safe mechanism from the Client side, which access that class.

Let's look at the book. <blockquote> A field may be declared volatile, in which case the Java memory model (§17) ensures that all threads see a consistent value for the variable. </blockquote> So <code>volatile</code> is a guarantee that the declared variable won't be copied into thread local storage, which is otherwise allowed. It's further explained that this is an intentional alternative to locking for very simple kinds of synchronized access to shared storage. Also see this earlier article, which explains that <code>int</code> access is necessarily atomic (but not <code>double</code> or <code>long</code>). These together mean that if your <code>int</code> field is declared <code>volatile</code> then no locks are necessary to guarantee atomicity: you will always see a value that was last written to the memory location, not some confused value resulting from a half-complete write (as is possible with double or long). However you seem to imply that your getters and setters themselves are atomic. This is not guaranteed. The JVM can interrupt execution at intermediate points of during the call or return sequence. In this example, this has no consequences. But if the calls had side effects, e.g. <code>setSomeValue(++val)</code>, then you would have a different story.

Multithreaded access and variable cache of threads

Tags:

java

android

multithreading

volatile

I could find the answer if I read a complete chapter/book about multithreading, but I'd like a quicker answer. (I know this stackoverflow question is similar, but not sufficiently.)

Assume there is this class:

public class TestClass {
   private int someValue;

   public int getSomeValue() { return someValue; }
   public void setSomeValue(int value) {  someValue = value; }
}

There are two threads (A and B) that access the instance of this class. Consider the following sequence:

A: getSomeValue()
B: setSomeValue()
A: getSomeValue()

If I'm right, someValue must be volatile, otherwise the 3rd step might not return the up-to-date value (because A may have a cached value). Is this correct?

Second scenario:

B: setSomeValue()
A: getSomeValue()

In this case, A will always get the correct value, because this is its first access so he can't have a cached value yet. Is this right?

If a class is accessed only in the second way, there is no need for volatile/synchronization, or is it?

Note that this example was simplified, and actually I'm wondering about particular member variables and methods in a complex class, and not about whole classes (i.e. which variables should be volatile or have synced access). The main point is: if more threads access certain data, is synchronized access needed by all means, or does it depend on the way (e.g. order) they access it?

After reading the comments, I try to present the source of my confusion with another example:

From UI thread: threadA.start()
threadA calls getSomeValue(), and informs the UI thread
UI thread gets the message (in its message queue), so it calls: threadB.start()
threadB calls setSomeValue(), and informs the UI thread
UI thread gets the message, and informs threadA (in some way, e.g. message queue)
threadA calls getSomeValue()

This is a totally synchronized structure, but why does this imply that threadA will get the most up-to-date value in step 6? (if someValue is not volatile, or not put into a monitor when accessed from anywhere)

859

asked Jun 29 '12 22:06

Thomas Calc

4 Answers

If two threads are calling the same methods, you can't make any guarantees about the order that said methods are called. Consequently, your original premise, which depends on calling order, is invalid.

It's not about the order in which the methods are called; it's about synchronization. It's about using some mechanism to make one thread wait while the other fully completes its write operation. Once you've made the decision to have more than one thread, you must provide that synchronization mechanism to avoid data corruption.

182

answered Oct 13 '22 06:10

Robert Harvey

As we all know, that its the crucial state of the data that we need to protect, and the atomic statements which govern the crucial state of the data must be Synchronized.

I had this example, where is used volatile, and then i used 2 threads which used to increment the value of a counter by 1 each time till 10000. So it must be a total of 20000. but to my surprise it didnt happened always.

Then i used synchronized keyword to make it work.

Synchronization makes sure that when a thread is accessing the synchronized method, no other thread is allowed to access this or any other synchronized method of that object, making sure that data corruption is not done.

Thread-Safe class means that it will maintain its correctness in the presence of the scheduling and interleaving of the underlining Runtime environment, without any thread-safe mechanism from the Client side, which access that class.

answered Oct 13 '22 07:10

Kumar Vivek Mitra

Let's look at the book.

A field may be declared volatile, in which case the Java memory model (§17) ensures that all threads see a consistent value for the variable.

So volatile is a guarantee that the declared variable won't be copied into thread local storage, which is otherwise allowed. It's further explained that this is an intentional alternative to locking for very simple kinds of synchronized access to shared storage.

Also see this earlier article, which explains that int access is necessarily atomic (but not double or long).

These together mean that if your int field is declared volatile then no locks are necessary to guarantee atomicity: you will always see a value that was last written to the memory location, not some confused value resulting from a half-complete write (as is possible with double or long).

However you seem to imply that your getters and setters themselves are atomic. This is not guaranteed. The JVM can interrupt execution at intermediate points of during the call or return sequence. In this example, this has no consequences. But if the calls had side effects, e.g. setSomeValue(++val), then you would have a different story.

answered Oct 13 '22 06:10

Gene

The issue is that java is simply a specification. There are many JVM implementations and examples of physical operating environments. On any given combination an an action may be safe or unsafe. For instance On single processor systems the volatile keyword in your example is probably completely unnecessary. Since the writers of the memory and language specifications can't reasonably account for possible sets of operating conditions, they choose to white-list certain patterns that are guaranteed to work on all compliant implementations. Adhering to to these guidelines ensures both that your code will work on your target system and that it will be reasonably portable.

In this case "caching" typically refers to activity that is going on at the hardware level. There are certain events that occur in java that cause cores on a multi processor systems to "Synchronize" their caches. Accesses to volatile variables are an example of this, synchronized blocks are another. Imagine a scenario where these two threads X and Y are scheduled to run on different processors.

X starts and is scheduled on proc 1
y starts and is scheduled on proc 2

.. now you have two threads executing simultaneously
to speed things up the processors check local caches
before going to main memory because its expensive.

x calls setSomeValue('x-value') //assuming proc 1's cache is empty the cache is set
                                //this value is dropped on the bus to be flushed
                                //to main memory
                                //now all get's will retrieve from cache instead
                                //of engaging the memory bus to go to main memory 
y calls setSomeValue('y-value') //same thing happens for proc 2

//Now in this situation depending on to order in which things are scheduled and
//what thread you are calling from calls to getSomeValue() may return 'x-value' or
//'y-value. The results are completely unpredictable.

The point is that volatile(on compliant implementations) ensures that ordered writes will always be flushed to main memory and that other processor's caches will be flagged as 'dirty' before the next access regardless of the thread from which that access occurs.

disclaimer: volatile DOES NOT LOCK. This is important especially in the following case:

volatile int counter;

public incrementSomeValue(){
    counter++; // Bad thread juju - this is at least three instructions 
               // read - increment - write             
               // there is no guarantee that this operation is atomic
}

this could be relevant to your question if your intent is that setSomeValue must always be called before getSomeValue

If the intent is that getSomeValue() must always reflect the most recent call to setSomeValue() then this is a good place for the use of the volatile keyword. Just remember that without it there is no guarantee that getSomeValue() will reflect to most recent call to setSomeValue() even if setSomeValue() was scheduled first.

answered Oct 13 '22 07:10

nsfyn55

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Multithreaded access and variable cache of threads

Tags:

java

android

multithreading

volatile

Thomas Calc

People also ask

4 Answers

Robert Harvey

Kumar Vivek Mitra

Gene

nsfyn55

Recent Activity

Donate For Us

Multithreaded access and variable cache of threads

Tags:

java

android

multithreading

volatile

Thomas Calc

People also ask

4 Answers

Robert Harvey

Kumar Vivek Mitra

Gene

nsfyn55

Related questions

Recent Activity

Donate For Us