I have two threads running. They share an array. One of the threads adds new elements to the array (and removes them) and the other uses this array (read operations only). Is it necessary for me to lock the array before I add/remove to/from it or read from it? Further details: <ul> <li>I will need to keep iterating over the entire array in the other thread. No write operations over there as previously mentioned. "Just scanning something like a fixed-size circular buffer"</li> <li>The easy thing to do in such cases is to use a lock. However locks can be very slow. I did not want to use locks if their use can be avoided. Also, as it came out from the discussions, it might not be necessary (it actually isn't) to lock all operations on the array. Just locking the management of an iterator for the array (count variable that will be used by the other thread) is enough</li> </ul> I don't think the question is "too broad". If it still comes out to be so, please let me know. I know the question isn't perfect. I had to combine at least 3 answers in order to be able to solve the question - which suggests most people were not able to fully understand all the issues and were forced to do some guess work. But most of it came out through the comments which I have tried to incorporate in the question. The answers helped me solve my problem quite objectively and I think the answers provided here are quite a helpful resource for someone starting out with multithreading.

If two threads perform an operation on the same memory location, and at least one operation is a write operation, you have a so-called data race. According to C11 and C++11, the behaviour of programs with data races is undefined. So, you have to use some kind of synchronization mechanism, for example: <ul> <li>std::atomic</li> <li>std::mutex</li> </ul>

<blockquote> One of the threads adds new elements to the array [...] and the other [reads] this array </blockquote> In order to add and remove elements to/from an array, you will need an index that specifies the last place of the array where the valid data is stored. Such index is necessary, because arrays cannot be resized without potential reallocation (which is a different story altogether). You may also need a second index to mark the initial location from which the reading is allowed. If you have an index or two like this, and assuming that you never re-allocate the array, it is not necessary to lock when you write to the array itself, as long as you lock the writes of valid indexes. <pre class="prettyprint"><code>int lastValid = 0; int shared[MAX]; ... int count = toAddCount; // Add the new data for (int i = lastValid ; count != 0 ; count--, i++) { shared[i] = new_data(...); } // Lock a mutex before modifying lastValid // You need to use the same mutex to protect the read of lastValid variable lock_mutex(lastValid_mutex); lastValid += toAddCount; unlock_mutex(lastValid_mutex); </code></pre> The reason this works is that when you perform writes to <code>shared[]</code> outside the locked region, the reader does not "look" past the <code>lastValid</code> index. Once the writing is complete, you lock the mutex, which normally causes a flush of the CPU cache, so the writes to <code>shared[]</code> would be complete before the reader is allowed to see the data.

Is it necessary to lock an array that is only written to from one thread and only read from another?

Tags:

c++

arrays

c

multithreading

I have two threads running. They share an array. One of the threads adds new elements to the array (and removes them) and the other uses this array (read operations only). Is it necessary for me to lock the array before I add/remove to/from it or read from it?

Further details:

I will need to keep iterating over the entire array in the other thread. No write operations over there as previously mentioned. "Just scanning something like a fixed-size circular buffer"
The easy thing to do in such cases is to use a lock. However locks can be very slow. I did not want to use locks if their use can be avoided. Also, as it came out from the discussions, it might not be necessary (it actually isn't) to lock all operations on the array. Just locking the management of an iterator for the array (count variable that will be used by the other thread) is enough

I don't think the question is "too broad". If it still comes out to be so, please let me know. I know the question isn't perfect. I had to combine at least 3 answers in order to be able to solve the question - which suggests most people were not able to fully understand all the issues and were forced to do some guess work. But most of it came out through the comments which I have tried to incorporate in the question. The answers helped me solve my problem quite objectively and I think the answers provided here are quite a helpful resource for someone starting out with multithreading.

527

asked Jul 10 '14 17:07

Chani

4 Answers

If two threads perform an operation on the same memory location, and at least one operation is a write operation, you have a so-called data race. According to C11 and C++11, the behaviour of programs with data races is undefined.

So, you have to use some kind of synchronization mechanism, for example:

std::atomic
std::mutex

145

answered Oct 20 '22 08:10

nosid

If you are writing and reading from the same location from multiple threads you will need to to perform locking or use atomics. We can see this by looking at the C11 draft standard(The C++11 standard looks almost identical, the equivalent section would be 1.10) says the following in section 5.1.2.4 Multi-threaded executions and data races:

Two expression evaluations conflict if one of them modifies a memory location and the other one reads or modifies the same memory location.

and:

The execution of a program contains a data race if it contains two conflicting actions in different threads, at least one of which is not atomic, and neither happens before the other. Any such data race results in undefined behavior.

and:

Compiler transformations that introduce assignments to a potentially shared memory location that would not be modified by the abstract machine are generally precluded by this standard, since such an assignment might overwrite another assignment by a different thread in cases in which an abstract machine execution would not have encountered a data race. This includes implementations of data member assignment that overwrite adjacent members in separate memory locations. We also generally preclude reordering of atomic loads in cases in which the atomics in question may alias, since this may violate the "visible sequence" rules.

If you were just adding data to the array then in the C++ world a std::atomic index would be sufficient since you can add more elements and then atomically increment the index. But since you want to grow and shrink the array then you will need to use a mutex, in the C++ world std::lock_guard would be a typical choice.

answered Oct 20 '22 07:10

Shafik Yaghmour

To answer your question: maybe.

Simply put, the way that the question is framed doesn't provide enough information about whether or not a lock is required.

In most standard use cases, the answer would be yes. And most of the answers here are covering that case pretty well.

I'll cover the other case.

When would you not need a lock given the information you have provided?

There are some other questions here that would help better define whether you need a lock, whether you can use a lock-free synchronization method, or whether or not you can get away with no explicit synchronization.

Will writing data ever be non-atomic? Meaning, will writing data ever result in "torn data"? If your data is a single 32 bit value on an x86 system, and your data is aligned, then you would have a case where writing your data is already atomic. It's safe to assume that if your data is of any size larger than the size of a pointer (4 bytes on x86, 8 on x64), then your writes cannot be atomic without a lock.

Will the size of your array ever change in a way that requires reallocation? If your reader is walking through your data, will the data suddenly be "gone" (memory has been "delete"d)? Unless your reader takes this into account (unlikely), you'll need a lock if reallocation is possible.

When you write data to your array, is it ok if the reader "sees" old data?

If your data can be written atomically, your array won't suddenly not be there, and it's ok for the reader to see old data... then you won't need a lock. Even with those conditions being met, it would be appropriate to use the built in atomic functions for reading and storing. But, that's a case where you wouldn't need a lock :)

Probably safest to use a lock since you were unsure enough to ask this question. But, if you want to play around with the edge case of where you don't need a lock... there you go :)

answered Oct 20 '22 07:10

Michael Gazonda

One of the threads adds new elements to the array [...] and the other [reads] this array

In order to add and remove elements to/from an array, you will need an index that specifies the last place of the array where the valid data is stored. Such index is necessary, because arrays cannot be resized without potential reallocation (which is a different story altogether). You may also need a second index to mark the initial location from which the reading is allowed.

If you have an index or two like this, and assuming that you never re-allocate the array, it is not necessary to lock when you write to the array itself, as long as you lock the writes of valid indexes.

int lastValid = 0;
int shared[MAX];
...
int count = toAddCount;
// Add the new data
for (int i = lastValid ; count != 0 ; count--, i++) {
    shared[i] = new_data(...);
}
// Lock a mutex before modifying lastValid
// You need to use the same mutex to protect the read of lastValid variable
lock_mutex(lastValid_mutex);
lastValid += toAddCount;
unlock_mutex(lastValid_mutex);

The reason this works is that when you perform writes to shared[] outside the locked region, the reader does not "look" past the lastValid index. Once the writing is complete, you lock the mutex, which normally causes a flush of the CPU cache, so the writes to shared[] would be complete before the reader is allowed to see the data.

answered Oct 20 '22 07:10

Sergey Kalinichenko

Related questions
                            
                                C++ iterating through a set [duplicate]
                            
                                simple openGL program fails to link in ubuntu
                            
                                boost::unit_test case fails because a child process exits with nonzero
                            
                                Export template function
                            
                                how to make a copy of boost::filesystem::directory_iterator?
                            
                                Qt place new window correctly on screen, center over mouse, move into screen
                            
                                Calling a member-function from within overloaded operator delete?
                            
                                gnu c++0x backwards compatibility status - can I just switch it on and go?
                            
                                operator overloading c++ / where to put my code?
                            
                                stl compliant container
                            
                                How to safely store strings (i.e. password) in a C++ application?
                            
                                How to perform advanced search of all process memory from Visual Studio debugger?
                            
                                Boost any_range vs. "canonical form" - what is the latter?
                            
                                What happens to other threads when one thread forks()?
                            
                                Is there a utility to indent C++ programs [closed]
                            
                                Uninstall boost and install another version
                            
                                How can I expose iterators without exposing the container used?
                            
                                C++ vector::clear
                            
                                need to call a function at periodic time intervals in c++
                            
                                Use of external C++ headers in Objective-C

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is it necessary to lock an array that is only written to from one thread and only read from another?

Tags:

c++

arrays

c

multithreading

Chani

People also ask

4 Answers

nosid

Shafik Yaghmour

Michael Gazonda

Sergey Kalinichenko

Recent Activity

Donate For Us

Is it necessary to lock an array that is *only written to* from one thread and *only read from* another?

Tags:

c++

arrays

c

multithreading

Chani

People also ask

4 Answers

nosid

Shafik Yaghmour

Michael Gazonda

Sergey Kalinichenko

Related questions

Recent Activity

Donate For Us

Is it necessary to lock an array that is only written to from one thread and only read from another?