I recently read a book about system software. There is an example in it that I don't understand. <pre class="prettyprint"><code>volatile T* pInst = 0; T* GetInstance() { if (pInst == NULL) { lock(); if (pInst == NULL) pInst = new T; unlock(); } return pInst; } </code></pre> Why does the author check <code>(pInst == NULL)</code> twice?

I assume <code>lock()</code> is costly operation. I also assume that read on <code>T*</code> pointers is done atomically on this platform, so you don't need to lock simple comparisons <code>pInst == NULL</code>, as the load operation of <code>pInst</code> value will be ex. a single assembly instruction on this platform. Assuming that: If <code>lock()</code> is a costly operation, it's best not to execute it, if we don't have to. So first we check if <code>pInst == NULL</code>. This will be a single assembly instruction, so we don't need to <code>lock()</code> it. If <code>pInst == NULL</code>, we need to modify it's value, allocate new <code>pInst = new ...</code>. But - imagine a situation, where 2 (or more) threads are right in the point between first <code>pInst == NULL</code> and right before <code>lock()</code>. Both threads will to <code>pInst = new</code>. They already checked the first <code>pInst == NULL</code> and for both of them it was true. The first (any) thread starts it's execution and does <code>lock(); pInst = new T; unlock()</code>. Then the second thread waiting on <code>lock()</code> starts it's execution. When it starts, <code>pInst != NULL</code>, because another thread allocated that. So we need to check it <code>pInst == NULL</code> inside <code>lock()</code> again, so that memory is not leaked and <code>pInst</code> overwritten..

What is the reason for double NULL check of pointer for mutex lock

Tags:

c++

if-statement

locking

I recently read a book about system software. There is an example in it that I don't understand.

volatile T* pInst = 0;
T* GetInstance()
{
  if (pInst == NULL)
  {
   lock();
   if (pInst == NULL)
     pInst = new T;
   unlock();
  }
  return pInst;
}

Why does the author check (pInst == NULL) twice?

267

asked Jun 04 '19 09:06

BigDongle

2 Answers

When two threads try call GetInstance() for the first time at the same time, both will see pInst == NULL at the first check. One thread will get the lock first, which allows it to modify pInst.

The second thread will wait for the lock to get available. When the first thread releases the lock, the second will get it, and now the value of pInst has already been modified by the first thread, so the second one doesn't need to create a new instance.

Only the second check between lock() and unlock() is safe. It would work without the first check, but it would be slower because every call to GetInstance() would call lock() and unlock(). The first check avoids unnecessary lock() calls.

volatile T* pInst = 0;
T* GetInstance()
{
  if (pInst == NULL) // unsafe check to avoid unnecessary and maybe slow lock()
  {
   lock(); // after this, only one thread can access pInst
   if (pInst == NULL) // check again because other thread may have modified it between first check and returning from lock()
     pInst = new T;
   unlock();
  }
  return pInst;
}

See also https://en.wikipedia.org/wiki/Double-checked_locking (copied from interjay's comment).

Note: This implementation requires that both read and write accesses to volatile T* pInst are atomic. Otherwise the second thread may read a partially written value just being written by the first thread. For modern processors, accessing a pointer value (not the data being pointed to) is an atomic operation, although not guaranteed for all architectures.

If access to pInst was not atomic, the second thread may read a partially written non-NULL value when checking pInst before getting the lock and then may execute return pInst before the first thread has finished its operation, which would result in returning a wrong pointer value.

answered Nov 08 '22 09:11

Bodo

I assume lock() is costly operation. I also assume that read on T* pointers is done atomically on this platform, so you don't need to lock simple comparisons pInst == NULL, as the load operation of pInst value will be ex. a single assembly instruction on this platform.

Assuming that: If lock() is a costly operation, it's best not to execute it, if we don't have to. So first we check if pInst == NULL. This will be a single assembly instruction, so we don't need to lock() it. If pInst == NULL, we need to modify it's value, allocate new pInst = new ....

But - imagine a situation, where 2 (or more) threads are right in the point between first pInst == NULL and right before lock(). Both threads will to pInst = new. They already checked the first pInst == NULL and for both of them it was true.

The first (any) thread starts it's execution and does lock(); pInst = new T; unlock(). Then the second thread waiting on lock() starts it's execution. When it starts, pInst != NULL, because another thread allocated that. So we need to check it pInst == NULL inside lock() again, so that memory is not leaked and pInst overwritten..

answered Nov 08 '22 07:11

KamilCuk

Related questions
                            
                                Is there a way to write make_unique() in VS2012?
                            
                                How to output a percent sign itself using boost.format?
                            
                                Is lambda comparison deterministic?
                            
                                Using Boost Python & std::shared_ptr
                            
                                to system() or fork()/exec()?
                            
                                uint8_t iostream behavior
                            
                                Creating dynamic type in C++
                            
                                C++11 'native_handle' is not a member of 'std::this_thread'
                            
                                Boost log, GCC 4.4 and CMake
                            
                                Template class constructor [duplicate]
                            
                                unexpected copies with foreach over a map
                            
                                Cannot run qmake in Mac Terminal
                            
                                Correct way to return an rvalue reference to this
                            
                                Example why someone should use triple-pointers in C/C++?
                            
                                How to detect -stdlib=libc++ in the preprocessor?
                            
                                Static Variables Initialization Quiz
                            
                                Server-side warning: Aggregation query used without partition key
                            
                                Is it legal to use #elif with #ifdef?
                            
                                Why does std::bitset suggest more available bits than sizeof says there are?
                            
                                Why is implicit conversion not ambiguous for non-primitive types?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With