Wait for a detached thread to finish in C++

Tags:

c++

multithreading

pthreads

corba

How can I wait for a detached thread to finish in C++?

I don't care about an exit status, I just want to know whether or not the thread has finished.

I'm trying to provide a synchronous wrapper around an asynchronous third-party tool. The problem is a weird race-condition crash involving a callback. The progression is:

1. I call the third-party tool and register a callback.
2. When the third-party tool finishes, it notifies me using the callback -- in a detached thread I have no real control over.
3. I want the thread from (1) to wait until (2) is called.

I want to wrap this in a mechanism that provides a blocking call. So far, I have:

class Wait {
  public:
  void callback() {
    pthread_mutex_lock(&m_mutex);
    m_done = true;
    pthread_cond_broadcast(&m_cond);
    pthread_mutex_unlock(&m_mutex);
  }

  void wait() {
    pthread_mutex_lock(&m_mutex);
    while (!m_done) {
      pthread_cond_wait(&m_cond, &m_mutex);
    }
    pthread_mutex_unlock(&m_mutex);
  }

  private:
  pthread_mutex_t m_mutex;
  pthread_cond_t  m_cond;
  bool            m_done;
};

// elsewhere...
Wait waiter;
thirdparty_utility(&waiter);
waiter.wait();

As far as I can tell, this should work, and it usually does, but sometimes it crashes. Judging from the core file, my guess as to the problem is this:

1. When the callback sets m_done and broadcasts on the condition variable, the wait thread wakes up.
2. The wait thread is now done here, and Wait is destroyed. All of Wait's members are destroyed, including the mutex and cond.
3. The callback thread tries to continue from the broadcast point, but is now using memory that's been released, which results in memory corruption.
4. When the callback thread tries to return (above the level of my poor callback method), the program crashes (usually with a SIGSEGV, but I've seen SIGILL a couple of times).

I've tried a lot of different mechanisms to try to fix this, but none of them solve the problem. I still see occasional crashes.

EDIT: More details:

This is part of a massively multithreaded application, so creating a static Wait isn't practical.

I ran a test, creating Wait on the heap, and deliberately leaking the memory (i.e. the Wait objects are never deallocated), and that resulted in no crashes. So I'm sure it's a problem of Wait being deallocated too soon.

I've also tried a test with a sleep(5) after the unlock in wait, and that also produced no crashes. I hate to rely on a kludge like that though.

EDIT: ThirdParty details:

I didn't think this was relevant at first, but the more I think about it, the more I think it's the real problem:

The third-party code I mentioned, and the reason I have no control over the thread, is CORBA.

So, it's possible that CORBA is holding onto a reference to my object longer than intended.

429

asked Nov 15 '09 02:11

Tim

1 Answer

Yes, I believe that what you're describing is happening (a race condition on deallocation). One quick way to fix this is to create a static instance of Wait, one that won't get destroyed. This will work as long as you don't need to have more than one waiter at the same time.

You will also permanently use that memory, since it will never be deallocated. But it doesn't look like that's too bad.

The main issue is that it's hard to coordinate lifetimes of your thread communication constructs between threads: you will always need at least one leftover communication construct to communicate when it is safe to destroy (at least in languages without garbage collection, like C++).

EDIT: See comments for some ideas about refcounting with a global mutex.
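
The refcounting idea could look roughly like the sketch below. It is only an illustration under several assumptions: it uses the C++11 standard library rather than the raw pthreads calls from the question, `g_ref_mutex`, `g_live`, `fake_thirdparty_utility` and `blocking_call` are hypothetical names invented for the example, and the real third-party code is assumed not to touch the object after the callback returns (which may not hold for CORBA). The Wait object lives on the heap with two references, one per thread, and whichever thread drops the last reference deletes it. The global mutex is the "leftover" construct that outlives every Wait.

```cpp
#include <atomic>
#include <cassert>
#include <chrono>
#include <condition_variable>
#include <mutex>
#include <thread>

// Global mutex guarding every Wait's reference count. It outlives all
// Wait objects, so locking it is safe from either thread.
static std::mutex g_ref_mutex;
// Hypothetical counter of live Wait objects, used only to check for leaks.
static std::atomic<int> g_live(0);

class Wait {
public:
    // Heap-only: the object starts with two owners (waiter and callback).
    static Wait* create() { return new Wait(); }

    void callback() {
        {
            std::lock_guard<std::mutex> lock(m_mutex);
            m_done = true;
            m_cond.notify_all();
        }   // m_mutex is fully unlocked before this thread drops its reference
        release();
    }

    void wait() {
        {
            std::unique_lock<std::mutex> lock(m_mutex);
            m_cond.wait(lock, [this] { return m_done; });
        }
        release();   // the object may be gone once this returns
    }

private:
    Wait() : m_done(false), m_refs(2) { ++g_live; }
    ~Wait() { --g_live; }

    void release() {
        bool last;
        {
            std::lock_guard<std::mutex> lock(g_ref_mutex);
            assert(m_refs > 0);
            last = (--m_refs == 0);
        }
        if (last) delete this;   // only the final owner frees the object
    }

    std::mutex m_mutex;
    std::condition_variable m_cond;
    bool m_done;
    int m_refs;
};

// Hypothetical stand-in for the third-party call: it invokes the callback
// from a detached thread after a short delay.
void fake_thirdparty_utility(Wait* waiter) {
    std::thread([waiter] {
        std::this_thread::sleep_for(std::chrono::milliseconds(5));
        waiter->callback();
    }).detach();
}

// Blocking wrapper: returns only after the callback has set m_done.
void blocking_call() {
    Wait* waiter = Wait::create();
    fake_thirdparty_utility(waiter);
    waiter->wait();   // do not use waiter after this line
}
```

Because neither thread destroys the object until both have released it, the waiting thread can return immediately without a sleep, and nothing is leaked. A `std::atomic<int>` reference count would work as well as the global mutex here; the mutex is kept only to mirror the suggestion above.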

138

answered Oct 06 '22 01:10

Adam Goode
