When one worker thread fails, how to abort remaining workers?

Tags:

2 Answers

I've been thinking about your situation for sometime and this maybe of some help to you. You could probably try doing a couple of different methods to achieve you goal. There are 2-3 options that maybe of use or a combination of all three. I will at minimum show the first option for I'm still learning and trying to master the concepts of Template Specializations as well as using Lambdas.

Using a Manager Class
Using Template Specialization Encapsulation
Using Lambdas.

Pseudo code of a Manager Class would look something like this:

class ThreadManager { private:     std::unique_ptr<MainThread> mainThread_;     std::list<std::shared_ptr<WorkerThread> lWorkers_;  // List to hold finished workers     std::queue<std::shared_ptr<WorkerThread> qWorkers_; // Queue to hold inactive and waiting threads.     std::map<unsigned, std::shared_ptr<WorkerThread> mThreadIds_; // Map to associate a WorkerThread with an ID value.     std::map<unsigned, bool> mFinishedThreads_; // A map to keep track of finished and unfinished threads.      bool threadError_; // Not needed if using exception handling public:     explicit ThreadManager( const MainThread& main_thread );      void shutdownThread( const unsigned& threadId );     void shutdownAllThreads();      void addWorker( const WorkerThread& worker_thread );               bool isThreadDone( const unsigned& threadId );      void spawnMainThread() const; // Method to start main thread's work.      void spawnWorkerThread( unsigned threadId, bool& error );      bool getThreadError( unsigned& threadID ); // Returns True If Thread Encountered An Error and passes the ID of that thread,   };

Only for demonstration purposes did I use bool value to determine if a thread failed for simplicity of the structure, and of course this can be substituted to your like if you prefer to use exceptions or invalid unsigned values, etc.

Now to use a class of this sort would be something like this: Also note that a class of this type would be considered better if it was a Singleton type object since you wouldn't want more than 1 ManagerClass since you are working with shared pointers.

SomeClass::SomeClass( ... ) {     // This class could contain a private static smart pointer of this Manager Class     // Initialize the smart pointer giving it new memory for the Manager Class and by passing it a pointer of the Main Thread object     threadManager_ = new ThreadManager( main_thread ); // Wouldn't actually use raw pointers here unless if you had a need to, but just shown for simplicity        }  SomeClass::addThreads( ... ) {     for ( unsigned u = 1, u <= threadCount; u++ ) {          threadManager_->addWorker( some_worker_thread );     } }  SomeClass::someFunctionThatSpawnsThreads( ... ) {     threadManager_->spawnMainThread();      bool error = false;            for ( unsigned u = 1; u <= threadCount; u++ ) {         threadManager_->spawnWorkerThread( u, error );          if ( error ) { // This Thread Failed To Start, Shutdown All Threads             threadManager->shutdownAllThreads();         }     }      // If all threads spawn successfully we can do a while loop here to listen if one fails.     unsigned threadId;     while ( threadManager_->getThreadError( threadId ) ) {          // If the function passed to this while loop returns true and we end up here, it will pass the id value of the failed thread.          // We can now go through a for loop and stop all active threads.          for ( unsigned u = threadID + 1; u <= threadCount; u++ ) {              threadManager_->shutdownThread( u );          }           // We have successfully shutdown all threads          break;     } }

I like the design of manager class since I have used them in other projects, and they come in handy quite often especially when working with a code base that contains many and multiple resources such as a working Game Engine that has many assets such as Sprites, Textures, Audio Files, Maps, Game Items etc. Using a Manager Class helps to keep track and maintain all of the assets. This same concept can be applied to "Managing" Active, Inactive, Waiting Threads, and knows how to intuitively handle and shutdown all threads properly. I would recommend using an ExceptionHandler if your code base and libraries support exceptions as well as thread safe exception handling instead of passing and using bools for errors. Also having a Logger class is good to where it can write to a log file and or a console window to give an explicit message of what function the exception was thrown in and what caused the exception where a log message might look like this:

Exception Thrown: someFunctionNamedThis in ThisFile on Line# (x)     threadID 021342 failed to execute.

This way you can look at the log file and find out very quickly what thread is causing the exception, instead of using passed around bool variables.

answered Sep 16 '22 18:09

Francis Cugler

The implementation of the long-running task is provided by a library whose code I cannot modify.

That means you have no way to synchronize the job done by working threads

If an error occurs in one of the workers,

Let's suppose that you can really detect worker errors; some of then can be easily detected if reported by the used library others cannot i.e.

the library code loops.
the library code prematurely exit with an uncaught exception.

I want the remaining workers to stop **gracefully**

That's just not possible

The best you can do is writing a thread manager checking on worker thread status and if an error condition is detected it just (ungracefully) "kills" all the worker threads and exits.

You should also consider detecting a looped working thread (by timeout) and offer to the user the option to kill or continue waiting for the process to finish.

answered Sep 20 '22 18:09

Pat

Related questions
                            
                                Large Graph Representation in C++
                            
                                boost shared_ptr casting to void*
                            
                                Dynamically create a function pointer that calls a method on a given instance
                            
                                Identify spoken language in the audio files
                            
                                std::vector vs normal array
                            
                                What is operator"" that I saw in GoingNative2012
                            
                                How can I avoid preemption of my thread in user mode
                            
                                boost::any typeid optimization for C++11 [duplicate]
                            
                                Square/cubic root lookup table
                            
                                vector, move semantics, nothrow and g++ 4.7
                            
                                C++ - critical values probability distribution
                            
                                Use fundamental matrix to compute coordinates translation using OpenCV
                            
                                How do I convert a boost::spirit::lex token's value from iterator_range to a string?
                            
                                How to take reliable QGLWidget snapshot
                            
                                Size of a struct containing 1 Pointer
                            
                                return value optimization vs auto_ptr for large vectors
                            
                                Custom allocation using boost singleton_pool slower than default
                            
                                Why has std::accumulate not been made constexpr in C++20?
                            
                                How could I speed up comparison of std::string against string literals?
                            
                                Can an out-of-range enum conversion produce a value outside the underlying type?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

When one worker thread fails, how to abort remaining workers?

Tags:

c++

multithreading

Gareth Stockwell

People also ask

2 Answers

Francis Cugler

Pat

Recent Activity

Donate For Us