return() versus pthread_exit() in pthread start functions

The following program shows that we can use return or pthread_exit to return a void* variable that is available to pthread_join's status variable.

1. Should there be a preference for using one over the other?
2. Why does using return work? Normally we think of return putting a value on the stack but since the thread is completed the stack should vanish. Or does the stack not get destroyed until after pthread_join?
3. In your work, do you see much use of the status variable? It seems 90% of the code I see just NULLs out the status parameter. Since anything changed via the void* ptr is already reflected in the calling thread there doesn't seem much point to returning it. Any new void* ptr returned would have to point to something malloced by the start thread, which leaves the receiving thread with the responsibility to dispose of it. Am I wrong in thinking the status variable is semi-pointless?

Here is the code:

    #include <iostream>
    #include <string>
    #include <pthread.h>

    using namespace std;

    struct taskdata
    {
        int    x;
        float  y;
        string z;
    };

    void* task1(void *data)
    {
        taskdata *t = (taskdata *) data;

        t->x += 25;
        t->y -= 4.5;
        t->z = "Goodbye";

        return(data);
    }

    void* task2(void *data)
    {
        taskdata *t = (taskdata *) data;

        t->x -= 25;
        t->y += 4.5;
        t->z = "World";

        pthread_exit(data);
    }

    int main(int argc, char *argv[])
    {
        pthread_t threadID;

        taskdata t = {10, 10.0, "Hello"};

        void *status;

        cout << "before " << t.x << " " << t.y << " " << t.z << endl;

        //by return()
        pthread_create(&threadID, NULL, task1, (void *) &t);
        pthread_join(threadID, &status);

        taskdata *ts = (taskdata *) status;
        cout << "after task1 " << ts->x << " " << ts->y << " " << ts->z << endl;

        //by pthread_exit()
        pthread_create(&threadID, NULL, task2, (void *) &t);
        pthread_join(threadID, &status);

        ts = (taskdata *) status;
        cout << "after task2 " << ts->x << " " << ts->y << " " << ts->z << endl;
    }

With output of:

    before 10 10 Hello
    after task1 35 5.5 Goodbye
    after task2 10 10 World

494

asked Sep 11 '10 20:09

ValenceElectron

2 Answers

(1) In C++ code, using return causes the stack to be unwound and local variables destroyed, whereas pthread_exit is only guaranteed to invoke the cancellation cleanup handlers registered with pthread_cleanup_push(). On some systems this mechanism will also cause the destructors for C++ local variables to be called, but this is not guaranteed for portable code, so check your platform documentation.

Also, in main(), return will implicitly call exit(), and thus terminate the program, whereas pthread_exit() will merely terminate the thread, and the program will remain running until all threads have terminated or some thread calls exit(), abort() or another function that terminates the program.
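To make the difference in (1) concrete, here is a minimal sketch rather than a definitive implementation. The names Guard, Flags, mark_handler and run_and_join are made up for illustration. One start routine leaves with return and the other with pthread_exit, and each records which cleanup mechanism fired.

```cpp
#include <pthread.h>
#include <cassert>

// Hypothetical record of which cleanup mechanisms ran in the thread.
struct Flags {
    bool destructor_ran = false;
    bool handler_ran = false;
};

// Hypothetical RAII helper: sets a flag when its destructor runs.
struct Guard {
    bool* flag;
    explicit Guard(bool* f) : flag(f) {}
    ~Guard() { *flag = true; }
};

// Cleanup handler in the form expected by pthread_cleanup_push().
void mark_handler(void* arg) { *static_cast<bool*>(arg) = true; }

// Leaves via return: the local Guard is destroyed by ordinary C++ rules.
void* leave_by_return(void* arg) {
    Flags* f = static_cast<Flags*>(arg);
    Guard g(&f->destructor_ran);
    return arg;
}

// Leaves via pthread_exit: POSIX guarantees that the cleanup handler runs.
// Whether ~Guard also runs is platform-specific (assumption: do not rely on it).
void* leave_by_pthread_exit(void* arg) {
    Flags* f = static_cast<Flags*>(arg);
    Guard g(&f->destructor_ran);
    pthread_cleanup_push(mark_handler, &f->handler_ran);
    pthread_exit(arg);
    pthread_cleanup_pop(0);  // never reached, but push and pop must be paired
    return nullptr;
}

// Runs fn in a new thread, joins it, and reports what was cleaned up.
Flags run_and_join(void* (*fn)(void*)) {
    Flags f;
    pthread_t tid;
    int rc = pthread_create(&tid, nullptr, fn, &f);
    assert(rc == 0);
    rc = pthread_join(tid, nullptr);
    assert(rc == 0);
    return f;
}
```

After run_and_join(leave_by_return) the destructor_ran flag is set, and after run_and_join(leave_by_pthread_exit) the handler_ran flag is set. The value of destructor_ran in the second case is exactly the part that depends on the platform.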

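(2) The use of return works because the POSIX specification says so. The returned value is a pointer passed back by value, not storage on the thread's stack, and the implementation keeps it in a place where pthread_join() can retrieve it. For a joinable thread, the resources used by the thread are not fully reclaimed until pthread_join() is called.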
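As a rough illustration of (2), the sketch below returns a pointer to heap storage from the start routine and picks it up with pthread_join(). Result, sum_to_n and run_sum are hypothetical names. It also shows the ownership hand-off raised in the question: the joining thread has to free what the worker allocated.

```cpp
#include <pthread.h>
#include <cassert>

// Hypothetical result type: allocated by the worker, owned by the joiner.
struct Result {
    int sum;
};

// Start routine: adds the numbers 1..n, where n is passed by pointer.
void* sum_to_n(void* arg) {
    int n = *static_cast<int*>(arg);
    Result* r = new Result{0};
    for (int i = 1; i <= n; ++i) r->sum += i;
    // Only the pointer value travels back; the Result itself lives on the
    // heap, so it outlives the worker's stack.
    return r;
}

// Creates the worker, joins it, and takes ownership of its result.
int run_sum(int n) {
    pthread_t tid;
    int rc = pthread_create(&tid, nullptr, sum_to_n, &n);
    assert(rc == 0);

    void* status = nullptr;
    rc = pthread_join(tid, &status);  // status receives the returned pointer
    assert(rc == 0);

    Result* r = static_cast<Result*>(status);
    int sum = r->sum;
    delete r;  // the joiner must dispose of what the worker allocated
    return sum;
}
```

For example, run_sum(10) yields 55.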

(3) I never use the return value of a thread in raw POSIX threads. However, I tend to use higher level facilities such as the Boost thread library, and more recently the C++0x thread library, which provide alternative means for transferring values between threads such as futures, which avoid the problems associated with memory management that you allude to.
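For comparison, this is roughly what the future-based approach mentioned in (3) looks like with the standard library that grew out of C++0x. It is a sketch with made-up function names (make_greeting, greet_async), not code from the answer. The result is moved out of the worker by value, so no void* cast or manual deallocation is involved.

```cpp
#include <cassert>
#include <future>
#include <string>

// Hypothetical task: runs on another thread and returns its result by value.
std::string make_greeting(const std::string& name) {
    return "Hello, " + name;
}

// Launches the task asynchronously and waits for its result.
std::string greet_async(const std::string& name) {
    // std::launch::async requests that the task run on its own thread.
    std::future<std::string> fut =
        std::async(std::launch::async, make_greeting, name);
    assert(fut.valid());
    return fut.get();  // blocks until the task finishes, then moves the value out
}
```

Calling greet_async("World") returns the string "Hello, World".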

184

answered Oct 23 '22 10:10

Anthony Williams

I think that return from the start_routine is preferable, because it ensures that the call stack is properly unwound.

This is even more important for C than C++, since C doesn't have the destructor magic that cleans up the mess after premature exits. So your code should pass through the final parts of every routine on the call stack, so that it can call free and the like.

As for why this works, the answer is simple. The POSIX specification states:

If the start_routine returns, the effect shall be as if there was an implicit call to pthread_exit() using the return value of start_routine as the exit status

In my personal experience, I tend not to use the status of terminated threads much. This is why I often start threads detached. But this depends heavily on the application and certainly cannot be generalized.
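Starting a thread detached might look like the sketch below. Since a detached thread cannot be joined, its exit status is never seen, so the result has to travel some other way. Here that is a hypothetical Mailbox struct guarded by a mutex and condition variable. The names Mailbox, worker and run_detached, and the value 42, are placeholders for illustration.

```cpp
#include <pthread.h>
#include <cassert>

// Hypothetical mailbox through which a detached thread reports its result.
struct Mailbox {
    pthread_mutex_t lock;
    pthread_cond_t ready;
    bool done;
    int value;
};

// Start routine: publishes a result and signals the waiting thread.
void* worker(void* arg) {
    Mailbox* box = static_cast<Mailbox*>(arg);
    pthread_mutex_lock(&box->lock);
    box->value = 42;  // placeholder result
    box->done = true;
    pthread_cond_signal(&box->ready);
    pthread_mutex_unlock(&box->lock);
    return nullptr;  // discarded: nobody can join a detached thread
}

// Starts the worker detached and waits for it via the mailbox.
int run_detached() {
    Mailbox box;
    pthread_mutex_init(&box.lock, nullptr);
    pthread_cond_init(&box.ready, nullptr);
    box.done = false;
    box.value = 0;

    pthread_attr_t attr;
    pthread_attr_init(&attr);
    pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_DETACHED);

    pthread_t tid;
    int rc = pthread_create(&tid, &attr, worker, &box);
    assert(rc == 0);
    pthread_attr_destroy(&attr);

    // Wait until the worker has published its result.
    pthread_mutex_lock(&box.lock);
    while (!box.done) pthread_cond_wait(&box.ready, &box.lock);
    int value = box.value;
    pthread_mutex_unlock(&box.lock);

    pthread_cond_destroy(&box.ready);
    pthread_mutex_destroy(&box.lock);
    return value;
}
```

The extra synchronization is the price of detaching. If the caller needs a result anyway, a joinable thread or a future is usually less code.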

answered Oct 23 '22 11:10

Jens Gustedt
