I have a query related to the implementation of threads in Linux. Linux does not have an explicit thread support. In userspace, we might use an thread library (like NPTL) for creating threads. Now if we use NPTL it supports 1:1 mapping. The kernel will use the <code>clone()</code> function to implement threads. Suppose I have created 4 threads. Then it would mean that: <ul> <li>There will be 4 <code>task_struct</code>. </li> <li>Inside the <code>task_struct</code>, there will be provision of sharing resources as per the arguments to clone <code>(CLONE_VM | CLONE_FS | CLONE_FILES | CLONE_SIGHAND)</code>.</li> </ul> Now I have the following query: <ol> <li>Will the 4 threads have the same PID? If someone can elaborate, how the PIDs are shared. </li> <li>How are the different threads identified; is there some TID (thread ID) concept?</li> </ol>

The four threads will have the same PID but only when viewed from above. What you (as a user) calls a PID is not what the kernel (looking from below) calls a PID. In the kernel, each thread has its own ID, called a PID, although it would possibly make more sense to call this a TID, or thread ID, and they also have a TGID (thread group ID) which is the PID of the first thread that was created when the process was created. When a new process is created, it appears as a thread where both the PID and TGID are the same (currently unused) number. When a thread starts another thread, that new thread gets its own PID (so the scheduler can schedule it independently) but it inherits the TGID from the original thread. That way, the kernel can happily schedule threads independent of what process they belong to, while processes (thread group IDs) are reported to you. The following hierarchy of threads may help(a): <pre class="prettyprint lang-none prettyprint-override"><code> USER VIEW vvvv vvvv | <-- PID 43 -->|<----------------- PID 42 -----------------> | | | +---------+ | | | process | | | _| pid=42 |_ | __(fork) _/ | tgid=42 | \_ (new thread) _ / | +---------+ | \ +---------+ | | +---------+ | process | | | | process | | pid=43 | | | | pid=44 | | tgid=43 | | | | tgid=42 | +---------+ | | +---------+ | | <-- PID 43 -->|<--------- PID 42 -------->|<--- PID 44 ---> | | ^^^^^^ ^^^^ KERNEL VIEW </code></pre> You can see that starting a new process (on the left) gives you a new PID and a new TGID (both set to the same value). Starting a new thread (on the right) gives you a new PID while maintaining the same TGID as the thread that started it. <hr> (a)Tremble in awe at my impressive graphical skills :-)

If threads share the same PID, how can they be identified?

1 Answers

The four threads will have the same PID but only when viewed from above. What you (as a user) calls a PID is not what the kernel (looking from below) calls a PID.

In the kernel, each thread has its own ID, called a PID, although it would possibly make more sense to call this a TID, or thread ID, and they also have a TGID (thread group ID) which is the PID of the first thread that was created when the process was created.

When a new process is created, it appears as a thread where both the PID and TGID are the same (currently unused) number.

When a thread starts another thread, that new thread gets its own PID (so the scheduler can schedule it independently) but it inherits the TGID from the original thread.

That way, the kernel can happily schedule threads independent of what process they belong to, while processes (thread group IDs) are reported to you.

The following hierarchy of threads may help^(a):

                         USER VIEW                          vvvv vvvv               |           <-- PID 43 -->|<----------------- PID 42 ----------------->               |                           |               |      +---------+          |               |      | process |          |               |     _| pid=42  |_         |          __(fork) _/ | tgid=42 | \_ (new thread) _         /     |      +---------+          |       \ +---------+   |                           |    +---------+ | process |   |                           |    | process | | pid=43  |   |                           |    | pid=44  | | tgid=43 |   |                           |    | tgid=42 | +---------+   |                           |    +---------+               |                           | <-- PID 43 -->|<--------- PID 42 -------->|<--- PID 44 --->               |                           |                         ^^^^^^ ^^^^                         KERNEL VIEW

You can see that starting a new process (on the left) gives you a new PID and a new TGID (both set to the same value). Starting a new thread (on the right) gives you a new PID while maintaining the same TGID as the thread that started it.

^(a)Tremble in awe at my impressive graphical skills :-)

answered Sep 26 '22 08:09

paxdiablo

Related questions
                            
                                Are "data races" and "race condition" actually the same thing in context of concurrent programming
                            
                                Returning value from Thread
                            
                                Why should Java ThreadLocal variables be static
                            
                                What is the difference between .Wait() vs .GetAwaiter().GetResult()?
                            
                                What are the main uses of yield(), and how does it differ from join() and interrupt()?
                            
                                How to scale threads according to CPU cores?
                            
                                Java Synchronized Block for .class
                            
                                When would you call java's thread.run() instead of thread.start()?
                            
                                Async/Await vs Threads
                            
                                Scala actors: receive vs react
                            
                                Is Java Regex Thread Safe?
                            
                                wait until all threads finish their work in java
                            
                                Accessing UI (Main) Thread safely in WPF
                            
                                Are Mutexes needed in javascript?
                            
                                Are there any cases when it's preferable to use a plain old Thread object instead of one of the newer constructs?
                            
                                AsyncTask threads never die
                            
                                What is the Re-entrant lock and concept in general?
                            
                                Which would be better for concurrent tasks on node.js? Fibers? Web-workers? or Threads?
                            
                                Does Python support multithreading? Can it speed up execution time?
                            
                                Using ThreadPool.QueueUserWorkItem in ASP.NET in a high traffic scenario

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

If threads share the same PID, how can they be identified?

Tags:

multithreading

linux-kernel

pid

SPSN

People also ask

1 Answers

paxdiablo

Recent Activity

Donate For Us