I want to understand what precisely is happening behind the scene when I spawn a new thread in .NET, something like here: <pre class="prettyprint"><code>Thread t = new Thread(DoWork); //I am not interested in DoWork per se t.Start(); </code></pre> 1. What thread-related objects are created in CLR and Windows kernel? 2. Why are those objects needed? 3. How much managed/unmanaged memory (heap and stack) is allocated on x86, x64 Windows? UPDATE I am looking for such objects as managed thread object, which is I assume is t, but perhaps some other additional managed objects; kernel thread object, user thread environment block and alike. Many thanks!

So this is a really complicated question that does not really have a great answer of "x". <ol> <li>The CLR is not required to map a single CLR thread to a single OS fiber. So... this is hard to answer. I think the current version of .NET (4.0) attempts to use a 1-to-1 relationship between CLR threads and OS fibers when possible on all OSes. Previous versions of .NET (more like <= 1.1) I'm not sure this was the case on all OSes. The scheduler handles most of the these objects and they won't be part of any .NET object graph. This scheduler is part of the CLR and not part of the <code>Thread</code> object. If you dig into the IL, you'll see many internal calls for actual execution. </li> <li>I assume the question is "Why are those objects needed?" If so, it's because the OS host has to actually have the fiber to execute the code for that thread on it. <code>ThreadPool</code> usage can greatly reduce this cost of creating them each time.</li> <li>Sorry... depends. A lot of it unmanaged as well, which means the OS host could choose to handle this differently depending on load and system version. </li> </ol> "The logical abstraction of a thread of control is captured by an instance of the <code>System.Threading.Thread</code> object in the class library." http://www.ecma-international.org/publications/files/ECMA-ST/Ecma-335.pdf So EMCA standard really doesn't say anything about the topic. But luckily we have... "Because the CLR thread object is per-fiber, any information hanging off of it is also per-fiber. Thread.ManagedThreadId returns a stable ID that flows around with the CLR thread. It is not dependent on the identity of the physical OS thread, which means using it implies no form of affinity. Different fibers running on the same thread return different IDs. " From Joe Duffy http://www.bluebytesoftware.com/blog/2006/11/10/FibersAndTheCLR.aspx

What is exactly happening when I spawn a new thread from .NET?

Tags:

.net

memory

windows

multithreading

kernel

I want to understand what precisely is happening behind the scene when I spawn a new thread in .NET, something like here:

Thread t = new Thread(DoWork); //I am not interested in DoWork per se
t.Start();

1. What thread-related objects are created in CLR and Windows kernel?
2. Why are those objects needed?
3. How much managed/unmanaged memory (heap and stack) is allocated on x86, x64 Windows?

UPDATE
I am looking for such objects as managed thread object, which is I assume is t, but perhaps some other additional managed objects; kernel thread object, user thread environment block and alike.

Many thanks!

221

asked Jun 09 '11 23:06

oleksii

3 Answers

Win32 and Kernel memory allocated

I'm not exactly sure how the .NET part works, but if the runtime does decide to create a real thread with the OS, it would eventually call the Win32 API CreateThread in kernel32.dll, probably from mscorlib.ni.dll

By default, new threads get 1MB of virtual address for the stack, which is committed as needed. This can be controlled with the maxStackSize parameter. The main thread's stack size comes from a parameter in the executable file itself.

In the process's address space, a TEB (thread environment block) will be allocated (see also). Incidentally, the FS register on x86 points to this for things like thread local storage and structured exception handling (SEH). There are probably other things allocated by Win32 that are not documented.

In creating the Win32 thread, the Win32 server process (csrss.exe) is contacted. You can see that csrss has handles open to all Win32 processes and threads in Process Explorer for some kind of bookkeeping.

DLLs loaded in the process will be notified of the new thread and may allocate their own memory for tracking the thread.

The kernel will create an ETHREAD [layout] (derived from KTHREAD) object from kernel non-paged pool to track the thread's state. There will also be a kernel stack allocated (12k default for x86) which can be paged out (unless the thread is in a kernel mode wait state).

Why so many things need to allocate memory for a thread

Threads are the smallest preemptively scheduled unit that the OS provides and there is a lot of context connected to them. Many different components need to provide separate context for each thread because system services need to be able to deal with multiple threads doing different things all at the same time.

Some services require you to declare new threads to them explicitly but most are expected to work with new threads automatically. Sometimes this means allocating space right when the thread is started. As the thread engages other services, the amount of memory used to track the thread can increase as those services set up their own context for the thread.

How much memory is allocated

It's hard to say how much memory is allocated for a thread since it is spread across several address spaces and heaps. It will vary between Windows versions, installed components and what is loaded into the process currently.

The largest cost is generally accepted to be the 1MB of address space used by default for new threads, but even this limit can allow many hundreds to be used in a single process without running out of space.

If the design is using many more OS threads than the number of CPUs in the system, it should be reviewed. Work queues with a thread pool and lightweight threads with user mode scheduling with fibers or another library's implementation should be able to handle mulithreading without requiring an excessive number of OS threads, rendering the memory cost of the threads to be unimportant.

answered Oct 10 '22 19:10

Chris Smith

So this is a really complicated question that does not really have a great answer of "x".

The CLR is not required to map a single CLR thread to a single OS fiber. So... this is hard to answer. I think the current version of .NET (4.0) attempts to use a 1-to-1 relationship between CLR threads and OS fibers when possible on all OSes. Previous versions of .NET (more like <= 1.1) I'm not sure this was the case on all OSes. The scheduler handles most of the these objects and they won't be part of any .NET object graph. This scheduler is part of the CLR and not part of the Thread object. If you dig into the IL, you'll see many internal calls for actual execution.
I assume the question is "Why are those objects needed?" If so, it's because the OS host has to actually have the fiber to execute the code for that thread on it. ThreadPool usage can greatly reduce this cost of creating them each time.
Sorry... depends. A lot of it unmanaged as well, which means the OS host could choose to handle this differently depending on load and system version.

"The logical abstraction of a thread of control is captured by an instance of the System.Threading.Thread object in the class library." http://www.ecma-international.org/publications/files/ECMA-ST/Ecma-335.pdf

So EMCA standard really doesn't say anything about the topic. But luckily we have...

"Because the CLR thread object is per-fiber, any information hanging off of it is also per-fiber. Thread.ManagedThreadId returns a stable ID that flows around with the CLR thread. It is not dependent on the identity of the physical OS thread, which means using it implies no form of affinity. Different fibers running on the same thread return different IDs. " From Joe Duffy http://www.bluebytesoftware.com/blog/2006/11/10/FibersAndTheCLR.aspx

answered Oct 10 '22 17:10

Travis

Look here; there is a mapping between managed (i.e. CLR) primitives and unmanaged (i.e. NT kernel) ones that may answer most of your questions.

answered Oct 10 '22 18:10

CesarGon

Related questions
                            
                                Shims are not generated for .NET methods
                            
                                Difference between ParameterInfo.DefaultValue and ParameterInfo.RawDefaultValue
                            
                                Difference between ParameterInfo.IsOptional and ParameterInfo.HasDefaultValue?
                            
                                SQLException when Canceling Async Query
                            
                                How do I use SimplSockets with a delegate for a "hello world" project?
                            
                                Exception from HRESULT: 0x8002000B (DISP_E_BADINDEX) for System.Runtime.InteropServices.COMException
                            
                                Exit Code When Unhandled Exception Terminates Execution?
                            
                                System.Web.Globalization namespace introduced with .NET 4.6.2 conflicts at runtime with System.Globalization
                            
                                How to bind a DataGridView to a SQLite Database?
                            
                                What constitutes 'redundant delegate creation'?
                            
                                Populate WinForms TreeView from DataTable
                            
                                How to get Elmah working with ASP.NET and IIS 5.1 URL Routing
                            
                                C#: How to implement IOrderedEnumerable<T>
                            
                                Using Reflection.Emit to create a class implementing an interface
                            
                                How to encode/decode video using C#?
                            
                                Delegates in .NET: how are they constructed?
                            
                                Is there a way to create a "Self-hosted" Web Site in .Net? [closed]
                            
                                WPF Toolkit DataGrid column resize event
                            
                                Speeding up the rate that IIS/.NET/LINQ retrieves data from the Network Buffers
                            
                                EF 4.1, Code-First: Is there an easy way to remove ALL conventions?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With