So, for the past several days I have been doing a lot of research about multiprocessing and multithreading in Python, and I am very confused about many things. I keep seeing people talk about the GIL, something that supposedly prevents Python code from executing on several CPU cores, but when I write a program that creates many threads I can see that several CPU cores are active.
First question: what is the GIL really, and how does it work? My guess is that when a process creates too many threads, the OS distributes the work across multiple CPUs. Am I right?
Another thing: I want to take full advantage of my CPUs. I am thinking of creating as many processes as there are CPU cores, and then having each of those processes create as many threads as there are CPU cores. Am I on the right track?
The GIL is a single lock on the interpreter itself which adds a rule that execution of any Python bytecode requires acquiring the interpreter lock. This prevents deadlocks (as there is only one lock) and doesn't introduce much performance overhead. But it effectively makes any CPU-bound Python program single-threaded.
The GIL provides an important simplifying model of object access (including refcount manipulation) because it ensures that only one thread of execution can mutate Python objects at a time. There are important performance benefits of the GIL for single-threaded operations as well.
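To make the "effectively single-threaded" claim concrete, here is a minimal sketch (my own illustration, not from the answer above; count_down and the numbers are arbitrary) that runs the same CPU-bound loop once sequentially and once split across two threads. Under CPython's GIL the threaded run typically takes about as long, or longer.

import time
from threading import Thread

def count_down(n):
    # Pure-Python, CPU-bound loop: it holds the GIL while it runs.
    while n > 0:
        n -= 1

N = 20_000_000

# Sequential run.
start = time.perf_counter()
count_down(N)
print(f"sequential:  {time.perf_counter() - start:.2f}s")

# Same total amount of work, split across two threads.
t1 = Thread(target=count_down, args=(N // 2,))
t2 = Thread(target=count_down, args=(N // 2,))
start = time.perf_counter()
t1.start(); t2.start()
t1.join(); t2.join()
print(f"two threads: {time.perf_counter() - start:.2f}s")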
Multiprocessing sidesteps the GIL: each process gets its own interpreter and memory space, so processes can run Python code on several cores at once, and the isolation also makes the system more robust (one crashed worker does not take down the rest). Multithreading keeps everything inside one process: threads are quick to create, share memory, and require few resources, whereas processes take significantly more time and memory to create and must exchange data by serializing it between them.
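To see that difference in practice, here is a minimal sketch (my own, with hypothetical names busy and timed) that runs the same CPU-bound function through a thread pool and a process pool using concurrent.futures; on a multi-core machine the process pool is usually the one that actually gets faster.

import time
from concurrent.futures import ThreadPoolExecutor, ProcessPoolExecutor

def busy(n):
    # CPU-bound work written in pure Python.
    total = 0
    for i in range(n):
        total += i * i
    return total

def timed(executor_cls, label):
    with executor_cls(max_workers=4) as pool:
        start = time.perf_counter()
        list(pool.map(busy, [2_000_000] * 8))
        print(f"{label}: {time.perf_counter() - start:.2f}s")

if __name__ == "__main__":  # the guard is required for process pools on Windows/macOS
    timed(ThreadPoolExecutor, "threads")
    timed(ProcessPoolExecutor, "processes")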
If you really need to scale, ParallelPython provides a mechanism for parallel execution of Python code across multiple cores and across clusters. Another option is a different Python implementation: both Jython and IronPython run without a GIL, and the PyPy STM branch may also work for your use case.
To start with, the GIL only ensures that a single CPython bytecode instruction runs at any given time. It does not care about which CPU core runs that instruction; that is the job of the OS kernel.
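As an aside, the standard dis module shows what "bytecode instruction" means here. The tiny example below (my own illustration) disassembles a one-line function; even x += 1 turns into several separate instructions, which is why the GIL's one-instruction-at-a-time guarantee does not make compound operations atomic.

import dis

def bump(x):
    x += 1
    return x

# Prints the CPython bytecode: load x, load 1, in-place add, store, return.
dis.dis(bump)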
So, going over your questions, and your main confusion in particular: you mention that when you run a program with many threads, you can see multiple (maybe all) CPU cores firing up. I did some experiments and found that your observation is correct (which is expected), but the behaviour is the same in a non-threaded version too. Compare these two snippets:
from multiprocessing.pool import ThreadPool
import time

def do_nothing(i):
    time.sleep(0.0001)
    return i * 2

ThreadPool(20).map(do_nothing, range(10000))
import time

def do_nothing(i):
    time.sleep(0.0001)
    return i * 2

[do_nothing(i) for i in range(10000)]
The first one is multithreaded and the second one is not. When you compare the CPU usage of the two programs, you will find that in both cases multiple CPU cores fire up. So what you noticed, although real, has little to do with the GIL or with threading: CPU usage rising on multiple cores simply means the OS kernel is distributing the execution of your code across different cores based on availability.
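If you want to watch that for yourself, here is a rough sketch (my own; it assumes the third-party psutil package is installed, and it enlarges the workload a little so there is time to take a few samples) that prints per-core CPU usage while the threaded snippet runs in the background.

import time
from threading import Thread
from multiprocessing.pool import ThreadPool

import psutil  # third-party: pip install psutil

def do_nothing(i):
    time.sleep(0.0001)
    return i * 2

# Run the threaded snippet in the background so we can sample CPU usage meanwhile.
worker = Thread(target=lambda: ThreadPool(20).map(do_nothing, range(100000)))
worker.start()

while worker.is_alive():
    # Percentage load of each logical core over the last quarter second.
    print(psutil.cpu_percent(interval=0.25, percpu=True))

worker.join()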
Your last question is more of an experimental matter, as different programs have different CPU/IO profiles. You just have to be aware of the cost of creating a thread versus a process, and of how the GIL and the Python virtual machine (PVM) work, and then tune the number of threads and processes to get the maximum performance out.
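For what it is worth, a common starting point (my addition, not part of the answer above) is roughly one process per core for CPU-bound work and more threads than cores for IO-bound work, then measure and adjust. A minimal sketch with concurrent.futures, using placeholder cpu_bound and io_bound functions:

import os
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor

def cpu_bound(n):
    # Stand-in for heavy pure-Python computation.
    return sum(i * i for i in range(n))

def io_bound(path):
    # Stand-in for work that mostly waits (disk, network, sleeping).
    with open(path, "rb") as f:
        return len(f.read())

if __name__ == "__main__":
    cores = os.cpu_count() or 1

    # CPU-bound: one worker process per core is a reasonable default.
    with ProcessPoolExecutor(max_workers=cores) as procs:
        print(list(procs.map(cpu_bound, [100_000] * cores)))

    # IO-bound: threads spend most of their time waiting, so oversubscribe.
    with ThreadPoolExecutor(max_workers=cores * 4) as threads:
        print(list(threads.map(io_bound, [__file__] * 8)))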
You can go through this talk by David Beazley to understand how multithreading can make your code perform worse (or better).
There are answers about what the Global Interpreter Lock (GIL) is here. Buried among the answers is mention of Python "bytecode", which is central to the issue. When your program is compiled, the output is bytecode, i.e. low-level computer instructions for a fictitious "Python" computer, that gets interpreted by the Python interpreter. When the interpreter is executing a bytecode, it serializes execution by acquiring the Global Interpreter Lock. This means that two threads cannot be executing bytecode concurrently on two different cores. This also means that true multi-threading is not implemented. But does this mean that there is no reason to use threading? No! Here are a couple of situations where threading is still useful:
- Calls into code written in C (or another language) that releases the GIL while it performs its long-running computation; the numpy module is one such highly optimized package.
- Tasks that spend most of their time waiting rather than computing, since the GIL is released while a thread is blocked.
Consequently, threading is best used when the tasks are not CPU-intensive, i.e. they do a lot of waiting for I/O to complete, or they do a lot of sleeping, etc.
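As a small illustration of the IO-bound case (my own sketch; the URL is just a placeholder), fetching several pages with a thread pool overlaps the network waits even though the GIL is still there, because the GIL is released while a thread is blocked on the socket.

import time
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen

URLS = ["https://www.example.com"] * 8  # placeholder URLs

def fetch(url):
    # The GIL is released while this thread waits on the network,
    # so other threads can make progress in the meantime.
    with urlopen(url, timeout=10) as resp:
        return len(resp.read())

start = time.perf_counter()
with ThreadPoolExecutor(max_workers=8) as pool:
    sizes = list(pool.map(fetch, URLS))
print(sizes, f"{time.perf_counter() - start:.2f}s")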