Why the Global Interpreter Lock?

2 Answers

In general, for any thread safety problem you will need to protect your internal data structures with locks. This can be done with various levels of granularity.

You can use fine-grained locking, where every separate structure has its own lock.
You can use coarse-grained locking where one lock protects everything (the GIL approach).

There are various pros and cons of each method. Fine-grained locking allows greater parallelism - two threads can execute in parallel when they don't share any resources. However there is a much larger administrative overhead. For every line of code, you may need to acquire and release several locks.

The coarse grained approach is the opposite. Two threads can't run at the same time, but an individual thread will run faster because its not doing so much bookkeeping. Ultimately it comes down to a tradeoff between single-threaded speed and parallelism.

There have been a few attempts to remove the GIL in python, but the extra overhead for single threaded machines was generally too large. Some cases can actually be slower even on multi-processor machines due to lock contention.

Do other languages that are compiled to bytecode employ a similar mechanism?

It varies, and it probably shouldn't be considered a language property so much as an implementation property. For instance, there are Python implementations such as Jython and IronPython which use the threading approach of their underlying VM, rather than a GIL approach. Additionally, the next version of Ruby looks to be moving towards introducing a GIL.

106

answered Oct 09 '22 08:10

Brian

The following is from the official Python/C API Reference Manual:

The Python interpreter is not fully thread safe. In order to support multi-threaded Python programs, there's a global lock that must be held by the current thread before it can safely access Python objects. Without the lock, even the simplest operations could cause problems in a multi-threaded program: for example, when two threads simultaneously increment the reference count of the same object, the reference count could end up being incremented only once instead of twice.

Therefore, the rule exists that only the thread that has acquired the global interpreter lock may operate on Python objects or call Python/C API functions. In order to support multi-threaded Python programs, the interpreter regularly releases and reacquires the lock -- by default, every 100 bytecode instructions (this can be changed with sys.setcheckinterval()). The lock is also released and reacquired around potentially blocking I/O operations like reading or writing a file, so that other threads can run while the thread that requests the I/O is waiting for the I/O operation to complete.

I think it sums up the issue pretty well.

answered Oct 09 '22 10:10

Eli Bendersky

Related questions
                            
                                Python: defining my own operators?
                            
                                Convert seconds to hh:mm:ss in Python [duplicate]
                            
                                SSL backend error when using OpenSSL
                            
                                Why does Python start at index -1 (as opposed to 0) when indexing a list from the end? [duplicate]
                            
                                What if I don't close the database connection in Python SQLite
                            
                                unittest Vs pytest
                            
                                equivalent of a python dict in R
                            
                                Understanding celery task prefetching
                            
                                Using mock patch to mock an instance method
                            
                                Can I run a Google Colab (free edition) script and then shutdown my computer?
                            
                                Fast prime factorization module
                            
                                Virtualenv and source version control
                            
                                gunicorn.errors.HaltServer: <HaltServer 'Worker failed to boot.' 3> django
                            
                                Why is copying a shuffled list much slower?
                            
                                Return and yield in the same function
                            
                                Difference between a -= b and a = a - b in Python
                            
                                Why do 3 backslashes equal 4 in a Python string?
                            
                                Why is the command bound to a Button or event executed when declared?
                            
                                Hashable, immutable
                            
                                How to integrate pep8.py in Eclipse?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why the Global Interpreter Lock?

Tags:

python

scripting

multithreading

bytecode

locking

Federico A. Ramponi

People also ask

2 Answers

Brian

Eli Bendersky

Recent Activity

Donate For Us