I'm embedding the python interpreter in a multithreaded C application and I'm a little confused as to what APIs I should use to ensure thread safety.
From what I gathered, when embedding Python it is up to the embedder to acquire the GIL before making any other Python C API call. This is done with these functions:
gstate = PyGILState_Ensure();
// do some python api calls, run python scripts
PyGILState_Release(gstate);
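For illustration, this is roughly how I use that pair from a worker thread. It is only a minimal sketch assuming pthreads; the function name and the embedded script are placeholders:

#include <Python.h>

// Hypothetical worker-thread entry point: every native thread that calls
// into the Python C API brackets those calls with the Ensure/Release pair.
static void *worker_thread(void *arg)
{
    PyGILState_STATE gstate = PyGILState_Ensure();   // take the GIL for this thread
    PyRun_SimpleString("print('hello from a worker thread')");  // any Python/C API calls
    PyGILState_Release(gstate);                       // hand the GIL back
    return NULL;
}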
But this alone doesn't seem to be enough. I still got random crashes since it doesn't seem to provide mutual exclusion for the Python APIs.
After reading some more docs I also added PyEval_InitThreads() right after the call to Py_IsInitialized(), but this is where it gets confusing. The docs state that this function will:
Initialize and acquire the global interpreter lock
This suggests that when this function returns, the GIL is supposed to be locked and should be unlocked somehow. But in practice this doesn't seem to be required: with this line in place my multithreaded app worked perfectly, and mutual exclusion was maintained by the PyGILState_Ensure/Release functions.
When I tried adding PyEval_ReleaseLock() after PyEval_InitThreads(), the app deadlocked pretty quickly in a subsequent call to PyImport_ExecCodeModule().
So what am I missing here?
I had exactly the same problem and it is now solved by calling PyEval_SaveThread() immediately after PyEval_InitThreads(), as you suggest above. However, my actual problem was that I called PyEval_InitThreads() after Py_Initialize(), which then caused PyGILState_Ensure() to block when called from different, subsequent native threads. In summary, this is what I do now:
There is a global variable:
static int gil_init = 0;
From the main thread, load the native C extension and start the Python interpreter:
Py_Initialize()
From multiple other threads my app concurrently makes a lot of calls into the Python/C API:
if (!gil_init) {
    // One-time setup, done lazily by the first worker thread that calls in:
    // PyEval_InitThreads() acquires the GIL, so it must be followed by
    // PyEval_SaveThread() to release it again, otherwise every other
    // thread blocks in PyGILState_Ensure().
    gil_init = 1;
    PyEval_InitThreads();
    PyEval_SaveThread();
}

state = PyGILState_Ensure();
// Call Python/C API functions...
PyGILState_Release(state);
From the main thread, stop the Python interpreter:
Py_Finalize()
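For what it's worth, here is how I would assemble those steps into one self-contained program. This is only a sketch of the same idea, with the PyEval_InitThreads()/PyEval_SaveThread() pair hoisted into the main thread right after Py_Initialize() instead of behind the gil_init flag; the thread count and the embedded script are arbitrary, and note that PyEval_InitThreads() is a no-op on Python 3.7+, deprecated since 3.9, and dropped from the newest CPython releases:

#include <Python.h>
#include <pthread.h>

static void *worker(void *arg)
{
    // Every native thread takes the GIL before touching the Python/C API.
    PyGILState_STATE state = PyGILState_Ensure();
    PyRun_SimpleString("print('hello from a worker')");
    PyGILState_Release(state);
    return NULL;
}

int main(void)
{
    pthread_t threads[4];
    PyThreadState *save;
    int i;

    Py_Initialize();
    PyEval_InitThreads();        // creates and acquires the GIL (no-op on 3.7+)
    save = PyEval_SaveThread();  // release the GIL so worker threads can take it

    for (i = 0; i < 4; i++)
        pthread_create(&threads[i], NULL, worker, NULL);
    for (i = 0; i < 4; i++)
        pthread_join(threads[i], NULL);

    PyEval_RestoreThread(save);  // reacquire the GIL before shutting down
    Py_Finalize();
    return 0;
}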
All other solutions I've tried either caused random Python segfaults or deadlocks/blocking in PyGILState_Ensure().
The Python documentation really should be more clear on this and at least provide an example for both the embedding and extension use cases.
Eventually I figured it out.
After PyEval_InitThreads() you need to call PyEval_SaveThread(), which properly releases the GIL held by the main thread.
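In code, the main-thread sequence I ended up with looks roughly like this (a sketch; the tstate variable name is mine):

Py_Initialize();
PyEval_InitThreads();                          // acquires the GIL in the main thread
PyThreadState *tstate = PyEval_SaveThread();   // release it so other threads can PyGILState_Ensure()
// ... worker threads run, using PyGILState_Ensure()/PyGILState_Release() ...
PyEval_RestoreThread(tstate);                  // take the GIL back before shutting down
Py_Finalize();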