Why is `gevent.spawn` different than a monkeypatched `threading.Thread()`?

Tags:

While double checking that threading.Condition is correctly monkey patched, I noticed that a monkeypatched threading.Thread(…).start() behaves differently from gevent.spawn(…).

Consider:

from gevent import monkey; monkey.patch_all()
from threading import Thread, Condition
import gevent

cv = Condition()

def wait_on_cv(x):
    cv.acquire()
    cv.wait()
    print "Here:", x
    cv.release()

# XXX: This code yields "This operation would block forever" when joining the first thread
threads = [ gevent.spawn(wait_on_cv, x) for x in range(10) ]

"""
# XXX: This code, which seems semantically similar, works correctly
threads = [ Thread(target=wait_on_cv, args=(x, )) for x in range(10) ]
for t in threads:
    t.start()
"""

cv.acquire()
cv.notify_all()
print "Notified!"
cv.release()

for x, thread in enumerate(threads):
    print "Joining", x
    thread.join()

Note, specifically, the two comments starting with XXX.

When using the first line (with gevent.spawn), the first thread.join() raises an exception:

Notified!
Joining 0
Traceback (most recent call last):
  File "foo.py", line 30, in 
    thread.join()
  File "…/gevent/greenlet.py", line 291, in join
    result = self.parent.switch()
  File "…/gevent/hub.py", line 381, in switch
    return greenlet.switch(self)
gevent.hub.LoopExit: This operation would block forever

However, Thread(…).start() (the second block), everything works as expected.

Why would this be? What's the difference between gevent.spawn() and Thread(…).start()?

235

asked Oct 23 '12 22:10

David Wolever

1 Answers

What happen in your code is that the greenlets that you have created in you threads list didn't have yet the chance to be executed because gevent will not trigger a context switch until you do so explicitly in your code using gevent.sleep() and such or implicitly by calling a function that block e.g. semaphore.wait() or by yielding and so on ..., to see that you can insert a print before cv.wait() and see that it's called only after cv.notify_all() is called:

def wait_on_cv(x):
    cv.acquire()
    print 'acquired ', x
    cv.wait()
    ....

So an easy fix to your code will be to insert something that will trigger a context switch after you create your list of greenlets, example:

...
threads = [ gevent.spawn(wait_on_cv, x) for x in range(10) ]
gevent.sleep()  # Trigger a context switch
...

Note: I am still new to gevent so i don't know if this is the right way to do it :)

This way all the greenlets will have the chance to be executed and each one of them will trigger a context switch when they call cv.wait() and in the mean time they will register them self to the condition waiters so that when cv.notify_all() is called it will notify all the greenlets.

HTH,

174

answered Sep 20 '22 04:09

mouad

Related questions
                            
                                What is the urls.py regex evaluation order in django?
                            
                                django-social-auth : How to redirect example.com to 127.0.0.1:8000?
                            
                                How to install pyodbc 64-bit?
                            
                                why doesn't EVERYTHING default to UTF-8? [closed]
                            
                                Compute outer product of arrays with arbitrary dimensions
                            
                                Adding my own description attribute to a Pandas DataFrame
                            
                                Edit table in pyqt using QAbstractTableModel
                            
                                Python print unicode doesn't show correct symbols
                            
                                Perl Inline::Python module, how to put code into a string
                            
                                Python: code.interact(local=locals()) where stdin/stdout are not available
                            
                                Numpy distutils howto
                            
                                Flask: passing around background worker job (rq, redis)
                            
                                Prevent / alter access to class variables [duplicate]
                            
                                How to list directory using Python [duplicate]
                            
                                What does a Python decorator do, and where is its code? [duplicate]
                            
                                Why it is compulsory to give classname while using super() in Python [duplicate]
                            
                                Python: OS Independent list of available storage devices
                            
                                Matrix factorization for collaborative filtering - new users and items?
                            
                                Get window position and size in python with Xlib
                            
                                Clarification: Does Heroku Run Python Apps Behind Nginx or Not?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why is `gevent.spawn` different than a monkeypatched `threading.Thread()`?

Tags:

python

multithreading

gevent

David Wolever

People also ask

1 Answers

mouad

Recent Activity

Donate For Us