Before I start describing my question, it might worth mentioning that I'm using Python 2.7. I haven't checked, but this might be irrelevant for Python 3.x. While working with Python's Queues, I've discovered something strange. Usually, when getting an object from the Queue, I allow long but finite timeout (such as a few seconds), to allow debugging and error reporting in case no object was found, when one was expected. What I've found out is that sometimes there's a strange gap between the time when an object was inserted into a previously empty Queue, and the time the <code>get</code> method of the very same Queue has returned that object, even though the method was called before the <code>put</code> was called for that object. Digging a little bit I've discovered that the gap was filled by sleeping. In the <code>Queue</code> module, if the <code>timeout</code> argument that is being passed to the <code>get</code> method is not <code>None</code>, and is positive, the <code>non_empty</code> <code>Condition</code>'s <code>wait</code> method is called with a positive argument (that is not 100% precise; in fact, the <code>Queue</code>'s "<code>_qsize</code>" method, which returns the length of the underlying <code>deque</code> is first verified to return 0, but as long as the queue was empty in the first place, the next thing is the condition's wait). The <code>Conditions</code>'s <code>wait</code> method acts differently if it gets a timeout or not. If it does not get any timeout, it simply calls <code>waiter.acquire</code>. This is defined in <code>C</code> and is beyond what I understand, but it seems like it works properly. However, if timeout is given, a bizarre sequence of sleeps occur instead, when the sleep times start at some arbitrary size (1 milisecond), and gets longer over time. Here's the exact code which runs: <pre class="prettyprint"><code># Balancing act: We can't afford a pure busy loop, so we # have to sleep; but if we sleep the whole timeout time, # we'll be unresponsive. The scheme here sleeps very # little at first, longer as time goes on, but never longer # than 20 times per second (or the timeout time remaining). endtime = _time() + timeout delay = 0.0005 # 500 us -> initial delay of 1 ms while True: gotit = waiter.acquire(0) if gotit: break remaining = endtime - _time() if remaining <= 0: break delay = min(delay * 2, remaining, .05) _sleep(delay) </code></pre> This is clearly the reason for the gap I've found between the time the new object was put into the previously-empty Queue, and the time that the already-called get method has returned that object. As the delay time grows exponentially until blocked by a huge (from my perspective) size of 0.05 seconds, it creates surprising and unwanted significant sleeps in my application's life. Can you explain what's the purpose of this? Are Python developers assume no Python user will care about such time lengths? Is there a quick workaround or a proper fix? Do you recommend me to overload the threading module?

I recently got hit by the same problem, and I also tracked it down to this exact block of code in the <code>threading</code> module. It sucks. <hr> <blockquote> Can you explain what's the purpose of this? Are Python developers assume no Python user will care about such time lengths? </blockquote> Beats me... <hr> <blockquote> Do you recommend me to overload the threading module? </blockquote> Either overload the threading module, or migrate to <code>python3</code>, where this part of the implementation has been fixed. In my case, migrating to python3 would have been a huge effort, so I chose the former. What I did was: <ol> <li>I created a quick <code>.so</code> file (using <code>cython</code>) with an interface to <code>pthread</code>. It includes python functions which invoke the corresponding <code>pthread_mutex_*</code> functions, and links against <code>libpthread</code>. Specifically, the function most relevant to the task we're interested in is pthread_mutex_timedlock.</li> <li>I created a new <code>threading2</code> module, (and replaced all <code>import threading</code> lines in my codebase with <code>import threading2</code>). In <code>threading2</code>, I re-defined all the relevant classes from <code>threading</code> (<code>Lock</code>, <code>Condition</code>, <code>Event</code>), and also ones from <code>Queue</code> which I use a lot (<code>Queue</code> and <code>PriorityQueue</code>). The <code>Lock</code> class was completely re-implemented using <code>pthread_mutex_*</code> functions, but the rest were much easier -- I simply subclassed the original (e.g. <code>threading.Event</code>), and overridden <code>__init__</code> to create my new <code>Lock</code> type. The rest just worked.</li> </ol> The implementation of the new <code>Lock</code> type was very similar to the original implementation in <code>threading</code>, but I based the new implemenation of <code>acquire</code> on the code I found in <code>python3</code>'s <code>threading</code> module (which, naturally, is much simpler than the abovementioned "balancing act" block). This part was fairly easy. (Btw, the result in my case was 30% speedup of my massively-multithreaded process. Even more than I expected.) I hope this helps.

Arbitrary sleeping in threading's wait with timeout

Tags:

python

sleep

python-multithreading

python-2.7

Before I start describing my question, it might worth mentioning that I'm using Python 2.7. I haven't checked, but this might be irrelevant for Python 3.x.

While working with Python's Queues, I've discovered something strange. Usually, when getting an object from the Queue, I allow long but finite timeout (such as a few seconds), to allow debugging and error reporting in case no object was found, when one was expected. What I've found out is that sometimes there's a strange gap between the time when an object was inserted into a previously empty Queue, and the time the get method of the very same Queue has returned that object, even though the method was called before the put was called for that object.

Digging a little bit I've discovered that the gap was filled by sleeping. In the Queue module, if the timeout argument that is being passed to the get method is not None, and is positive, the non_empty Condition's wait method is called with a positive argument (that is not 100% precise; in fact, the Queue's "_qsize" method, which returns the length of the underlying deque is first verified to return 0, but as long as the queue was empty in the first place, the next thing is the condition's wait).

The Conditions's wait method acts differently if it gets a timeout or not. If it does not get any timeout, it simply calls waiter.acquire. This is defined in C and is beyond what I understand, but it seems like it works properly. However, if timeout is given, a bizarre sequence of sleeps occur instead, when the sleep times start at some arbitrary size (1 milisecond), and gets longer over time. Here's the exact code which runs:

# Balancing act:  We can't afford a pure busy loop, so we
# have to sleep; but if we sleep the whole timeout time,
# we'll be unresponsive.  The scheme here sleeps very
# little at first, longer as time goes on, but never longer
# than 20 times per second (or the timeout time remaining).
endtime = _time() + timeout
delay = 0.0005 # 500 us -> initial delay of 1 ms
while True:
    gotit = waiter.acquire(0)
    if gotit:
        break
    remaining = endtime - _time()
    if remaining <= 0:
        break
    delay = min(delay * 2, remaining, .05)
    _sleep(delay)

This is clearly the reason for the gap I've found between the time the new object was put into the previously-empty Queue, and the time that the already-called get method has returned that object. As the delay time grows exponentially until blocked by a huge (from my perspective) size of 0.05 seconds, it creates surprising and unwanted significant sleeps in my application's life.

Can you explain what's the purpose of this? Are Python developers assume no Python user will care about such time lengths? Is there a quick workaround or a proper fix? Do you recommend me to overload the threading module?

696

asked Mar 03 '14 12:03

Bach

1 Answers

I recently got hit by the same problem, and I also tracked it down to this exact block of code in the threading module.

It sucks.

Can you explain what's the purpose of this? Are Python developers assume no Python user will care about such time lengths?

Beats me...

Do you recommend me to overload the threading module?

Either overload the threading module, or migrate to python3, where this part of the implementation has been fixed.

In my case, migrating to python3 would have been a huge effort, so I chose the former. What I did was:

I created a quick .so file (using cython) with an interface to pthread. It includes python functions which invoke the corresponding pthread_mutex_* functions, and links against libpthread. Specifically, the function most relevant to the task we're interested in is pthread_mutex_timedlock.
I created a new threading2 module, (and replaced all import threading lines in my codebase with import threading2). In threading2, I re-defined all the relevant classes from threading (Lock, Condition, Event), and also ones from Queue which I use a lot (Queue and PriorityQueue). The Lock class was completely re-implemented using pthread_mutex_* functions, but the rest were much easier -- I simply subclassed the original (e.g. threading.Event), and overridden __init__ to create my new Lock type. The rest just worked.

The implementation of the new Lock type was very similar to the original implementation in threading, but I based the new implemenation of acquire on the code I found in python3's threading module (which, naturally, is much simpler than the abovementioned "balancing act" block). This part was fairly easy.

(Btw, the result in my case was 30% speedup of my massively-multithreaded process. Even more than I expected.)

I hope this helps.

148

answered Sep 27 '22 18:09

shx2

Related questions
                            
                                Namespace packages with a core part?
                            
                                Sqlalchemy complains that foreign key doesn't exist but actually it exists
                            
                                Very fast algorithm for all paths between two nodes
                            
                                Multiple permissions in view_config decorator?
                            
                                How to correctly shut down Python RQ worker processes dynamically?
                            
                                Implementing horizon charts in matplotlib
                            
                                Reversing from module import *
                            
                                Python3: Looking for alternatives to gevent and pylibmc/python-memcached
                            
                                Python subprocess - write multiple stdin
                            
                                replacing pronoun with its antecedent using python2.7 and nltk
                            
                                How to integrate Python scripting in my Android App (like SL4A)
                            
                                Project organization with Cython and C++
                            
                                Discrete optimization in python
                            
                                libxml2.2.dylib reference in python program
                            
                                Fade Between Two Music Tracks in-progress in Pygame
                            
                                Decorator to log function execution line by line
                            
                                Python live coding/debugging
                            
                                How to use Python kazoo library?
                            
                                How to (can I) ask a PIPE how many bytes it has available for reading?
                            
                                What is parameter name in PyArg_UnpackTuple (python c api) for?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With