Control scheduling priority of asyncio coroutines possible?

Tags:

python-asyncio

Is there any way to control the scheduling priority among all coroutines that are ready to run?

Specifically, I have several coroutines handling streaming I/O from the network into several queues, a second set of coroutines that ingest the data from the queues into a data structure. These ingestion coroutines signal a third set of coroutines that analyze that data structure whenever new data is ingested.

Data arrival from the network is an infinite stream with a non-deterministic message rate. I want the analysis step to run as soon as new data arrives, but not before all pending data is processed. The problem I see is that depending on the order of scheduling, an analysis coroutine could run before a reader coroutine that also had data ready, so the analysis coroutine can't even check the ingestion queues for pending data because it may not have been read off the network yet, even though those reader coroutines were ready to run.

One solution might be to structure the coroutines into priority groups so that the reader coroutines would always be scheduled before the analysis coroutines if they were both able to run, but I didn't see a way to do this.

Is there a feature of asyncio that can accomplish this prioritization? Or perhaps I'm asking the wrong question and I can restructure the coroutines such that this can't happen (but I don't see it).

-- edit --

Basically I have a N coroutines that look something like this:

while True:
  data = await socket.get()
  ingestData(data)
  self.event.notify()

So the problem I'm running into is that there's no way for me to know that any of the other N-1 sockets have data ready while executing this coroutine so I can't know whether or not I should notify the event. If I could prioritize these coroutines above the analysis coroutine (which is awaiting self.event.wait()) then I could be sure none of them were runnable when the analysis coroutine is scheduled.

478

asked Jan 20 '18 21:01

djmarcin

1 Answers

asyncio doesn't support explicitly specifying coroutine priorities, but it is straightforward to achieve the same effect them with the tools provided by the library. Given the example in your question:

async def process_pending():
    while True:
    data = await socket.get()
        ingestData(data)
        self.event.notify()

You could await the sockets directly using asyncio.wait, and then you would know which sockets are actionable, and only notify the analyzers after all have been processed. For example:

def _read_task(self, socket):
    loop = asyncio.get_event_loop()
    task = loop.create_task(socket.get())
    task.__process_socket = socket
    return task

async def process_pending_all(self):
    tasks = {self._read_task(socket) for socket in self.sockets}
    while True:
        done, not_done = await asyncio.wait(
            tasks, return_when=asyncio.FIRST_COMPLETED)
        for task in done:
            ingestData(task.result())
            not_done.add(self._read_task(task.__process_socket))
        tasks = not_done
        self.event.notify()

133

answered Sep 28 '22 01:09

user4815162342

Related questions
                            
                                aiohttp.client_exceptions.ClientConnectorError: Cannot connect to host stackoverflow.com:443 ssl:default [Connect call failed ('151.101.193.69', 443)]
                            
                                Python asyncio.semaphore in async-await function
                            
                                Python async: Waiting for stdin input while doing other stuff
                            
                                Sharing python objects across multiple workers
                            
                                Wait for timeout or event being set for asyncio.Event
                            
                                How to check a SSL certificate expiration date with aiohttp?
                            
                                Why am I getting different results when using a list comprehension with coroutines with asyncio?
                            
                                how can I asynchronously map/filter an asynchronous iterable?
                            
                                When to use multiple event loops?
                            
                                Python async AttributeError aexit
                            
                                Python async/await downloading a list of urls
                            
                                How to schedule a task in asyncio so it runs at a certain date?
                            
                                Converting a Python function with a callback to an asyncio awaitable
                            
                                partial asynchronous functions are not detected as asynchronous
                            
                                How to measure Python's asyncio code performance?
                            
                                python3 -Get result from async method
                            
                                Why am I getting NotImplementedError with async and await on Windows?
                            
                                Monitoring the asyncio event loop
                            
                                RuntimeError: Timeout context manager should be used inside a task
                            
                                Python asyncio: event loop does not seem to stop when stop method is called

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With