I'm trying to make ~300 API calls at the same time, so that I get all the results within a couple of seconds at most.
My pseudo-code looks like this:
import asyncio
import requests

def function_1():
    colors = ['yellow', 'green', 'blue']  # + ~300 other ones
    loop = asyncio.new_event_loop()
    asyncio.set_event_loop(loop)
    res = loop.run_until_complete(get_color_info(colors))

async def get_color_info(colors):
    loop = asyncio.get_event_loop()
    responses = []
    for color in colors:
        print("getting color")
        url = "https://api.com/{}/".format(color)
        data = loop.run_in_executor(None, requests.get, url)
        r = await data
        responses.append(r.json())
    return responses
Doing this I see "getting color" printed out every second or so, and the code takes forever, so I'm pretty sure the requests don't run simultaneously. What am I doing wrong?
aiohttp with Native Coroutines (async/await)
Here is a typical pattern that accomplishes what you're trying to do. (Python 3.7+.) One major change is that you will need to move from requests, which is built for synchronous IO, to a package such as aiohttp that is built specifically to work with async/await (native coroutines):
import asyncio
import aiohttp  # pip install aiohttp aiodns

async def get(
    session: aiohttp.ClientSession,
    color: str,
    **kwargs
) -> dict:
    url = f"https://api.com/{color}/"
    print(f"Requesting {url}")
    resp = await session.request('GET', url=url, **kwargs)
    # Note that this may raise an exception for non-2xx responses
    # You can either handle that here, or pass the exception through
    data = await resp.json()
    print(f"Received data for {url}")
    return data

async def main(colors, **kwargs):
    # Asynchronous context manager. Prefer this rather
    # than using a different session for each GET request
    async with aiohttp.ClientSession() as session:
        tasks = []
        for c in colors:
            tasks.append(get(session=session, color=c, **kwargs))
        # asyncio.gather() will wait on the entire task set to be
        # completed. If you want to process results greedily as they come in,
        # loop over asyncio.as_completed()
        results = await asyncio.gather(*tasks, return_exceptions=True)
        return results

if __name__ == '__main__':
    colors = ['red', 'blue', 'green']  # ...
    # Either take colors from stdin or make some default here
    asyncio.run(main(colors))  # Python 3.7+
There are two distinct elements to this, one being the asynchronous aspect of the coroutines and one being the concurrency introduced on top of that when you specify a container of tasks (futures):

1. First there is get(), which uses await with two awaitables: the first being .request and the second being .json. This is the async aspect. The purpose of awaiting these IO-bound responses is to tell the event loop that other get() calls can take turns running through that same routine.

2. Then there is await asyncio.gather(*tasks). This maps the awaitable get() call to each of your colors. The result is an aggregate list of returned values. Note that this wrapper will wait until all of your responses come in and call .json(). If, alternatively, you want to process them greedily as they are ready, you can loop over asyncio.as_completed(): each Future object returned represents the earliest result from the set of the remaining awaitables.
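For example, here is a minimal sketch of that as_completed() variant, reusing the get() coroutine from the snippet above (the name main_as_completed is just for illustration):

async def main_as_completed(colors, **kwargs):
    async with aiohttp.ClientSession() as session:
        tasks = [get(session=session, color=c, **kwargs) for c in colors]
        results = []
        # as_completed() yields awaitables in the order they finish,
        # so each response can be handled as soon as it arrives
        for coro in asyncio.as_completed(tasks):
            results.append(await coro)
        return results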
Lastly, take note that asyncio.run() is a high-level "porcelain" function introduced in Python 3.7. In earlier versions, you can mimic it (roughly) like:
# The "full" versions makes a new event loop and calls
# loop.shutdown_asyncgens(), see link above
loop = asyncio.get_event_loop()
try:
loop.run_until_complete(main(colors))
finally:
loop.close()
There are a number of ways to limit the rate of concurrency. For instance, see the questions "asyncio.Semaphore in async-await function" and "Large numbers of tasks with limited concurrency".
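As a rough sketch of the semaphore approach, building on the code above (the limit of 10 and the helper name bounded_get are arbitrary choices for illustration):

async def bounded_get(semaphore, session, color, **kwargs):
    # Allow at most N get() calls to run at once; the rest
    # wait here until the semaphore frees a slot
    async with semaphore:
        return await get(session=session, color=color, **kwargs)

async def main(colors, **kwargs):
    semaphore = asyncio.Semaphore(10)  # at most 10 requests in flight
    async with aiohttp.ClientSession() as session:
        tasks = [bounded_get(semaphore, session, c, **kwargs) for c in colors]
        return await asyncio.gather(*tasks, return_exceptions=True)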