Python Asynchronous Comprehensions - how do they work?

Tags: python, asynchronous, list-comprehension, python-3.6

I'm having trouble understanding the use of asynchronous comprehensions introduced in Python 3.6. As a disclaimer, I don't have a lot of experience dealing with asynchronous code in general in Python.

The example given in the "What's New in Python 3.6" document is:

    result = [i async for i in aiter() if i % 2]

In the PEP, this is expanded to:

    result = []
    async for i in aiter():
        if i % 2:
            result.append(i)

I think I understand that the aiter() function gets called asynchronously, so that each iteration of aiter can proceed without the previous one necessarily returning yet (or is this understanding wrong?).

What I'm not sure about is how that then translates to the list comprehension here. Do results get placed into the list in the order that they are returned? Or are there effective 'placeholders' in the final list so that each result is placed in the list in the right order? Or am I thinking about this the wrong way?

Additionally, is someone able to provide a real-world example that would illustrate both an applicable use case and the basic mechanics of async in comprehensions like this?


asked Feb 20 '17 02:02

Andrew Guy

2 Answers

You are basically asking how an async for loop differs from a regular for loop. That you can now use such a loop in a list comprehension doesn't make any difference here; that's just an optimisation that avoids repeated list.append() calls, exactly like a normal list comprehension does.

An async for loop, then, simply awaits each next step of the iteration protocol, where a regular for loop would block.

To illustrate, imagine a normal for loop:

    for foo in bar:
        ...

For this loop, Python essentially does this:

    bar_iter = iter(bar)
    while True:
        try:
            foo = next(bar_iter)
        except StopIteration:
            break
        ...

The next(bar_iter) call is not asynchronous; it blocks.

Now replace for with async for, and what Python does changes to:

    bar_iter = aiter(bar)  # not a builtin before Python 3.10; see below
    while True:
        try:
            foo = await anext(bar_iter)  # not a builtin before Python 3.10; see below
        except StopAsyncIteration:  # async iterators raise this, not StopIteration
            break
        ...

In the above example, aiter() and anext() were fictional functions when async for was introduced (built-ins with these names were only added later, in Python 3.10). They are the functional equivalents of their iter() and next() brethren, but instead of __iter__ and __next__ they use __aiter__ and __anext__. That is to say, asynchronous hooks exist for the same functionality and are distinguished from their non-async variants by the prefix a.
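As a minimal sketch of those hooks, the following defines a hypothetical Ticker class (not from the standard library) that implements __aiter__ and __anext__, then iterates over it both by hand and with an async comprehension. The asyncio.sleep() call is only a stand-in for real I/O.

```python
import asyncio


class Ticker:
    """Hypothetical async iterator that yields 0..n-1, pausing briefly each step."""

    def __init__(self, n):
        self.n = n
        self.i = 0

    def __aiter__(self):
        # Like __iter__: a plain method that returns the iterator object.
        return self

    async def __anext__(self):
        # Like __next__, but a coroutine; exhaustion is StopAsyncIteration.
        if self.i >= self.n:
            raise StopAsyncIteration
        await asyncio.sleep(0.01)  # stand-in for real I/O
        value = self.i
        self.i += 1
        return value


async def manual_loop(iterable):
    """Roughly what `async for item in iterable` does, written out by hand."""
    items = []
    iterator = iterable.__aiter__()
    while True:
        try:
            item = await iterator.__anext__()
        except StopAsyncIteration:
            break
        items.append(item)
    return items


async def main():
    by_hand = await manual_loop(Ticker(3))
    by_syntax = [i async for i in Ticker(3)]
    return by_hand, by_syntax


print(asyncio.run(main()))  # ([0, 1, 2], [0, 1, 2])
```

Both forms produce the same list; the comprehension is just the compact spelling of the hand-written loop.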

The await keyword there is the crucial difference: on each iteration, an async for loop can yield control so that other coroutines can run in the meantime.

To reiterate, all of this was already added in Python 3.5 (see PEP 492); all that is new in Python 3.6 is that you can use such a loop in a list comprehension too, and in generator expressions and set and dict comprehensions, for that matter.
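To make the "other coroutines can run" part concrete, here is a small sketch in which two tasks each run an async comprehension over a hypothetical async generator called numbers(). The shared log list is only there to record the order in which steps happen; asyncio.sleep(0) is used as a simple way to hand control back to the event loop.

```python
import asyncio


async def numbers(name, count, log):
    """Hypothetical async generator; each step awaits, giving other tasks a turn."""
    for i in range(count):
        await asyncio.sleep(0)  # hand control back to the event loop
        log.append((name, i))
        yield i


async def consume(name, count, log):
    # Within one task, iteration is still strictly sequential.
    return [i async for i in numbers(name, count, log)]


async def main():
    log = []
    a, b = await asyncio.gather(consume("a", 3, log), consume("b", 3, log))
    return a, b, log


a, b, log = asyncio.run(main())
print(a, b)  # [0, 1, 2] [0, 1, 2]
print(log)   # steps from "a" and "b" should appear interleaved
```

Each comprehension still collects its own items in order; the interleaving happens between the two tasks, not within one loop.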
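A short sketch of the other comprehension forms, assuming a hypothetical async generator readings() that stands in for a stream of (sensor, value) pairs. All of these must appear inside an async def function.

```python
import asyncio


async def readings():
    """Hypothetical async generator yielding (sensor, value) pairs."""
    for pair in [("a", 1), ("b", 2), ("a", 3)]:
        await asyncio.sleep(0)
        yield pair


async def main():
    as_list = [value async for _, value in readings()]
    as_set = {sensor async for sensor, _ in readings()}
    # Later pairs overwrite earlier ones with the same key.
    as_dict = {sensor: value async for sensor, value in readings()}
    # An async generator expression is itself an async iterable, so it has
    # to be consumed with `async for` or another async comprehension.
    doubled = (value * 2 async for _, value in readings())
    as_doubled = [v async for v in doubled]
    return as_list, as_set, as_dict, as_doubled


print(asyncio.run(main()))
```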

Last but not least, the same set of changes also made it possible to use await <expression> in the expression section of a comprehension, so:

    [await func(i) for i in someiterable]

is now possible.
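As a sketch of that last form, assume a hypothetical coroutine fetch_length() standing in for a network call. Note that awaiting inside a comprehension is still sequential: each await completes before the next element is handled. If the calls are independent and you want them to overlap, asyncio.gather() is the usual tool instead.

```python
import asyncio


async def fetch_length(word):
    """Hypothetical coroutine standing in for a network call."""
    await asyncio.sleep(0.01)
    return len(word)


async def main():
    words = ["spam", "eggs", "ham"]
    # Each await finishes before the next element is processed (sequential).
    one_by_one = [await fetch_length(w) for w in words]
    # For concurrency, create all the coroutines and await them together.
    together = await asyncio.gather(*(fetch_length(w) for w in words))
    return one_by_one, list(together)


print(asyncio.run(main()))  # ([4, 4, 3], [4, 4, 3])
```

Both approaches return results in input order; they differ only in whether the waits overlap.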


answered Sep 20 '22 12:09

Martijn Pieters

> I think I understand that the aiter() function gets called asynchronously, so that each iteration of aiter can proceed without the previous one necessarily returning yet (or is this understanding wrong?).

That understanding is wrong. Iterations of an async for loop cannot be performed in parallel. async for is just as sequential as a regular for loop.

The asynchronous part of async for is that it lets the iterator await on behalf of the coroutine iterating over it. It's only for use within asynchronous coroutines, and only for use on special asynchronous iterables. Other than that, it's mostly just like a regular for loop.
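A small sketch of that sequential behaviour, using a hypothetical async generator slow_then_fast() whose earliest items take the longest to produce. If iterations could overtake one another, the later items would arrive first; they do not, so no "placeholders" are needed and the list is simply filled in iteration order.

```python
import asyncio


async def slow_then_fast():
    """Hypothetical async generator whose first item takes longest to produce."""
    for value, delay in [(1, 0.03), (2, 0.02), (3, 0.01), (4, 0.0), (5, 0.0)]:
        await asyncio.sleep(delay)
        yield value


async def main():
    # The next item is not requested until the previous one has been
    # delivered, so items are appended in the order they are produced.
    return [i async for i in slow_then_fast() if i % 2]


print(asyncio.run(main()))  # [1, 3, 5]
```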

answered Sep 21 '22 12:09

user2357112 supports Monica
