I need to communicate between processes in Python and am using asyncio in each of the processes for concurrent network IO. Currently I'm using multiprocessing.Pipe to send and recv fairly large amounts of data between the processes; however, I do this outside of asyncio, and I believe I'm spending a lot of CPU time in IO_WAIT because of it.

It seems like asyncio can and should be used to handle the Pipe IO between processes, but I can't find an example for anything other than piping STDIN/STDOUT. From what I've read, it seems I should register the pipe with loop.connect_read_pipe(PROTOCOL_FACTORY, PIPE), and likewise for writing. However, I don't understand the purpose of protocol_factory as it relates to a multiprocessing.Pipe. It's not even clear whether I should be creating a multiprocessing.Pipe at all, or whether I can create a pipe within asyncio.
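For context, a minimal sketch of the kind of setup I mean (names and sizes are just illustrative); the blocking recv() happens outside of asyncio:

import asyncio
import multiprocessing

def worker(conn):
    # Child process: produce a large payload and push it through the pipe.
    conn.send(b'x' * 10_000_000)

async def consume(conn):
    # This recv() blocks the whole event loop while the data arrives,
    # which is where I suspect the IO_WAIT time is going.
    data = conn.recv()
    print(len(data))

def main():
    parent_conn, child_conn = multiprocessing.Pipe()
    multiprocessing.Process(target=worker, args=(child_conn,)).start()
    asyncio.get_event_loop().run_until_complete(consume(parent_conn))

if __name__ == '__main__':
    main()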
asyncio essentially gives you much finer-grained control over concurrency, at the cost of having to manage that concurrency yourself.
multiprocessing.Pipe returns high-level multiprocessing.Connection objects, which pickle and unpickle Python objects and transmit additional framing bytes under the hood. If you wanted to read data from one of these pipes using loop.connect_read_pipe(), you would have to re-implement all of this yourself.
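To illustrate what that means, a minimal synchronous sketch: send() pickles the object and writes it with a length header, and recv() reads it back and unpickles it for you.

import multiprocessing

parent_conn, child_conn = multiprocessing.Pipe()
# Whole Python objects go through the Connection, not raw bytes.
child_conn.send({'key': [1, 2, 3]})   # pickled behind the scenes
print(parent_conn.recv())             # {'key': [1, 2, 3]}, unpickled for you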
The easiest way to read from a multiprocessing.Pipe without blocking the event loop would be to use loop.add_reader(). Consider the following example:
import asyncio
import multiprocessing

def main():
    read, write = multiprocessing.Pipe(duplex=False)
    writer_process = multiprocessing.Process(target=writer, args=(write,))
    writer_process.start()
    asyncio.get_event_loop().run_until_complete(reader(read))

async def reader(read):
    data_available = asyncio.Event()
    # Wake the event whenever the pipe's file descriptor becomes readable.
    asyncio.get_event_loop().add_reader(read.fileno(), data_available.set)
    if not read.poll():
        await data_available.wait()
    print(read.recv())
    data_available.clear()

def writer(write):
    write.send('Hello World')

if __name__ == '__main__':
    main()
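On recent Python versions, creating the loop implicitly with asyncio.get_event_loop() outside of a running loop is deprecated, so the same add_reader() approach can also be written with asyncio.run() and asyncio.get_running_loop(). A sketch of that variant (assuming Python 3.7+):

import asyncio
import multiprocessing

async def reader(read):
    loop = asyncio.get_running_loop()
    data_available = asyncio.Event()
    # Register the Connection's underlying fd with the running loop.
    loop.add_reader(read.fileno(), data_available.set)
    if not read.poll():
        await data_available.wait()
    print(read.recv())
    loop.remove_reader(read.fileno())

def writer(write):
    write.send('Hello World')

def main():
    read, write = multiprocessing.Pipe(duplex=False)
    multiprocessing.Process(target=writer, args=(write,)).start()
    asyncio.run(reader(read))

if __name__ == '__main__':
    main()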
Pipes created using the lower-level os.pipe don't add anything extra the way that pipes from multiprocessing.Pipe do. As a result, we can use os.pipe with loop.connect_read_pipe() without re-implementing any sort of inner workings. Here is an example:
import asyncio
import multiprocessing
import os

def main():
    read, write = os.pipe()
    writer_process = multiprocessing.Process(target=writer, args=(write,))
    writer_process.start()
    asyncio.get_event_loop().run_until_complete(reader(read))

async def reader(read):
    # connect_read_pipe() expects a file-like object, not a raw fd.
    pipe = os.fdopen(read, mode='r')
    loop = asyncio.get_event_loop()
    stream_reader = asyncio.StreamReader()

    def protocol_factory():
        # The protocol feeds incoming bytes into the StreamReader.
        return asyncio.StreamReaderProtocol(stream_reader)

    transport, _ = await loop.connect_read_pipe(protocol_factory, pipe)
    print(await stream_reader.readline())
    transport.close()

def writer(write):
    os.write(write, b'Hello World\n')

if __name__ == '__main__':
    main()
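The question also asked about the write side. For completeness, here is a sketch of the mirror image using loop.connect_write_pipe(); it writes through the returned transport directly rather than wrapping it in a StreamWriter, and (like the examples above) it relies on the child process inheriting the raw file descriptor, i.e. the fork start method on Unix:

import asyncio
import multiprocessing
import os

def main():
    read, write = os.pipe()
    reader_process = multiprocessing.Process(target=reader, args=(read,))
    reader_process.start()
    asyncio.get_event_loop().run_until_complete(writer(write))
    reader_process.join()

async def writer(write):
    loop = asyncio.get_event_loop()
    # As with reading, connect_write_pipe() wants a file-like object.
    pipe = os.fdopen(write, mode='wb')
    transport, _ = await loop.connect_write_pipe(asyncio.Protocol, pipe)
    transport.write(b'Hello World\n')
    transport.close()

def reader(read):
    # Plain blocking read in the child process.
    with os.fdopen(read, mode='r') as pipe:
        print(pipe.readline())

if __name__ == '__main__':
    main()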
This code helped me figure out how to use loop.connect_read_pipe.
aiopipe seems to do what you want! It can be used with the built-in multiprocessing module, and provides a similar API to the regular blocking pipes.