I'm running a script via Python's subprocess module. Currently I use: <pre class="prettyprint"><code>p = subprocess.Popen('/path/to/script', stdout=subprocess.PIPE, stderr=subprocess.PIPE) result = p.communicate() </code></pre> I then print the result to the stdout. This is all fine but as the script takes a long time to complete, I wanted real time output from the script to stdout as well. The reason I pipe the output is because I want to parse it.

<code>p.communicate()</code> waits for the subprocess to complete and then returns its entire output at once. Have you tried something like this instead, where you read the subprocess output line-by-line? <pre class="prettyprint"><code>p = subprocess.Popen('/path/to/script', stdout=subprocess.PIPE, stderr=subprocess.PIPE) for line in p.stdout: # do something with this individual line print line </code></pre>

Displaying subprocess output to stdout and redirecting it

Tags:

python

subprocess

stdout

I'm running a script via Python's subprocess module. Currently I use:

p = subprocess.Popen('/path/to/script', stdout=subprocess.PIPE, stderr=subprocess.PIPE)
result = p.communicate()

I then print the result to the stdout. This is all fine but as the script takes a long time to complete, I wanted real time output from the script to stdout as well. The reason I pipe the output is because I want to parse it.

626

asked Sep 09 '14 17:09

AsadSMalik

2 Answers

To save subprocess' stdout to a variable for further processing and to display it while the child process is running as it arrives:

#!/usr/bin/env python3
from io import StringIO
from subprocess import Popen, PIPE

with Popen('/path/to/script', stdout=PIPE, bufsize=1,
           universal_newlines=True) as p, StringIO() as buf:
    for line in p.stdout:
        print(line, end='')
        buf.write(line)
    output = buf.getvalue()
rc = p.returncode

To save both subprocess's stdout and stderr is more complex because you should consume both streams concurrently to avoid a deadlock:

stdout_buf, stderr_buf = StringIO(), StringIO()
rc =  teed_call('/path/to/script', stdout=stdout_buf, stderr=stderr_buf,
                universal_newlines=True)
output = stdout_buf.getvalue()
...

where teed_call() is define here.

Update: here's a simpler asyncio version.

^{Old version:}

Here's a single-threaded solution based on child_process.py example from tulip:

import asyncio
import sys
from asyncio.subprocess import PIPE

@asyncio.coroutine
def read_and_display(*cmd):
    """Read cmd's stdout, stderr while displaying them as they arrive."""
    # start process
    process = yield from asyncio.create_subprocess_exec(*cmd,
            stdout=PIPE, stderr=PIPE)

    # read child's stdout/stderr concurrently
    stdout, stderr = [], [] # stderr, stdout buffers
    tasks = {
        asyncio.Task(process.stdout.readline()): (
            stdout, process.stdout, sys.stdout.buffer),
        asyncio.Task(process.stderr.readline()): (
            stderr, process.stderr, sys.stderr.buffer)}
    while tasks:
        done, pending = yield from asyncio.wait(tasks,
                return_when=asyncio.FIRST_COMPLETED)
        assert done
        for future in done:
            buf, stream, display = tasks.pop(future)
            line = future.result()
            if line: # not EOF
                buf.append(line)    # save for later
                display.write(line) # display in terminal
                # schedule to read the next line
                tasks[asyncio.Task(stream.readline())] = buf, stream, display

    # wait for the process to exit
    rc = yield from process.wait()
    return rc, b''.join(stdout), b''.join(stderr)

The script runs '/path/to/script command and reads line by line both its stdout&stderr concurrently. The lines are printed to parent's stdout/stderr correspondingly and saved as bytestrings for future processing. To run the read_and_display() coroutine, we need an event loop:

import os

if os.name == 'nt':
    loop = asyncio.ProactorEventLoop() # for subprocess' pipes on Windows
    asyncio.set_event_loop(loop)
else:
    loop = asyncio.get_event_loop()
try:
    rc, *output = loop.run_until_complete(read_and_display("/path/to/script"))
    if rc:
        sys.exit("child failed with '{}' exit code".format(rc))
finally:
    loop.close()

answered Oct 23 '22 14:10

jfs

p.communicate() waits for the subprocess to complete and then returns its entire output at once.

Have you tried something like this instead, where you read the subprocess output line-by-line?

p = subprocess.Popen('/path/to/script', stdout=subprocess.PIPE, stderr=subprocess.PIPE)
for line in p.stdout:
  # do something with this individual line
  print line

answered Oct 23 '22 15:10

Dan Lenski

Related questions
                            
                                String immutability in CPython violated
                            
                                bottle framework with multiple files
                            
                                How to force errorbars to render last with Matplotlib
                            
                                Python’s `str.format()`, fill characters, and ANSI colors
                            
                                Using PostGIS on Python 3
                            
                                How to retrieve session data with Flask?
                            
                                multiprocessing GUI schemas to combat the "Not Responding" blocking
                            
                                Matplotlib animate fill_between shape
                            
                                Python urlparse.parse_qs unicode url
                            
                                How to remove whitespaces and newlines from every value in a JSON file?
                            
                                xml.etree.ElementTree get node depth
                            
                                Explain the speed difference between numpy's vectorized function application VS python's for loop
                            
                                Error: command 'gcc' failed: No such file or directory
                            
                                Is there a solid method for wavelet analysis in Python?
                            
                                python queue get size, use qsize() or len()?
                            
                                How to use the cross-spectral density to calculate the phase shift of two related signals
                            
                                Python list set value at index if index does not exist
                            
                                How to forward-declare/prototype a function in Python? [duplicate]
                            
                                Matlab equivalent of Python enumerate
                            
                                How to use dill to serialize a class definition?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With