Python's Popen cleanup

Tags:

resource-cleanup

I wanted a Python equivalent of piping shell commands in Perl, something like the Python version of open(PIPE, "command |").

I went to the subprocess module and tried this:

p = subprocess.Popen("zgrep thingiwant largefile", shell=True, stdout=subprocess.PIPE)

This works for reading the output the same way I would in Perl, but it doesn't clean up after itself. When I exit the interpreter, I get

grep: writing output: Broken pipe

spewed all over stderr a few million times. I guess I had naively hoped all this would be taken care of for me, but that's not the case. Calling terminate or kill on p doesn't seem to help. Looking at the process table, I see that this kills the /bin/sh process but leaves the child gzip in place to complain about the broken pipe.

What's the right way to do this?


asked Apr 07 '10 20:04

pythonic metaphor

2 Answers

The issue is that the pipe is full. The subprocess stops, waiting for the pipe to empty out, but then your process (the Python interpreter) quits, breaking its end of the pipe (hence the error message).

p.wait() will not help you:

Warning: This will deadlock if the child process generates enough output to a stdout or stderr pipe such that it blocks waiting for the OS pipe buffer to accept more data. Use communicate() to avoid that.

http://docs.python.org/library/subprocess.html#subprocess.Popen.wait

p.communicate() will not help you:

Note: The data read is buffered in memory, so do not use this method if the data size is large or unlimited.

http://docs.python.org/library/subprocess.html#subprocess.Popen.communicate

p.stdout.read(num_bytes) will not help you:

Warning: Use communicate() rather than .stdin.write, .stdout.read or .stderr.read to avoid deadlocks due to any of the other OS pipe buffers filling up and blocking the child process.

http://docs.python.org/library/subprocess.html#subprocess.Popen.stdout

The moral of the story is that, for large output, subprocess.PIPE will doom you to certain failure if your program is trying to read the data. It seems to me that you should be able to put p.stdout.read(bytes) into a while p.returncode is None: loop (bearing in mind that p.returncode is only updated when p.poll() or p.wait() is called), but the above warning suggests that this could deadlock.
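For what it's worth, here is a minimal sketch of such a read loop, assuming only stdout is piped (stderr is left alone, so there is no second pipe that could fill up). It stops on end-of-file rather than on p.returncode. The helper name count_bytes and the stand-in child command are made up for illustration; in the question's case the command would be the zgrep call.

```python
import subprocess
import sys

def count_bytes(cmd, chunk_size=65536):
    """Run cmd, consume its stdout in fixed-size chunks, return (total bytes, exit code)."""
    total = 0
    with subprocess.Popen(cmd, stdout=subprocess.PIPE) as p:
        while True:
            chunk = p.stdout.read(chunk_size)
            if not chunk:  # b"" means the child closed its end of the pipe (EOF)
                break
            total += len(chunk)  # stand-in for real per-chunk processing
    # Leaving the with-block closes p.stdout and waits for the child to exit
    return total, p.returncode

# Hypothetical child that writes about 1 MB, more than a typical OS pipe buffer holds
child = [sys.executable, "-c", "import sys; sys.stdout.write('x' * 1000000)"]
total, returncode = count_bytes(child)
```

Because the parent keeps draining the pipe until EOF, the child is never left blocked on a full buffer, and memory use stays bounded by chunk_size.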

The docs suggest replacing a shell pipe with this:

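from subprocess import Popen, PIPE

p1 = Popen(["zgrep", "thingiwant", "largefile"], stdout=PIPE)
p2 = Popen(["processreceivingdata"], stdin=p1.stdout, stdout=PIPE)
p1.stdout.close()  # allow p1 to receive a SIGPIPE if p2 exits first
output = p2.communicate()[0]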

Notice that p2 is taking its standard input directly from p1. This should avoid deadlocks, but given the contradictory warnings above, who knows.

Anyway, if that last part doesn't work for you (it should, though), you could try creating a temporary file, writing all data from the first call to that, and then using the temporary file as input to the next process.
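The temporary-file fallback could look roughly like the sketch below. The producer and consumer commands are placeholders (small Python one-liners standing in for zgrep and for whatever process receives the data), and it assumes the whole output fits on disk.

```python
import subprocess
import sys
import tempfile

# Stand-in commands (assumptions): the producer plays the role of zgrep,
# the consumer plays the role of the next process in the pipeline.
producer = [sys.executable, "-c", "print('match one'); print('match two')"]
consumer = [sys.executable, "-c", "import sys; print(len(sys.stdin.readlines()))"]

with tempfile.TemporaryFile() as tmp:
    # The child writes straight to the file, so no pipe buffer can fill up
    subprocess.run(producer, stdout=tmp, check=True)
    tmp.seek(0)  # rewind so the consumer reads from the start
    result = subprocess.run(consumer, stdin=tmp, stdout=subprocess.PIPE, check=True)

line_count = int(result.stdout)  # the consumer printed how many lines it received
```

The trade-off is that the first command has to finish before the second one starts, so nothing is streamed.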


answered Oct 10 '22 00:10

Daniel G

After you open the pipe, you can work with the command output through p.stdout:

for line in p.stdout:
    pass  # do stuff with each line
p.stdout.close()
p.wait()  # reap the child once its output has been consumed
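If you need to stop before the output is exhausted, the question's observation applies: with shell=True, terminate or kill on p only reaches /bin/sh. One possible approach on POSIX systems, sketched here under the assumption that Python 3.2 or later is available, is to start the shell in its own session and signal the whole process group. The shell loop below is only a stand-in for a long-running command such as the zgrep call.

```python
import os
import signal
import subprocess

# start_new_session=True makes the shell the leader of a new process group,
# so a single signal can reach the shell and its children (POSIX only).
p = subprocess.Popen(
    "while true; do echo thingiwant; done | cat",  # stand-in for a long-running pipeline
    shell=True,
    stdout=subprocess.PIPE,
    start_new_session=True,
)

first_lines = [p.stdout.readline() for _ in range(3)]  # read a little, then stop early

os.killpg(p.pid, signal.SIGTERM)  # signal the whole group, not just /bin/sh
p.stdout.close()
returncode = p.wait()  # reap the shell so no zombie is left behind
```

The group ID equals p.pid here only because the child was started as a session leader. Without start_new_session=True, os.killpg on the child's group would also signal the Python interpreter itself.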

answered Oct 09 '22 22:10

tzot


Tags:

python

popen