Performance of subprocess.check_output vs subprocess.call

I've been using subprocess.check_output() for some time to capture output from subprocesses, but ran into some performance problems under certain circumstances. I'm running this on a RHEL6 machine.

The calling Python environment is Linux-compiled and 64-bit. The subprocess I'm executing is a shell script which eventually fires off a Windows python.exe process via Wine (why this foolishness is required is another story). As input to the shell script, I'm piping in a small bit of Python code that gets passed on to python.exe.

While the system is under moderate/heavy load (40 to 70% CPU utilization), I've noticed that using subprocess.check_output(cmd, shell=True) can result in a significant delay (up to ~45 seconds) after the subprocess has finished execution before the check_output command returns. Looking at output from ps -efH during this time shows the called subprocess as sh <defunct>, until it finally returns with a normal zero exit status.

Conversely, using subprocess.call(cmd, shell=True) to run the same command under the same moderate/heavy load returns immediately with no delay, with all output printed to STDOUT/STDERR (rather than returned from the function call).

Why is there such a significant delay only when check_output() is redirecting the STDOUT/STDERR output into its return value, and not when the call() simply prints it back to the parent's STDOUT/STDERR?

asked Aug 15 '14 20:08

greenlaw

1 Answer

Reading the docs, both subprocess.call and subprocess.check_output are thin wrappers around subprocess.Popen. One minor difference is that check_output raises CalledProcessError if the subprocess returns a non-zero exit status. The greater difference is in this passage from the documentation for check_output (note the part about stdout):

The full function signature is largely the same as that of the Popen constructor, except that stdout is not permitted as it is used internally. All other supplied arguments are passed directly through to the Popen constructor.

So how is stdout "used internally"? Let's compare call and check_output:

call

    def call(*popenargs, **kwargs):
        return Popen(*popenargs, **kwargs).wait()

check_output

    def check_output(*popenargs, **kwargs):
        if 'stdout' in kwargs:
            raise ValueError('stdout argument not allowed, it will be overridden.')
        process = Popen(stdout=PIPE, *popenargs, **kwargs)
        output, unused_err = process.communicate()
        retcode = process.poll()
        if retcode:
            cmd = kwargs.get("args")
            if cmd is None:
                cmd = popenargs[0]
            raise CalledProcessError(retcode, cmd, output=output)
        return output
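To make the difference in behaviour concrete, here is a minimal sketch. The child command is a hypothetical stand-in (a one-line Python script run via sys.executable), not the command from the question. It shows call returning the exit status while the child writes to the parent's stdout, and check_output capturing the output and raising CalledProcessError on a non-zero status.

```python
import subprocess
import sys

# Hypothetical child command: prints one line, then exits with status 3.
cmd = [sys.executable, "-c", "import sys; print('hello'); sys.exit(3)"]

# call() lets the child write straight to our stdout and returns its exit status.
status = subprocess.call(cmd)

# check_output() captures stdout through a pipe and raises on a non-zero status.
try:
    subprocess.check_output(cmd)
    captured = None
    error_status = 0
except subprocess.CalledProcessError as exc:
    captured = exc.output          # the bytes the child wrote before exiting
    error_status = exc.returncode  # the non-zero exit status
```

With call, the line "hello" appears on the terminal and nothing is returned but the status; with check_output, the same bytes end up on the exception's output attribute.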

communicate

Now we have to look at Popen.communicate as well. Doing so, we notice that even with a single pipe, communicate does more work than simply returning Popen().wait(), as call does.

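For one thing, communicate has to read from the stdout pipe whether you set shell=True or not. call does not: the child simply inherits the parent's stdout and stderr and writes to them directly. (Separately, shell=True is a security risk when the command includes untrusted input, as the Python documentation for subprocess warns.)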

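Secondly, in the case of check_output(cmd, shell=True) (just one pipe), whatever your subprocess sends to stdout must be read by communicate until the pipe reaches end-of-file, and only after that does Popen wait on the subprocess itself. On Windows the reading is done by a helper thread in the _communicate method, which Popen must join first; on POSIX, as far as I can tell from the source, the pipe is read directly. Either way, end-of-file arrives only once every process holding the write end of the pipe has closed it, so a descendant that outlives the shell can keep check_output blocked. That would be consistent with the sh <defunct> entry in the question: the shell has exited, but it has not been waited on yet.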
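One way to see how reading a pipe can hold up the return is the sketch below. It assumes a POSIX system with sh and sleep available, and the command is a hypothetical stand-in for the script in the question: the shell starts a background sleep that inherits stdout, prints a line, and exits at once. call only waits for the shell, while check_output keeps reading until the background process has closed its copy of the pipe.

```python
import subprocess
import time

# Assumption: POSIX sh and sleep are on PATH. The background sleep inherits
# the shell's stdout; the shell itself prints one line and exits immediately.
cmd = "sleep 1 & echo done"

start = time.monotonic()
status = subprocess.call(cmd, shell=True)  # child writes to our stdout directly
call_elapsed = time.monotonic() - start

start = time.monotonic()
output = subprocess.check_output(cmd, shell=True)  # reads the pipe until end-of-file
check_output_elapsed = time.monotonic() - start

print("call: %.2fs, check_output: %.2fs" % (call_elapsed, check_output_elapsed))
```

Whether a lingering process of this kind is what actually happens under Wine in the question is an assumption; the sketch only shows that such a process is enough to delay check_output while leaving call unaffected.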

Plus, more trivially, _communicate collects stdout as a list of chunks which must then be joined into a string.
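The accumulate-and-join pattern looks roughly like the following. This is an illustrative sketch written against the public Popen API, not the library's actual code, and the child command is again a hypothetical stand-in.

```python
import subprocess
import sys

# Hypothetical child that prints three short lines.
cmd = [sys.executable, "-c", "for i in range(3): print(i)"]

process = subprocess.Popen(cmd, stdout=subprocess.PIPE)

chunks = []
while True:
    chunk = process.stdout.read(4096)  # returns b"" once the pipe hits end-of-file
    if not chunk:
        break
    chunks.append(chunk)

process.stdout.close()
retcode = process.wait()       # only now is the child reaped
output = b"".join(chunks)      # one join at the end instead of repeated concatenation
```

Note the order: the pipe is drained first and the child is waited on afterwards, which is the part call never has to do.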

In short, even with minimal arguments, check_output spends a lot more time in Python code than call does.

answered Sep 22 '22 06:09

Joseph8th

Tags: python, linux, subprocess, wine