I need to periodically check the stdout of a running process. For example, the process is tail -f /tmp/file, which is spawned in the Python script. Then, every x seconds, the stdout of that subprocess is written to a string and further processed. The subprocess is eventually stopped by the script.
To parse the stdout of a subprocess, I have used check_output until now, but that doesn't seem to work here, as the process is still running and doesn't produce a definite output.
>>> from subprocess import check_output
>>> out = check_output(["tail", "-f", "/tmp/file"])
#(waiting for tail to finish)
It should be possible to use threads for the subprocesses, so that the output of multiple subprocesses may be processed (e.g. tail -f /tmp/file1, tail -f /tmp/file2).
How can I start a subprocess, periodically check and process its stdout and eventually stop the subprocess in a multithreading friendly way? The python script runs on a Linux system.
The goal is not to continuously read a file; the tail command is only an example, as it behaves exactly like the actual command used.
edit: I didn't think this through; the file did not exist. check_output now simply waits for the process to finish.
edit2: An alternative method with Popen and PIPE appears to result in the same issue: it waits for tail to finish.
>>> from subprocess import Popen, PIPE, STDOUT
>>> cmd = 'tail -f /tmp/file'
>>> p = Popen(cmd, shell=True, stdin=PIPE, stdout=PIPE, stderr=STDOUT, close_fds=True)
>>> output = p.stdout.read()
#(waiting for tail to finish)
Your second attempt is 90% correct. The only issue is that you are attempting to read all of tail's stdout at once after it has finished. However, tail is intended to run (indefinitely?) in the background, so you really want to read its stdout line by line:
from subprocess import Popen, PIPE, STDOUT

p = Popen(["tail", "-f", "/tmp/file"], stdin=PIPE, stdout=PIPE, stderr=STDOUT)
for line in p.stdout:
    print(line)
I have removed the shell=True and close_fds=True arguments. The first is unnecessary and potentially dangerous, while the second is just the default.
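As an aside, here is an illustration of why shell=True is risky if any part of the command ever comes from untrusted input (the malicious filename is hypothetical, and the dangerous call is commented out on purpose):

from subprocess import Popen

filename = "/tmp/file; rm -rf ~"   # hypothetical malicious "filename"

# With shell=True the string is interpreted by /bin/sh, so the embedded
# "; rm -rf ~" would run as a second command:
# Popen("tail -f " + filename, shell=True)

# With an argument list (and the default shell=False), the whole string is
# passed to tail verbatim as a single argument; it is merely a file that
# does not exist:
# Popen(["tail", "-f", filename])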
Remember that file objects are iterable over their lines in Python. The for loop will run until tail dies, but it will process each line as it appears, unlike read, which blocks until tail dies.
If I create an empty file /tmp/file, start this program, and begin echoing lines into the file from another shell, the program will echo those lines. You should probably replace print with something a bit more useful.
Here is an example of commands I typed after starting the code above:
Command line
$ echo a > /tmp/file
$ echo b > /tmp/file
$ echo c >> /tmp/file
Program Output (From Python in a different shell)
b'a\n'
b'tail: /tmp/file: file truncated\n'
b'b\n'
b'c\n'
(The file truncated message appears because the second echo used >, which truncates the file, while >> appends.)
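Note that p.stdout yields bytes lines by default, which is why the output above shows b'a\n'. If you would rather work with str, Popen accepts text=True (called universal_newlines before Python 3.7); a small variation on the loop above:

from subprocess import Popen, PIPE, STDOUT

# text=True makes p.stdout yield decoded str lines instead of bytes
p = Popen(["tail", "-f", "/tmp/file"], stdin=PIPE, stdout=PIPE,
          stderr=STDOUT, text=True)
for line in p.stdout:
    print(line, end="")   # each line already ends with "\n"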
In case you want your main program to be responsive while you react to the output of tail, start the loop in a separate thread. You should make this thread a daemon, so that it does not prevent your program from exiting even if tail is not finished. You can have the thread open the subprocess, or you can just pass in the standard output to it. I prefer the latter approach, since it gives you more control in the main thread:
from subprocess import Popen, PIPE, STDOUT
from threading import Thread

def deal_with_stdout():
    for line in p.stdout:
        print(line)

p = Popen(["tail", "-f", "/tmp/file"], stdin=PIPE, stdout=PIPE, stderr=STDOUT)
t = Thread(target=deal_with_stdout, daemon=True)
t.start()
t.join()
The code here is nearly identical, with the addition of a new thread. I added a join() at the end so the program would behave well as an example (join waits for the thread to die before returning). You probably want to replace that with whatever processing code you would normally be running.
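For example, the main thread could do its own periodic work in place of the join() and eventually stop tail; a sketch, where the one-second interval and ten-iteration lifetime are placeholders:

import time

# continuing from the Popen/Thread setup above
for _ in range(10):
    time.sleep(1)   # placeholder for real work done every second

p.terminate()       # eventually stop tail
p.wait()            # reap the child so it does not become a zombie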
If your thread is complex enough, you may also want to inherit from Thread and override the run method instead of passing in a simple target.
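A minimal sketch of that subclassing approach, where TailReader is a hypothetical name:

from subprocess import Popen, PIPE, STDOUT
from threading import Thread

class TailReader(Thread):
    """Daemon thread that follows one file via a tail -f subprocess."""

    def __init__(self, path):
        super().__init__(daemon=True)
        self.proc = Popen(["tail", "-f", path],
                          stdin=PIPE, stdout=PIPE, stderr=STDOUT)

    def run(self):
        # process each line as it appears, like the loop above
        for line in self.proc.stdout:
            print(line)

    def stop(self):
        self.proc.terminate()
        self.proc.wait()

t1 = TailReader("/tmp/file1")
t2 = TailReader("/tmp/file2")
t1.start()
t2.start()

This also covers the multi-file case from the question: each TailReader owns its own subprocess, and stop() ends it when you are done.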