CompletedProcess from subprocess.run() doesn't return a string

Tags:

According to the Python 3.5 docs, subprocess.run() returns an a CompletedProcess object with a stdout member that contains "A bytes sequence, or a string if run() was called with universal_newlines=True." I'm only seeing a byte sequence and not a string, which I was assuming (hoping) would be equivalent to a text line. For example,

import pprint
import subprocess

my_data = ""
line_count = 0

proc = subprocess.run(
         args = [ 'cat', 'input.txt' ],
         universal_newlines = True,
         stdout = subprocess.PIPE)

for text_line in proc.stdout:
    my_data += text_line
    line_count += 1

word_file = open('output.txt', 'w')
pprint.pprint(my_data, word_file)
pprint.pprint(line_count, word_file)

Note: this uses a new feature in Python 3.5 that won't run in previous versions.

Do I need to create my own line buffering logic, or is there a way to get Python to do that for me?

667

asked Dec 04 '15 23:12

highpost

2 Answers

proc.stdout is already a string in your case, run print(type(proc.stdout)), to make sure. It contains all subprocess' output -- subprocess.run() does not return until the child process is dead.

for text_line in proc.stdout: is incorrect: for char in text_string enumerates characters (Unicode codepoints) in Python, not lines. To get lines, call:

lines = result.stdout.splitlines()

The result may be different from .split('\n') if there are Unicode newlines in the string.

If you want to read the output line by line (to avoid running out of memory for long-running processes):

from subrocess import Popen, PIPE

with Popen(command, stdout=PIPE, universal_newlines=True) as process:
    for line in process.stdout:
        do_something_with(line)

Note: process.stdout is a file-like object in this case. Popen() does not wait for the process to finish -- Popen() returns immidiately as soon as the child process is started. process is a subprocess.Popen instance, not CompletedProcess here.

If all you need is to count the number of lines (terminated by b'\n') in the output, like wc -l:

from functools import partial

with Popen(command, stdout=PIPE) as process:
    read_chunk = partial(process.stdout.read, 1 << 13)
    line_count = sum(chunk.count(b'\n') for chunk in iter(read_chunk, b''))

See Why is reading lines from stdin much slower in C++ than Python?

176

answered Sep 19 '22 15:09

jfs

if you need to have STDOUT lines in an array to better manipulate them you simply miss to split output by the "Universal newline" separators

nmap_out = subprocess.run(args = ['nmap', '-T4', '-A', '192.168.1.128'],
                              universal_newlines = True,
                              stdout = subprocess.PIPE)

nmap_lines = nmap_out.stdout.splitlines()
print(nmap_lines)

output is:

['Starting Nmap 7.01 ( https://nmap.org ) at 2016-02-28 12:24 CET', 'Note: Host seems down. If it is really up, but blocking our ping probes, try -Pn', 'Nmap done: 1 IP address (0 hosts up) scanned in 2.37 seconds']

answered Sep 16 '22 15:09

Gianfranco

Related questions
                            
                                Intersecting matplotlib graph with unsorted data
                            
                                Python xlwt create faulty excel book
                            
                                Subprocess calls, are they done in parallel?
                            
                                the reason: python string assignments accidentally change '\b' into '\x08' and '\a' into '\x07', why Python did this?
                            
                                Append value to one list in dictionary appends value to all lists in dictionary
                            
                                How do I setup messaging and session middleware in a Django RequestFactory during unit testing
                            
                                Passing an instance of a class (object of a class) to another class
                            
                                Does Python's imaplib let you set a timeout?
                            
                                how to call another webservice api from flask
                            
                                Week of a month pandas
                            
                                Flask: are blueprints necessary for app factories?
                            
                                How to work with the scrapy contracts?
                            
                                How do I generate random text in NLTK 3.0?
                            
                                Django: How to login user directly after registration using generic CreateView
                            
                                Get a unique list of items that occur more than once in a list
                            
                                How to use custom token model in Django Rest Framework
                            
                                Is there a numpy biginteger?
                            
                                Calculate new value based on decreasing value
                            
                                Pandas: cannot filter based on string equality
                            
                                Why I get 'list' object has no attribute 'items'?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

CompletedProcess from subprocess.run() doesn't return a string

Tags:

python

subprocess

python-3.5

highpost

People also ask

2 Answers

jfs

Gianfranco

Recent Activity

Donate For Us