Python - Limit amount of data subprocess.Popen can produce

Tags:

I found lots of similar questions asking size of an object at run time in python. Some of the answers suggests to set a limit on amount of memory of sub-process. I do not want to set a limit on memory of sub-process. Here is what I want --

I'm using subprocess.Popen() to execute an external program. I can, very well, get standard output and error with process.stdout.readlines() and process.stderr.readlines() after the process is complete.

I have a problem when an erroneous program gets into an infinite loop and keeps producing output. Since subprocess.Popen() stores output data in memory this infinite loop quickly eats up entire memory and program slows down.

One solution is that I can run the command with timeout. But programs take variable time to complete. Large timeout, for a program taking small time and having an infinite loop, defeats the purpose of having it.

Is there any simple way where I can put an upper limit say 200MB on amount of data the command can produce? If it exceeds the limit command should get killed.

847

asked May 02 '13 07:05

Aryaveer

2 Answers

First: It is not subprocess.Popen() storing the data, but it is the pipe between "us" and "our" subprocess.

You shouldn't use readlines() in this case as this will indefinitely buffer the data and only at the end return them as a list (in this case, it is indeed this function which stores the data).

If you do something like

bytes = lines = 0
for line in process.stdout:
    bytes += len(line)
    lines += 1
    if bytes > 200000000 or lines > 10000:
        # handle the described situation
        break

you can act as wanted in your question. But you shouldn't forget to kill the subprocess afterwards in order to stop it producing further data.

But if you want to take care of stderr as well, you'd have to try to replicate process.communicate()'s behaviour with select() etc., and act appropriately.

answered Oct 13 '22 00:10

glglgl

There doesn't seem to be an easy answer to what you want

http://linux.about.com/library/cmd/blcmdl2_setrlimit.htm

rlimit has a flag to limit memory, CPU or number of open files, but apparently nothing to limit the amount of I/O.

You should handle the case manually as already described.

answered Oct 13 '22 01:10

LtWorf

Related questions
                            
                                How can I generate "Go First" Dice for N dice?
                            
                                PIP/easy_install PIL in Virtualenv vcvarsall.bat error Windows 7
                            
                                Has the DataFrame object from pandas superceded the other alternatives for heterogeneous data types?
                            
                                Automatically document my REST API
                            
                                SQLAlchemy temporary table with Declarative Base
                            
                                Testing InlineFormset clean methods
                            
                                Parsing EDGAR filings
                            
                                Cython: Inline Function not pure C
                            
                                Best way to programmatically save a webpage to a Static HTML File
                            
                                Python winreg looping through sub-keys
                            
                                OLS with pandas: datetime index as predictor
                            
                                Using Python for iOS programming [duplicate]
                            
                                Comparing sentences according to their meaning
                            
                                How do I use the xlib and OpenGL modules together with python?
                            
                                Evaluating a function at a point in SymPy
                            
                                Length cutting through file handling
                            
                                Custom Qt Widgets with python for Qt Designer
                            
                                dev-server HTTP Error 403: Forbidden
                            
                                Python: GIL context - switching
                            
                                Python, using ctypes to create C++ class wrapper

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python - Limit amount of data subprocess.Popen can produce

Tags:

python

subprocess

Aryaveer

People also ask

2 Answers

glglgl

LtWorf

Recent Activity

Donate For Us