How do I close a Python 2.5.2 Popen subprocess once I have the data I need?

Tags: python, pipe, popen

I am running the following version of Python:

$ /usr/bin/env python --version
Python 2.5.2

I am running the following Python code to write data from a child subprocess to standard output, and reading that into a Python variable called metadata:

# Extract metadata (snippet from extractMetadata.py)
inFileAsGzip = "%s.gz" % inFile
if os.path.exists(inFileAsGzip):
    os.remove(inFileAsGzip)
os.symlink(inFile, inFileAsGzip)
extractMetadataCommand = "bgzip -c -d -b 0 -s %s %s" % (metadataRequiredFileSize, inFileAsGzip)
metadataPipes = subprocess.Popen(extractMetadataCommand, stdin=None, stdout=subprocess.PIPE, shell=True, close_fds=True)
metadata = metadataPipes.communicate()[0]
metadataPipes.stdout.close()
os.remove(inFileAsGzip)
print metadata

The use case is as follows, to pull the first ten lines of standard output from the aforementioned code snippet:

$ extractMetadata.py | head

The error will appear if I pipe into head, awk, grep, etc.

The script ends with the following error:

close failed: [Errno 32] Broken pipe

I would have thought closing the pipes would be sufficient, but obviously that's not the case.

asked Oct 05 '10 05:10

Alex Reynolds

1 Answer

Hmmm. I've seen some "Broken pipe" strangeness with subprocess + gzip before. I never did figure out exactly why it was happening, but changing my implementation approach avoided the problem. It looks like you're just trying to use a backend gzip process to decompress a file (probably because Python's built-in gzip module is horrendously slow; no idea why, but it definitely is).

Rather than using communicate(), you can treat the process as a fully asynchronous backend and just read its output as it arrives. When the process dies, the subprocess module will take care of cleaning things up for you. The following snippet should provide the same basic functionality without any broken pipe issues.

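import subprocess

# Start gzip as a background process and capture its decompressed output.
gz_proc = subprocess.Popen(['gzip', '-c', '-d', 'test.gz'], stdout=subprocess.PIPE)

chunks = []
while True:
    # Read up to 4096 bytes; an empty result means gzip has closed its stdout.
    dat = gz_proc.stdout.read(4096)
    if not dat:
        break
    chunks.append(dat)

file_data = ''.join(chunks)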
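
If you only need the first part of the output, the same read loop can stop early and then shut the pipe down itself. Here is a minimal sketch of that idea, not something tested against bgzip. The helper name read_prefix and its parameters max_bytes and chunk_size are made up for illustration. It is written for a current Python; on 2.5 the b'' literal would have to be a plain ''. It also assumes the child exits once its output pipe is closed; a child that keeps running without writing anything would leave wait() blocked.

```python
import subprocess
import sys


def read_prefix(cmd, max_bytes, chunk_size=4096):
    """Run cmd (hypothetical helper) and return at most max_bytes of its stdout."""
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE)
    chunks = []
    remaining = max_bytes
    try:
        while remaining > 0:
            # Never ask for more than we still need.
            dat = proc.stdout.read(min(chunk_size, remaining))
            if not dat:
                # Empty read: the child closed its end of the pipe.
                break
            chunks.append(dat)
            remaining -= len(dat)
    finally:
        # Close our end so the child sees a broken pipe if it writes again,
        # then reap it so no zombie process is left behind.
        proc.stdout.close()
        proc.wait()
    return b''.join(chunks)


# Example: keep only the first 10 bytes written by a small child process.
child = [sys.executable, '-c', "import sys; sys.stdout.write('x' * 1000)"]
data = read_prefix(child, 10)
print(len(data))
```

The try/finally is the point of the sketch: the pipe is closed and the child is waited on whether the loop finishes normally or raises.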

answered Oct 18 '22 08:10

Rakis
