Why doesn't `print` work in Python multiprocessing pool.map

Tags:

multiprocessing

I am trying to implement the multiprocessing module for a working with a large csv file. I am using Python 2.7 and following the example from here.

I ran the unmodified code (copied below for convenience) and noticed that print statements within the worker function do not work. The inability to print makes it difficult to understand the flow and debug.

Can anyone please explain why print is not working here? Does pool.map not execute print commands? I searched online but did not find any documentation that would indicate this.

Click to copy

import multiprocessing as mp
import itertools
import time
import csv

def worker(chunk):
    # `chunk` will be a list of CSV rows all with the same name column
    # replace this with your real computation
    print(chunk)     # <----- nothing prints
    print 'working'  # <----- nothing prints
    return len(chunk)  

def keyfunc(row):
    # `row` is one row of the CSV file.
    # replace this with the name column.
    return row[0]

def main():
    pool = mp.Pool()
    largefile = 'test.dat'
    num_chunks = 10
    results = []
    with open(largefile) as f:
        reader = csv.reader(f)
        chunks = itertools.groupby(reader, keyfunc)
        while True:
            # make a list of num_chunks chunks
            groups = [list(chunk) for key, chunk in
                      itertools.islice(chunks, num_chunks)]
            if groups:
                result = pool.map(worker, groups)
                results.extend(result)
            else:
                break
    pool.close()
    pool.join()
    print(results)

if __name__ == '__main__':
    main()

738

asked May 05 '14 19:05

Roberto

1 Answers

This is an issue with IDLE, which you're using to run your code. IDLE does a fairly basic emulation of a terminal for handling the output of a program you run in it. It cannot handle subprocesses though, so while they'll run just fine in the background, you'll never see their output.

The simplest fix is to simply run your code from the command line.

An alternative might be to use a more sophisticated IDE. There are a bunch of them listed on the Python wiki, though I'm not sure which ones have better terminal emulation for multiprocessing output.

answered Oct 13 '22 12:10

Blckknght

Related questions
                            
                                Crispy Form VariableDoesNotExist on Django
                            
                                How can i change the font on ttk.Entry
                            
                                Geoalchemy2 query all users within X meteres
                            
                                Python Tcp disconnect detection
                            
                                Using greater than operator with subprocess.call
                            
                                Can a python lambda/fn yield on behalf of an arbitrary caller?
                            
                                How to count all positive and negative values in a pandas groupby?
                            
                                Interpret numpy.fft.fft2 output
                            
                                Rest calls with multiple lookup fields for reverse lookup
                            
                                Changing the line color in plot_surface
                            
                                Test the existence of an Element with lxml.objectify
                            
                                Reconnecting MySQL on timeout
                            
                                How to filter a queryset for dates matching a given day?
                            
                                Can I fill web forms with Scrapy?
                            
                                Javascript: unpack object as function parameters
                            
                                Can Python slicing be used to skip one specific element by index?
                            
                                The `uwsgi_modifier1 30` directive is not removing the SCRIPT_NAME from PATH_INFO as documented
                            
                                Python import as tuple
                            
                                sqlite3.ProgrammingError: Cannot operate on a closed database. [Python] [sqlite]
                            
                                Why are some numpy calls not implemented as methods?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With