How can I access a shared dictionary with multiprocessing?

Tags:

python

I think I am following the Python documentation correctly, but I am having trouble getting the result I am looking for. I basically have a list of numbers that are passed to a function containing nested for loops, and the output is saved in a dictionary.

Here's the code:

from multiprocessing import Pool, Manager

list = [1,2,3,10]
dictionary = {}
def test(x, d):
    for xx in range(100):
        for xxx in range(100):
            dictionary[x]=xx*xxx



if __name__ == '__main__':
    pool = Pool(processes=4)
    mgr = Manager()
    d = mgr.dict()
    for N in list:
        pool.apply_async(test, (N, d))

    # Mark pool as closed -- no more tasks can be added.
    pool.close()

    # Wait for tasks to exit
    pool.join()

    # Output results
    print(d)

Here's the expected result:

{1: 9801, 2: 9801, 3: 9801, 10: 9801}

Any suggestions about what I'm doing wrong? Also, I haven't convinced myself that shared resources are the best approach (I'm thinking of using a database to maintain state), so if my approach is completely flawed or there's a better way to do this in Python, please let me know.

asked Feb 13 '12 by Lostsoul


People also ask

Is memory shared in multiprocessing?

shared_memory — Shared memory for direct access across processes. New in version 3.8. This module provides a class, SharedMemory, for the allocation and management of shared memory to be accessed by one or more processes on a multicore or symmetric multiprocessor (SMP) machine.
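A minimal sketch of that API (Python 3.8+); the byte payload here is just for illustration:

from multiprocessing import shared_memory

# Create a 16-byte block and write into it.
shm = shared_memory.SharedMemory(create=True, size=16)
shm.buf[:5] = b"hello"

# A second process could attach to the same block by name.
other = shared_memory.SharedMemory(name=shm.name)
print(bytes(other.buf[:5]))  # b'hello'

other.close()
shm.close()
shm.unlink()  # release the block once every process is done with it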

How data is shared in multiprocessing in Python?

multiprocessing provides two methods of doing this: shared memory (suitable for simple values, arrays, or ctypes) and a Manager proxy, where one process holds the data and a manager arbitrates access to it from other processes (even over a network).
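For example, a counter and an array can be shared through multiprocessing.Value and multiprocessing.Array; a sketch (the worker() function is made up for illustration):

from multiprocessing import Process, Value, Array

def worker(counter, arr):
    with counter.get_lock():  # synchronize updates to the shared int
        counter.value += 1
    for i in range(len(arr)):
        arr[i] = -arr[i]      # changes are visible to the parent

if __name__ == '__main__':
    counter = Value('i', 0)            # shared C int
    arr = Array('d', [1.0, 2.0, 3.0])  # shared C double array
    p = Process(target=worker, args=(counter, arr))
    p.start()
    p.join()
    print(counter.value, arr[:])       # 1 [-1.0, -2.0, -3.0]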

How do you do multiprocessing in Python?

In this example, we first import the Process class, then create a Process object targeting the display() function. The process is started with the start() method and completed with the join() method. We can also pass arguments to the function using the args keyword.
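A sketch of that pattern (display() is the hypothetical target function the answer refers to):

from multiprocessing import Process

def display(name):
    print('Hello,', name)

if __name__ == '__main__':
    p = Process(target=display, args=('world',))
    p.start()  # launch the child process
    p.join()   # wait for it to finish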

What is multiprocessing dummy?

multiprocessing.dummy replicates the API of multiprocessing but is no more than a wrapper around the threading module. That means you're restricted by the Global Interpreter Lock (GIL), and only one thread can actually execute CPU-bound operations at a time. That's going to keep you from fully utilizing your CPUs.
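A sketch of the drop-in API (square() is illustrative; thread pools like this are still useful for I/O-bound work):

# Thread-backed Pool: same interface as multiprocessing.Pool,
# but subject to the GIL for CPU-bound functions.
from multiprocessing.dummy import Pool as ThreadPool

def square(n):
    return n * n

if __name__ == '__main__':
    with ThreadPool(4) as pool:
        print(pool.map(square, [1, 2, 3, 10]))  # [1, 4, 9, 100]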


1 Answer

Change the definition of test to:

def test(x, d):
    for xx in range(100):
        for xxx in range(100):
            d[x] = xx*xxx  # write to the shared dict that was passed in

Otherwise you're just writing to a global dictionary (without synchronization) and never accessing it later.
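Put together, a sketch of the corrected program (same structure as the question; the input list is renamed numbers to avoid shadowing the builtin):

from multiprocessing import Pool, Manager

def test(x, d):
    for xx in range(100):
        for xxx in range(100):
            d[x] = xx*xxx

if __name__ == '__main__':
    numbers = [1, 2, 3, 10]
    pool = Pool(processes=4)
    mgr = Manager()
    d = mgr.dict()
    for N in numbers:
        pool.apply_async(test, (N, d))
    pool.close()
    pool.join()
    print(dict(d))  # {1: 9801, 2: 9801, 3: 9801, 10: 9801}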


As for the general approach, I think this one in particular creates a lot of contention on the shared dictionary. Do you really have to update it from each process that often? Accumulating batches of partial results in each process and updating the shared object only once in a while should perform better, as in the sketch below.
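A sketch of that idea, as a drop-in replacement for test() in the program above (test_batched is a name I made up):

def test_batched(x, d):
    local = {}                 # ordinary dict, private to this worker
    for xx in range(100):
        for xxx in range(100):
            local[x] = xx*xxx
    d.update(local)            # a single round trip to the manager per task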

answered Oct 20 '22 by Eli Bendersky