How to use multiprocessing.Pool in an imported module?

I have not been able to implement the suggestion here: Applying two functions to two lists simultaneously.

I guess it is because the module is imported by another module, so on Windows it spawns multiple Python processes?

My question is: how can I use the code below without the if __name__ == "__main__": guard?

args_m = [(mortality_men, my_agents, graveyard, families, firms, year, agent) for agent in males]
args_f = [(mortality_women, fertility, year, families, my_agents, graveyard, firms, agent) for agent in females]

with mp.Pool(processes=(mp.cpu_count() - 1)) as p:
    p.map_async(process_males, args_m)
    p.map_async(process_females, args_f)

Both process_males and process_females are functions; args_m and args_f are lists of argument tuples.

Also, I don't need to return anything. Agents are class instances that need updating.

asked Mar 04 '17 23:03

B Furtado

1 Answer

The reason you need to guard multiprocessing code with if __name__ == "__main__" is that you don't want it to run again in each child process. That can happen on Windows, where there is no fork system call to copy the parent process's address space, so each child starts a fresh interpreter and re-imports the main module to rebuild its state. The guard is only needed where code would otherwise run at the top level of the main script, and it's not the only way to protect your code.
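To make the re-import visible, here is a minimal sketch that forces the "spawn" start method (the one Windows uses) so the behaviour can be reproduced on other platforms too. The names square and main are placeholders for illustration, not part of the question's code.

```python
import multiprocessing as mp
import os

# Module-level code: under "spawn" each worker re-imports this file,
# so this line runs in the parent and again in every worker.
print(f"module body running in process {os.getpid()}")


def square(x):
    """Trivial worker function (hypothetical stand-in for real work)."""
    return x * x


def main():
    # Ask for the spawn start method explicitly instead of the platform default.
    ctx = mp.get_context("spawn")
    with ctx.Pool(processes=2) as pool:
        # map() blocks until all results are back, in input order.
        return pool.map(square, [1, 2, 3])


if __name__ == "__main__":
    # Workers import this file under a different module name, so this
    # block is skipped there and no nested pools are created.
    print(main())
```

Saved as a script and run directly, the "module body running" line should appear once for the parent and once per worker, while the pool itself is only created in the parent.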

In your specific case, I think you should put the multiprocessing code in a function. Defining the function doesn't run it, so the pool won't be created again in a child process as long as nothing calls the function at import time. Your main module can import the module, then call the function (from within an if __name__ == "__main__" block, probably).

It should be something like this:

some_module.py:

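import multiprocessing as mp

def process_males(x):
    ...

def process_females(x):
    ...

args_m = [...] # these could be defined inside the function below if that makes more sense
args_f = [...]

def do_stuff():
    with mp.Pool(processes=(mp.cpu_count() - 1)) as p:
        males = p.map_async(process_males, args_m)
        females = p.map_async(process_females, args_f)
        # leaving the with block terminates the pool, so wait for both batches first
        males.wait()
        females.wait()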

main.py:

import some_module

if __name__ == "__main__":
    some_module.do_stuff()

In your real code you might want to pass some arguments or get a return value from do_stuff (which should also be given a more descriptive name than the generic one I've used in this example).
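As a rough sketch of what passing arguments and returning a value could look like: the module name simulation.py, the functions age_agent, worker_count and update_agents, and the dict-based agents are all made up for illustration. One thing it relies on is that each worker receives a pickled copy of its arguments, so changes made to an agent inside a worker only reach the parent if they are returned.

```python
# simulation.py (hypothetical module name)
import multiprocessing as mp


def age_agent(args):
    """Worker: takes one (agent, year) tuple and returns the updated agent."""
    agent, year = args
    # The worker only sees a copy, so hand the new state back explicitly.
    return {**agent, "age": year - agent["born"]}


def worker_count():
    """Leave one core free, but never request fewer than one process."""
    return max(1, mp.cpu_count() - 1)


def update_agents(males, females, year):
    """Run both groups through one pool and return the updated lists."""
    args_m = [(agent, year) for agent in males]
    args_f = [(agent, year) for agent in females]
    with mp.Pool(processes=worker_count()) as pool:
        pending_m = pool.map_async(age_agent, args_m)
        pending_f = pool.map_async(age_agent, args_f)
        # get() blocks until each batch is done; it has to be called
        # before the with block exits, because exiting terminates the pool.
        return pending_m.get(), pending_f.get()


if __name__ == "__main__":
    # In a two-file layout this part would live in main.py,
    # after "import simulation".
    males = [{"name": "a", "born": 1980}]
    females = [{"name": "b", "born": 1990}]
    new_males, new_females = update_agents(males, females, 2017)
    print(new_males, new_females)
```

Both map_async calls are submitted before either result is awaited, so the two batches can share the pool's workers. The max(1, ...) in worker_count is there because Pool rejects a process count below one, which cpu_count() - 1 would produce on a single-core machine.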

answered Sep 21 '22 09:09

Blckknght

Tags: python, multiprocessing, python-multiprocessing