Implement Parallel for loops in Python

Tags:

I have a Python program which looks like this:

total_error = []
for i in range(24):
    error = some_function_call(parameters1, parameters2)
    total_error += error

The function 'some_function_call' takes a lot of time and I can't find an easy way to reduce time complexity of the function. Is there a way to still reduce the execution time while performing parallel tasks and later adding them up in total_error. I tried using pool and joblib but could not successfully use either.

415

asked Jan 03 '18 17:01

thechargedneutron

2 Answers

You can use python multiprocessing:

from multiprocessing import Pool, freeze_support, cpu_count
import os

all_args = [(parameters1, parameters2) for i in range(24)]

# call freeze_support() if in Windows
if os.name == "nt":
    freeze_support()

# you can use whatever, but your machine core count is usually a good choice (although maybe not the best)
pool = Pool(cpu_count()) 

def wrapped_some_function_call(args): 
    """
    we need to wrap the call to unpack the parameters 
    we build before as a tuple for being able to use pool.map
    """ 
    sume_function_call(*args) 

results = pool.map(wrapped_some_function_call, all_args)
total_error = sum(results)

answered Sep 16 '22 11:09

Netwave

You can also use concurrent.futures in Python 3, which is a simpler interface than multiprocessing. See this for more details about differences.

from concurrent import futures

total_error = 0

with futures.ProcessPoolExecutor() as pool:
  for error in pool.map(some_function_call, parameters1, parameters2):
    total_error += error

In this case, parameters1 and parameters2 should be a list or iterable of the same size as the number of times you want to run the function (24 times as per your example).

If paramters<1,2> are not iterables/mappable, but you just want to run the function 24 times, you can submit the jobs for the function for the required number of times, and later acquire the result using a callback.

class TotalError:
    def __init__(self):
        self.value = 0

    def __call__(self, r):
        self.value += r.result()

total_error = TotalError()
with futures.ProcessPoolExecutor() as pool:
  for i in range(24):
    future_result = pool.submit(some_function_call, parameters1, parameters2)
    future_result.add_done_callback(total_error)

print(total_error.value)

answered Sep 18 '22 11:09

Gerges

Related questions
                            
                                Selenium Python wait for text to be present in element error shows takes 3 arguments 2 given
                            
                                exec() not working inside function python3.x
                            
                                SQLAlchemy: __init__() takes 1 positional argument but 2 were given (many to many)
                            
                                Decimal Python vs. float runtime
                            
                                Python 3: gzip.open() and modes
                            
                                Python negate boolean function
                            
                                loop through folder in python and open files throws an error
                            
                                Why autocompletion options in Spyder 3.1 are not fully working in the Editor?
                            
                                Flush output in for loop in Jupyter notebook
                            
                                ImportError: cannot import name 'PandasError'
                            
                                NotFittedError: TfidfVectorizer - Vocabulary wasn't fitted
                            
                                split string in python to get one value?
                            
                                How to round values only for display in pandas while retaining original ones in the dataframe?
                            
                                How to use the infer_vector in gensim.doc2vec?
                            
                                Pandas dataframe to Spark dataframe, handling NaN conversions to actual null?
                            
                                How to plot pandas groupby values in a graph
                            
                                $ python -bash: /usr/local/bin/python: No such file or directory
                            
                                Redrawing Seaborn Figures for Animations
                            
                                Shade the area between two axhline using matplotlib
                            
                                Python - How to check if socket is still connected

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Implement Parallel for loops in Python

Tags:

python

python-3.x

parallel-processing

python-2.7

thechargedneutron

People also ask

2 Answers

Netwave

Gerges

Recent Activity

Donate For Us