I have a simple function that I want to run in parallel. If the function is directly specified in the main function, it all works nicely. But if the very same function is called from a separate Python file (that is created to contains a series of helper functions), the code fails with the error: A task has failed to un-serialize. Please ensure that the arguments of the function are all picklable. I have tried to run this code: <pre class="prettyprint"><code>from joblib import Parallel, delayed import multiprocessing import otherFile as of inputs = range(10) def processInput(i): return i * i num_cores = multiprocessing.cpu_count() results1 = Parallel(n_jobs=num_cores)(delayed(processInput)(i) for i in inputs) # this works results2 = Parallel(n_jobs=num_cores)(delayed(of.processInput)(i) for i in inputs) # this fails </code></pre> When I call the function processInput() from the of file I have simply copied the same function in that .py file. <pre class="prettyprint"><code>def processInput(i): return i * i </code></pre> How can I make the parallelization work if the function I need to call is in a separate .py file? This is the full error: <pre class="prettyprint"><code>results = Parallel(n_jobs=num_cores)(delayed(of.processInput)(i) for i in inputs) Traceback (most recent call last): File "<ipython-input-387-d8dd1dc361a6>", line 1, in <module> results = Parallel(n_jobs=num_cores)(delayed(of.processInput)(i) for i in inputs) File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\site-packages\joblib\parallel.py", line 934, in __call__ self.retrieve() File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\site-packages\joblib\parallel.py", line 833, in retrieve self._output.extend(job.get(timeout=self.timeout)) File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\site-packages\joblib\_parallel_backends.py", line 521, in wrap_future_result return future.result(timeout=timeout) File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\concurrent\futures\_base.py", line 432, in result return self.__get_result() File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\concurrent\futures\_base.py", line 384, in __get_result raise self._exception BrokenProcessPool: A task has failed to un-serialize. Please ensure that the arguments of the function are all picklable.* </code></pre>

Not sure if you have checked if the imported function 'of.processInput' works without using multiprocessing? If it doesn't work, then this could be the elephant in the room that others have not pointed out. Maybe you're missing <pre class="prettyprint"><code>__init__.py </code></pre> or maybe it is because the directory is not being seen by Python's <code>import</code> command. To add the directory you can do: <pre class="prettyprint"><code>import sys; sys.path.append("path/to/otherFile/") </code></pre> Though I am not sure if the error message you get is even remotely related to this issue.

Parallel code not working when function to parallelize is in a different file

Tags:

python

parallel-processing

multiprocessing

I have a simple function that I want to run in parallel. If the function is directly specified in the main function, it all works nicely. But if the very same function is called from a separate Python file (that is created to contains a series of helper functions), the code fails with the error:

A task has failed to un-serialize. Please ensure that the arguments of the function are all picklable.

I have tried to run this code:

from joblib import Parallel, delayed
import multiprocessing
import otherFile as of

inputs = range(10) 
def processInput(i):
    return i * i

num_cores = multiprocessing.cpu_count()

results1 = Parallel(n_jobs=num_cores)(delayed(processInput)(i) for i in inputs) # this works
results2 = Parallel(n_jobs=num_cores)(delayed(of.processInput)(i) for i in inputs) # this fails

When I call the function processInput() from the of file I have simply copied the same function in that .py file.

def processInput(i):
    return i * i

How can I make the parallelization work if the function I need to call is in a separate .py file?

This is the full error:

results = Parallel(n_jobs=num_cores)(delayed(of.processInput)(i) for i in inputs)
Traceback (most recent call last):

  File "<ipython-input-387-d8dd1dc361a6>", line 1, in <module>
    results = Parallel(n_jobs=num_cores)(delayed(of.processInput)(i) for i in inputs)

  File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\site-packages\joblib\parallel.py", line 934, in __call__
    self.retrieve()

  File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\site-packages\joblib\parallel.py", line 833, in retrieve
    self._output.extend(job.get(timeout=self.timeout))

  File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\site-packages\joblib\_parallel_backends.py", line 521, in wrap_future_result
    return future.result(timeout=timeout)

  File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\concurrent\futures\_base.py", line 432, in result
    return self.__get_result()

  File "C:\Users\xxxxx\AppData\Local\Continuum\anaconda3\lib\concurrent\futures\_base.py", line 384, in __get_result
    raise self._exception

BrokenProcessPool: A task has failed to un-serialize. Please ensure that the arguments of the function are all picklable.*

682

asked May 10 '19 13:05

opt

1 Answers

Not sure if you have checked if the imported function 'of.processInput' works without using multiprocessing? If it doesn't work, then this could be the elephant in the room that others have not pointed out. Maybe you're missing

__init__.py

or maybe it is because the directory is not being seen by Python's import command. To add the directory you can do:

import sys; sys.path.append("path/to/otherFile/")

Though I am not sure if the error message you get is even remotely related to this issue.

100

answered Oct 20 '22 08:10

Solomon Vimal

Related questions
                            
                                XML differences between WCF and Python SUDS for inheritance?
                            
                                Array order in `numpy.dot`
                            
                                Getting python -m module to work for a module implemented in C
                            
                                Project Euler Number 338
                            
                                Handling Sessions on Google App Engine with Android/IPhone
                            
                                Handle Firefox Not Responding While Using Selenium WebDriver With Python?
                            
                                Ajax POST returning render_template in Flask?
                            
                                Redux: How do I get Jython to use Python modules stored in Lib within its own jar file when running in Hadoop?
                            
                                How to convert requests.cookiejar to qnetworkcookiejar?
                            
                                Using Numpy in different platforms
                            
                                How to create an OUTPUT typemap for a class type?
                            
                                How do I make rdpy-rdpmitm let client re-input username and password when password not incorrect
                            
                                Jupyter notebook dead kernel
                            
                                Why doesn't Spyder obey my IPython config file?
                            
                                Running .exe on Azure
                            
                                Speed up App Engine local SDK DB query when multiple order properties present?
                            
                                app engine python gcloud not updating instance
                            
                                Properly convert png to npy numpy array (Image to Array)
                            
                                Tensorflow server: I don't want to initialize global variables for every session
                            
                                Strange sdl side-effect on unrelated windows

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With