I am using distributed, a framework for parallel computation. My primary use case is with NumPy. When I run NumPy code that relies on np.linalg, I get an error mentioning OMP_NUM_THREADS, which is related to the OpenMP library.
A minimal example:
from distributed import Executor
import numpy as np

e = Executor('144.92.142.192:8786')

def f(x, m=200, n=1000):
    A = np.random.randn(m, n)
    x = np.random.randn(n)  # note: shadows the argument x
    # return np.fft.fft(x)       # tested; no errors
    # return np.random.randn(n)  # tested; no errors
    return A.dot(x).sum()        # tested; throws error below

s = [e.submit(f, x) for x in [1, 2, 3, 4]]
s = e.gather(s)
When I test the linear algebra case (the A.dot(x).sum() line), e.gather fails because each job throws the following error:
OMP: Error #34: System unable to allocate necessary resources for OMP thread:
OMP: System error #11: Resource temporarily unavailable
OMP: Hint: Try decreasing the value of OMP_NUM_THREADS.
What should I set OMP_NUM_THREADS to?
export OMP_NUM_THREADS=1
or
dask-worker --nthreads 1
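If you can't easily edit the worker's shell environment, here is a minimal sketch of the same idea in Python. It assumes the variable is set before NumPy (and hence its BLAS backend) is imported, since most BLAS libraries read it once at load time:

import os

# Must happen before numpy is imported: BLAS reads OMP_NUM_THREADS at load time.
os.environ["OMP_NUM_THREADS"] = "1"

import numpy as np  # BLAS now runs single-threaded inside each task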
The OMP_NUM_THREADS environment variable controls the number of threads that many libraries, including the BLAS library powering numpy.dot, use in their computations, like matrix multiply.
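If you want to keep multi-threaded BLAS for the rest of your program and only throttle it inside the task, one option (an assumption on my part, not part of the answer itself) is the third-party threadpoolctl package, which caps the BLAS thread pool for a single block of code:

import numpy as np
from threadpoolctl import threadpool_limits  # third-party: pip install threadpoolctl

def f(x, m=200, n=1000):
    A = np.random.randn(m, n)
    x = np.random.randn(n)
    # Limit the BLAS/OpenMP pool to one thread just for this dot product.
    with threadpool_limits(limits=1, user_api="blas"):
        return A.dot(x).sum()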
The conflict here is that you have two parallel libraries nested inside each other: BLAS and dask.distributed. Each library is designed to use as many threads as there are logical cores available in the system.
For example, if you had eight cores then dask.distributed might run your function f eight times at once on different threads. The numpy.dot function call within f would use eight threads per call, resulting in 64 threads running at once.
This is actually fine in principle; everything can run correctly, but it will be slower than if you used just eight threads at a time, either by limiting dask.distributed or by limiting BLAS.
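If you'd rather limit the dask side from Python instead of the dask-worker command line, here is a sketch assuming a local cluster rather than the remote scheduler in the question (in newer versions of distributed, Executor was renamed Client):

from distributed import Client

# One task at a time per worker process, so BLAS inside each task can use
# its own threads without stacking on top of dask's thread pool.
client = Client(n_workers=8, threads_per_worker=1)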
Your system probably has OMP_THREAD_LIMIT set to some reasonable number like 16 to warn you when this kind of oversubscription happens.
If you're using MKL BLAS you might also get some improvement using the TBB threading layer. I haven't actually had occasion to try it out, so YMMV.
http://conference.scipy.org/proceedings/scipy2018/anton_malakhov.html
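As a further hedged sketch (the MKL_THREADING_LAYER variable is Intel MKL's own switch, not something the answer above mentions), you can ask MKL to thread through TBB before NumPy is imported:

import os

# Switch MKL's internal threading from OpenMP to Intel TBB, which composes
# better when many tasks share one machine. Set before numpy loads MKL.
os.environ["MKL_THREADING_LAYER"] = "TBB"

import numpy as np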