Assume I have a method that multiplies two <code>std::vector</code> : <pre class="prettyprint"><code>double multiply(std::vector<double> const& a, std::vector<double> const& b){ double tmp(0); /*here I could easily do a parallelization with*/ /*#pragma omp parallel loop for*/ for(unsigned int i=0;i<a.size();i++){ tmp += a[i]*b[i]; } return tmp; } </code></pre> If I set in this function the pragma macro, a call to <code>multiply(...)</code> will run on all threads. Now assume that somewehere else I want to do many vector multiplication : <pre class="prettyprint"><code>void many_multiplication(std::vector<double>* a, std::vector<double>* b, unsigned int N){ /*here I could easily do a parallelization with*/ /*#pragma omp parallel loop for*/ for(unsigned int i=0;i<N;i++){ for(unsigned int j=0;j<N;j++){ multiply(a[i],b[j]); } } } </code></pre> I could also do the parallelization the same way. But this will lead to unwanted nested parallelism. How can I check that if <code>multiply(..)</code> is called within a parallel region, then the <code>pragma</code> macro of <code>multiply(...)</code> is "turn off". And if it's called from a non-parallel region, then it's "turn on".

Nested parallelism is disabled by default, unless enabled specificially by setting <code>OMP_NESTED</code> to <code>true</code> or by calling <code>omp_set_nested(1);</code> (§2.3.2 of the OpenMP specification) Explicitly modifying the settings for nesting as suggested by Avi Ginsburg is a bad idea. Instead, you should use conditional parallel execution based on the level of nesting: <pre class="prettyprint"><code>double multiply(std::vector<double> const& a, std::vector<double> const& b){ double tmp(0); int active_levels = omp_get_active_level(); #pragma omp parallel for reduction(+:tmp) if(active_level < 1) for(unsigned int i=0;i<a.size();i++){ tmp += a[i]+b[i]; } return tmp; } </code></pre> <code>omp_get_active_level()</code> returns the number of active parallel regions that enclose the thread at the moment the call is made. It returns <code>0</code> if called from outside a parallel region or with inactive outer region(s). Thanks to the <code>if(active_level < 1)</code> clause, the parallel region will only be activated, i.e. run in parallel, if it is not enclosed in an active region, regardless of the setting for nesting. If your compiler does not support OpenMP 3.0 or higher (e.g. with any version of MS Visual C/C++ Compiler), then <code>omp_in_parallel()</code> call can be used instead: <pre class="prettyprint"><code>double multiply(std::vector<double> const& a, std::vector<double> const& b){ double tmp(0); int in_parallel = omp_in_parallel(); #pragma omp parallel for reduction(+:tmp) if(in_parallel == 0) for(unsigned int i=0;i<a.size();i++){ tmp += a[i]+b[i]; } return tmp; } </code></pre> <code>omp_in_parallel()</code> returns non-zero if at least one enclosing parallel region is active, but does not provide information about the depth of nesting, i.e. is a bit less flexible. In any case, writing such code is a bad practice. You should simply leave the parallel regions as they are and allow the end user choose whether nested parallelism should be enabled or not.

Add the pragma to both functions. You can turn the nested parallelism on and off with <code>omp_set_nested(int val)</code> (zero for off, non-zero for on). So, if you wanted nested parallelism on in your program in general, but off for the <code>many_multiplication</code> function, you would implement <code>many_multiplication</code> as follows: <pre class="prettyprint"><code>void many_multiplication(std::vector<double>* a, std::vector<double>* b, unsigned int N){ omp_set_nested(0); #pragma omp parallel loop for for(unsigned int i=0;i<N;i++){ for(unsigned int j=0;j<N;j++){ multiply(a[i],b[j]); } } omp_set_nested(1); } </code></pre>

openmp : check if nested parallesim

Tags:

c++

openmp

Assume I have a method that multiplies two std::vector :

Click to copy

double multiply(std::vector<double> const& a, std::vector<double> const& b){
    double tmp(0);
    /*here I could easily do a parallelization with*/
    /*#pragma omp parallel loop for*/
    for(unsigned int i=0;i<a.size();i++){
        tmp += a[i]*b[i];
    }
    return tmp;
}

If I set in this function the pragma macro, a call to multiply(...) will run on all threads.

Now assume that somewehere else I want to do many vector multiplication :

Click to copy

void many_multiplication(std::vector<double>* a, std::vector<double>* b, unsigned int N){
    /*here I could easily do a parallelization with*/
    /*#pragma omp parallel loop for*/
    for(unsigned int i=0;i<N;i++){
        for(unsigned int j=0;j<N;j++){
            multiply(a[i],b[j]);
        }
    }
}

I could also do the parallelization the same way. But this will lead to unwanted nested parallelism.

How can I check that if multiply(..) is called within a parallel region, then the pragma macro of multiply(...) is "turn off". And if it's called from a non-parallel region, then it's "turn on".

765

asked Jul 20 '15 15:07

PinkFloyd

2 Answers

Nested parallelism is disabled by default, unless enabled specificially by setting OMP_NESTED to true or by calling omp_set_nested(1); (§2.3.2 of the OpenMP specification) Explicitly modifying the settings for nesting as suggested by Avi Ginsburg is a bad idea. Instead, you should use conditional parallel execution based on the level of nesting:

Click to copy

double multiply(std::vector<double> const& a, std::vector<double> const& b){
    double tmp(0);
    int active_levels = omp_get_active_level();
    #pragma omp parallel for reduction(+:tmp) if(active_level < 1)
    for(unsigned int i=0;i<a.size();i++){
        tmp += a[i]+b[i];
    }
    return tmp;
}

omp_get_active_level() returns the number of active parallel regions that enclose the thread at the moment the call is made. It returns 0 if called from outside a parallel region or with inactive outer region(s). Thanks to the if(active_level < 1) clause, the parallel region will only be activated, i.e. run in parallel, if it is not enclosed in an active region, regardless of the setting for nesting.

If your compiler does not support OpenMP 3.0 or higher (e.g. with any version of MS Visual C/C++ Compiler), then omp_in_parallel() call can be used instead:

Click to copy

double multiply(std::vector<double> const& a, std::vector<double> const& b){
    double tmp(0);
    int in_parallel = omp_in_parallel();
    #pragma omp parallel for reduction(+:tmp) if(in_parallel == 0)
    for(unsigned int i=0;i<a.size();i++){
        tmp += a[i]+b[i];
    }
    return tmp;
}

omp_in_parallel() returns non-zero if at least one enclosing parallel region is active, but does not provide information about the depth of nesting, i.e. is a bit less flexible.

In any case, writing such code is a bad practice. You should simply leave the parallel regions as they are and allow the end user choose whether nested parallelism should be enabled or not.

156

answered Sep 25 '22 14:09

Hristo Iliev

Add the pragma to both functions. You can turn the nested parallelism on and off with omp_set_nested(int val) (zero for off, non-zero for on).

So, if you wanted nested parallelism on in your program in general, but off for the many_multiplication function, you would implement many_multiplication as follows:

Click to copy

void many_multiplication(std::vector<double>* a, std::vector<double>* b, unsigned int N){
    omp_set_nested(0);
    #pragma omp parallel loop for
    for(unsigned int i=0;i<N;i++){
        for(unsigned int j=0;j<N;j++){
            multiply(a[i],b[j]);
        }
    }
    omp_set_nested(1);
}

answered Sep 25 '22 14:09

Avi Ginsburg

Related questions
                            
                                struct constructor will take space within the struct space?
                            
                                Is it valid to pass non-arithmetic types as arguments to cmath functions?
                            
                                asio lambda with unique_ptr capture
                            
                                How to detect whether some callable takes a rvalue reference?
                            
                                Communication between objects in C++
                            
                                can memcpy for std::aligned_storage?
                            
                                Variable between two source files (class & global)
                            
                                C++11 std::thread join crashes with system_error exception and SIGABRT on Xcode 6?
                            
                                Can assignment from a const_iterator dereference cause undefined behaviour?
                            
                                How to send a set object in MPI_Send
                            
                                Does the draw order affects objects position in depth? (images included)
                            
                                C++ Order of Evaluation of Subexpressions with Logical Operators
                            
                                How to increase throughput of Boost ASIO, UDP client application
                            
                                global declarations/initializations using static, const, constexpr
                            
                                Using inheritance to add functionality
                            
                                Is A Member Function Thread Safe?
                            
                                How to find and avoid uninitialised primitive members in C++?
                            
                                Qt removing stretches from a QHBoxLayout
                            
                                How to define a nested class outside its parent in C++
                            
                                Did I understand correctly the point of Scott Meyers' example of std::weak_ptr?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

openmp : check if nested parallesim

Tags:

c++

openmp

PinkFloyd

People also ask

2 Answers

Hristo Iliev

Avi Ginsburg

Recent Activity

Donate For Us