I have two functions, <code>do_step_one(i)</code> and <code>do_step_two(i)</code>, for <code>i</code> from <code>0</code> to <code>N-1</code>. Currently, I have this (sequential) code: <pre class="prettyprint"><code>for(unsigned int i=0; i<N; i++) { do_step_one(i); } for(unsigned int i=0; i<N; i++) { do_step_two(i); } </code></pre> Each call of <code>do_step_one()</code> and <code>do_step2()</code> can be done in any order and in parallel, but any <code>do_step_two()</code> needs the end of all the <code>do_step_one()</code> to start (it use <code>do_step_one()</code> results). I tried the following : <pre class="prettyprint"><code>#omp parallel for for(unsigned int i=0; i<N; i++) { do_step_one(i); #omp barrier do_step_two(i); } </code></pre> But gcc complains <blockquote> convolve_slices.c:21: warning: barrier region may not be closely nested inside of work-sharing, critical, ordered, master or explicit task region. </blockquote> What do I misunderstand? How to solve that issue?

Just a side note, if you want to make sure the threads are not recreated, separate the declaration of parallel and declaration of for: <pre class="prettyprint"><code>#pragma omp parallel { #pragma omp for for(unsigned int i=0; i<N; i++){ do_step_one(i); } //implicit barrier here #pragma omp for for(unsigned int i=0; i<N; i++){ do_step_two(i); } } </code></pre>

How to create an `omp parallel for` with synchronization (`barrier`) of all threads in the middle with OpenMP

Tags:

openmp

I have two functions, do_step_one(i) and do_step_two(i), for i from 0 to N-1.

Currently, I have this (sequential) code:

for(unsigned int i=0; i<N; i++) {
     do_step_one(i);
}

for(unsigned int i=0; i<N; i++) {
     do_step_two(i);
}

Each call of do_step_one() and do_step2() can be done in any order and in parallel, but any do_step_two() needs the end of all the do_step_one() to start (it use do_step_one() results).

I tried the following :

#omp parallel for
for(unsigned int i=0; i<N; i++) {
    do_step_one(i);

#omp barrier

    do_step_two(i);
}

But gcc complains

convolve_slices.c:21: warning: barrier region may not be closely nested inside of work-sharing, critical, ordered, master or explicit task region.

What do I misunderstand? How to solve that issue?

923

asked Nov 10 '09 18:11

Guillaume Bouchard

1 Answers

Just a side note, if you want to make sure the threads are not recreated, separate the declaration of parallel and declaration of for:

#pragma omp parallel
{
  #pragma omp for
  for(unsigned int i=0; i<N; i++){
    do_step_one(i);
  }
  //implicit barrier here
  #pragma omp for
  for(unsigned int i=0; i<N; i++){
    do_step_two(i);
  }
}

answered Sep 26 '22 01:09

Jason

Related questions
                            
                                OpenMP: run two functions in parallel, each by half of thread pool
                            
                                Splitting up a program into 4 threads is slower than a single thread
                            
                                What limits scaling in this simple OpenMP program?
                            
                                How to parallelize do while and while loop in openmp?
                            
                                parallelize inner loop using openmp
                            
                                OpenMP Several "shared"-directives?
                            
                                the OpenMP "master" pragma must not be enclosed by the "parallel for" pragma
                            
                                Parallelizing a for loop using openmp & replacing push_back
                            
                                Memory management while using threads
                            
                                C++ thread-safe uniform distribution random number generation
                            
                                Can I assign multiple threads to a code section in OpenMP?
                            
                                Does `std::mutex` and `std::lock` guarantee memory synchronisation in inter-processor code?
                            
                                How do OpenMP, MPI, POSIX threads, std::thread, boost::thread correlate?
                            
                                Fortran OpenMP with subroutines and functions
                            
                                Multithreaded & SIMD vectorized Mandelbrot in R using Rcpp & OpenMP
                            
                                How to tell if OpenMP works in my C++ program
                            
                                Conditional "pragma omp"
                            
                                Parallel computing -- jumbled up output?
                            
                                Using OpenMP stops GCC auto vectorising
                            
                                N-body algorithm: why is this slower in parallel?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With