I'm working with OpenMP to parallelize a scalar nested for loop:
double P[N][N];
double x=0.0, y=0.0;
for (int i=0; i<N; i++)
{
    for (int j=0; j<N; j++)
    {
        P[i][j]=someLongFunction(x,y);
        y+=1;
    }
    x+=1;
}
The important thing in this loop is that the matrix P must be identical in the scalar and parallel versions.
None of my attempts so far have succeeded...
If you parallelize the inner loop, you will not see a performance gain, because the small amount of work the inner loop performs does not outweigh the overhead of parallel processing. Parallelizing only the outer loop is therefore the best way to maximize the benefit of concurrency on most systems.
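For contrast, here is a sketch of the inner-loop placement that paragraph warns against (it uses the dependency-free loop body derived below): the worksharing construct, and the implicit barrier at its end, is executed once per row, N times in total, instead of once.

for (int i=0; i<N; i++)
{
    // Entered N times: every row forks out work and ends with an implicit barrier.
    #pragma omp parallel for
    for (int j=0; j<N; j++)
    {
        P[i][j]=someLongFunction((double)i, (double)N*i + j);
    }
}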
The problem here is that you have added iteration-to-iteration dependencies with:
x+=1;
y+=1;
Therefore, as the code stands right now, it is not parallelizable: attempting it anyway will produce incorrect results (as you are probably seeing).
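To see why, consider what the naive attempt does. This is a sketch of the incorrect version, not a fix: x and y are shared between the threads by default, so every thread reads and increments them concurrently, and the arguments passed to someLongFunction become nondeterministic.

#pragma omp parallel for   // WRONG: x and y are shared across all threads
for (int i=0; i<N; i++)
{
    for (int j=0; j<N; j++)
    {
        P[i][j]=someLongFunction(x,y);  // data race: x and y are being
        y+=1;                           // updated by every thread at once
    }
    x+=1;
}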
Fortunately, in your case, you can compute them directly without introducing this dependency: by the time the loop reaches element (i, j), x has been incremented exactly i times and y exactly N*i + j times:
for (int i=0; i<N; i++)
{
    for (int j=0; j<N; j++)
    {
        P[i][j]=someLongFunction((double)i, (double)N*i + j);
    }
}
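If you want to convince yourself of the index arithmetic, a quick scalar check makes it concrete (a sketch; it needs <assert.h> and assumes N is defined as in your code):

#include <assert.h>

double x=0.0, y=0.0;
for (int i=0; i<N; i++)
{
    for (int j=0; j<N; j++)
    {
        assert(x == (double)i);        // x counts completed outer iterations
        assert(y == (double)N*i + j);  // y counts all inner iterations so far
        y+=1;
    }
    x+=1;
}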
Now you can try throwing an OpenMP pragma over this and see if it works:
#pragma omp parallel for
for (int i=0; i<N; i++)
{
    for (int j=0; j<N; j++)
    {
        P[i][j]=someLongFunction((double)i, (double)N*i + j);
    }
}
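Since your requirement is that P match the scalar version exactly, it is worth checking that directly. Below is a minimal harness sketch (it assumes someLongFunction is deterministic; memcmp gives a bit-for-bit comparison). Compile with -fopenmp on GCC/Clang or /openmp on MSVC.

#include <stdio.h>
#include <string.h>

double P_scalar[N][N], P_parallel[N][N];

/* Original scalar version */
double x=0.0, y=0.0;
for (int i=0; i<N; i++)
{
    for (int j=0; j<N; j++)
    {
        P_scalar[i][j]=someLongFunction(x,y);
        y+=1;
    }
    x+=1;
}

/* Parallel rewrite */
#pragma omp parallel for
for (int i=0; i<N; i++)
    for (int j=0; j<N; j++)
        P_parallel[i][j]=someLongFunction((double)i, (double)N*i + j);

if (memcmp(P_scalar, P_parallel, sizeof P_scalar) == 0)
    printf("P matches the scalar version\n");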