I have sequential code to parallelize via OpenMP. I have put in the corresponding pragmas and tested it. I measure the performance gain by checking the time spent in the main function. The weird thing is the elapsed time calculated via <code>cpu_time()</code> and <code>omp_get_wtime()</code> is different. Why? The elapsed time according to <code>cpu_time()</code> is similar to the sequential time. Before computation starts: <pre class="prettyprint"><code>ctime1_ = cpu_time(); #ifdef _OPENMP ctime1 = omp_get_wtime(); #endif </code></pre> After computation ends: <pre class="prettyprint"><code>ctime2_ = cpu_time(); #ifdef _OPENMP ctime2 = omp_get_wtime(); #endif </code></pre> cpu_time() function definition: <pre class="prettyprint"><code>double cpu_time(void) { double value; value = (double) clock () / (double) CLOCKS_PER_SEC; return value; } </code></pre> Printing result: <pre class="prettyprint"><code>printf("%f - %f seconds.\n", ctime2 - ctime1, ctime2_ - ctime1_); </code></pre> Sample result: <pre class="prettyprint"><code>7.009537 - 11.575277 seconds. </code></pre>

What you observe is a perfectly valid result for any parallel application - the combined CPU time of all threads as returned by <code>clock()</code> is usually more than the wallclock time measured by <code>omp_get_wtime()</code> except if your application mostly sleeps or waits.

OpenMP time and clock() give two different results

Tags:

c

openmp

I have sequential code to parallelize via OpenMP. I have put in the corresponding pragmas and tested it. I measure the performance gain by checking the time spent in the main function.

The weird thing is the elapsed time calculated via cpu_time() and omp_get_wtime() is different. Why?

The elapsed time according to cpu_time() is similar to the sequential time.

Before computation starts:

ctime1_ = cpu_time();
#ifdef _OPENMP
ctime1 = omp_get_wtime();
#endif

After computation ends:

ctime2_ = cpu_time();
#ifdef _OPENMP
ctime2 = omp_get_wtime();
#endif

cpu_time() function definition:

double cpu_time(void)
{
  double value;
  value = (double) clock () / (double) CLOCKS_PER_SEC;
  return value;
}

Printing result:

printf("%f - %f seconds.\n", ctime2 - ctime1, ctime2_ - ctime1_);

Sample result:

7.009537 - 11.575277 seconds.

658

asked May 20 '12 13:05

mert

2 Answers

What you observe is a perfectly valid result for any parallel application - the combined CPU time of all threads as returned by clock() is usually more than the wallclock time measured by omp_get_wtime() except if your application mostly sleeps or waits.

answered Oct 21 '22 23:10

Hristo Iliev

The clock function measures cpu time, the time you spend actively on the CPU, the OMP function measures the time as it has passed during execution, two completely different things.

Your process seems to be blocked in waiting somewhere.

answered Oct 21 '22 23:10

Jens Gustedt

Related questions
                            
                                _file_ or _line_ similar in golang
                            
                                Triple pointers in C: is it a matter of style?
                            
                                C strlen() implementation in one line of code
                            
                                Go TCP read is non blocking
                            
                                Using the open() system call
                            
                                Git client on the iPhone, possible? How?
                            
                                Does a string created with 'strcpy' need to be freed?
                            
                                Python equivialent of C programming techniques (while loops)
                            
                                comparing python with c/fortran
                            
                                Why sizeof(array) and sizeof(&array[0]) gives different results?
                            
                                C - is an indeterminate value indeterminable?
                            
                                Precedence of && over || [duplicate]
                            
                                How to write a Makefile to compile a simple C program
                            
                                How does logical negation work in C?
                            
                                How to initialize an unsigned long long type?
                            
                                How do I implement a bit array in C / Objective C
                            
                                Accessing elements of a matrix row-wise versus column-wise
                            
                                If fclose(0) is called, does this close stdin?
                            
                                3 plus symbols between two variables (like a+++b) in C [duplicate]
                            
                                Bitwise operator to get byte from 32 bits

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With