Question about the reduction clause in OpenACC

juanx002 · July 27, 2013, 10:42pm

Hi, Everyone,

I have a question when reading through a webpage about combing both OpenACC and OpenMP into one single program unit at the Dr.Dobb’s website (http://www.drdobbs.com/parallel/the-openacc-execution-model/240006334?pgno=2). The code snippet of concerned is excerpted to show in the text below. Can anyone let me know why the reduction clause (i.e., reduction(+:tmp)) of the OpenACC pragma is missing from line 16, while the same reduction clause (for the same loop as line 16) remains invoked by OpenMP in line 15?

Thanks,
Li

1  void gramSchmidt(restrict float Q[][COLS], const int rows, const int cols) 
2  {
3  #pragma acc data copy(Q[0:rows][0:cols])
4   for(int k=0; k < cols; k++) {
5      double tmp = 0.;
6  #pragma omp parallel for reduction(+:tmp)
7  #pragma acc parallel reduction(+:tmp)
8      for(int i=0; i < rows; i++) tmp +=  (Q[i][k] * Q[i][k]);
9      tmp = sqrt(tmp);
10      
11 #pragma omp parallel for
12 #pragma acc parallel loop
13    for(int i=0; i < rows; i++) Q[i][k] /= tmp;
14      
15 #pragma omp parallel for reduction(+:tmp)
16 #pragma acc parallel loop
17     for(int j=k+1; j < cols; j++) {
18       tmp=0.;
19       for(int i=0; i < rows; i++) tmp += Q[i][k] * Q[i][j];
20       for(int i=0; i < rows; i++) Q[i][j] -= tmp * Q[i][k];
21     }
22   }
23 }

MatColgrove · July 29, 2013, 4:53pm

Hi Li,

To me, the question is not why it’s missing from OpenACC but why it’s included for OpenMP.

Only the outer loop is parallelized making the inner loops sequential. The OpenACC reduction clause is only needed when making parallel reductions since this requires extra code to set-up a partial reduction and then launch a second kernel to perform the final reduction.

I’ll send a note to Rob and ask if he’ll clarify his intent here.

Best Regards,
Mat

Topic		Replies	Views
Reduction clause Legacy PGI Compilers	2	1959	April 23, 2013
Reduction results in wrong results. Bug? Legacy PGI Compilers	8	7688	January 24, 2014
OpenACC reductions Legacy PGI Compilers	1	2483	March 26, 2012
should use to "acc reduction" in an inner loop Legacy PGI Compilers	4	4236	December 6, 2012
Proper OpenACC reduction clause on many loops within "parallel" region nvc, nvc++ and nvfortran	1	423	March 6, 2021
reduction clause Legacy PGI Compilers	2	3059	May 26, 2014
Parallel construct reductions Legacy PGI Compilers	3	4133	January 25, 2014
reduction within "!$acc kernels loop" ? Legacy PGI Compilers	8	5459	January 11, 2013
Reduction prevents parallel execution on two GPUs Legacy PGI Compilers	5	5736	March 11, 2014
7.1-1 OpenMP reduction: PGC-S-0000-Internal compiler error Legacy PGI Compilers	1	3693	November 7, 2007

Question about the reduction clause in OpenACC

Related topics