Serialize inner loop (CUDA C)

WalS · November 28, 2011, 11:13pm

Hello,

How to make sure that inner loop in a nested for loop does not get parallelized? I tried to use “seq”, but somehow compiler seems to ignore it. What is a right way to use it? A sample code is given below.

#pragma acc region
{
#pragma for independent
for ( int i =0; i <outer; i ++) {
     int temp = 0;
     for ( int y=0; y<inner ; y++) { // make this loop execute serially
          temp += array[y][i];
     }
     final[i] = temp;
}

}

Thanks!

MatColgrove · November 28, 2011, 11:22pm

Hi WalS,

You can try using the “kernel” clause on the outer loop.

#pragma for independent, kernel
for ( int i =0; i <outer; i ++) {

Though, you probably don’t need the independent clause here.

Hope this helps,
Mat

WalS · November 28, 2011, 11:45pm

Hi Mat,

Thanks for quick reply. I had also tried using “kernel”. Still the loop is parallelized by compiler. After your suggestion, I tried it without “independent” clause, but no use.

Any other suggestions?

Thank you!

MatColgrove · November 29, 2011, 12:57am

Hi WalS,

Can you please post a reproducing example? I’ll need to see the code in context to get a better idea of what’s going on.

Thanks,
Mat

WalS · November 29, 2011, 9:09am

Hello Mat,

It was a stupid mistake of checking wrong compiler report. It is getting compiled correctly. Sorry for the trouble. Thanks a ton!

Topic		Replies	Views
OpenACC and nested loops Legacy PGI Compilers	2	4026	September 19, 2014
Nested loops in C Legacy PGI Compilers	2	3671	September 9, 2010
Programming Problem: force the inner loop run as sequential Legacy PGI Compilers	4	4020	September 7, 2016
Specified loop mapping schedule not applied (PGI Acc) Legacy PGI Compilers	2	1644	January 23, 2012
Serialization within a for loop? CUDA Programming and Performance	1	516	June 17, 2019
function call inside parallel region Legacy PGI Compilers	3	2496	July 30, 2015
prevent parallelization Legacy PGI Compilers	3	1921	March 22, 2012
Parallel for loop: Internal compiler error Legacy PGI Compilers	2	8078	November 24, 2009
does "acc loop seq" work Legacy PGI Compilers	2	3957	October 3, 2012
Parallelizing for loops using CUDA CUDA Programming and Performance	3	2564	March 8, 2012

Serialize inner loop (CUDA C)

Related topics