OpenMP not parallelizing nested loop, depends on order

kyle.niemeyer1 · November 8, 2012, 7:23pm

I’m having a strange issue with pgcc not parallelizing a nested loop, depending on the order of the loops.

For example, I have:

void function (vars) {
int row, col;
#pragma omp parallel for private(row, col)
148 for (row = 0; row < NUM; ++row) {
150     for (col = 0; col < NUM - 1; ++col) {

    ...
    }
}
206 }

Compiling with pgcc -fast -mp -Minfo gives me:
148, Parallel region activated
Parallel loop activated with static block schedule
206, Barrier
Parallel region terminated
150, Invariant if transformation
Loop not vectorized: may not be beneficial

If I switch the order of the for loops, then I get the same thing but without the “Loop not vectorized: may not be beneficial”.

Why would it be doing this? The order of the loops shouldn’t affect how it is parallelized, since they are independent. I would use the second way, but I have an operation that is in the outer loop only.

OpenACC doesn’t have a problem with this, so I’m not sure why OpenMP is having trouble. Any advice?

MatColgrove · November 8, 2012, 9:24pm

Hi Kyle,

Why would it be doing this? The order of the loops shouldn’t affect how it is parallelized, since they are independent.

It’s still parallelized, this message is about vectorization.

Mat

Topic		Replies	Views
PGI not vectorizing openmp loops Legacy PGI Compilers	1	2492	October 23, 2012
OpenACC and nested loops Legacy PGI Compilers	2	4083	September 19, 2014
PGI attempts to parallelize sequential loop Legacy PGI Compilers	3	2661	August 28, 2012
How to parallelize this loop... Legacy PGI Compilers	14	7983	December 18, 2012
Bug with !$acc routine seq? Legacy PGI Compilers	2	2290	April 29, 2019
Couple of questions (nested loops, loop bounds, etc.) Legacy PGI Compilers	17	25284	December 11, 2014
Nested loops in C Legacy PGI Compilers	2	3734	September 9, 2010
Loop "too deeply nested" and "data dependency Legacy PGI Compilers	9	10730	November 27, 2017
Loop is parallelizable Legacy PGI Compilers	2	1835	June 10, 2010
Triply nested loop using implicit OpenACC Legacy PGI Compilers	3	3077	September 5, 2012

OpenMP not parallelizing nested loop, depends on order

Related topics