I’ve discussed your issue with several of our other application engineers, and we aren’t really sure what the problem is. If your parallel computation is very small but there are an extremely large number of tasks, your time may be dominated by the overhead of creating and managing those tasks.
One thing to try is to record your times at 1, 2, 4, and the maximum number of threads. If the program doesn’t scale (i.e., the times are roughly the same), then this may indeed be the problem. The fix would then be to give each TASK more work.
If this isn’t the problem, we’d need to see a reproducing example to tell what’s wrong.
Thanks for answering. The code I am testing has sufficient computation, so I think this is a data-scoping issue: I expected that variables shared in the OMP PARALLEL region would propagate to the nested TASK constructs as well. Updated code structure:
!$OMP PARALLEL SHARED(...) PRIVATE(...) FIRSTPRIVATE(...)
!$OMP SINGLE
!$OMP TASK UNTIED
CALL test(...)
!$OMP END TASK
!$OMP END SINGLE
!$OMP END PARALLEL
In the test function:
SUBROUTINE test(...)
...
DO i=i0,in
  DO j=j0,jn
    DO k=k0,kn
!$OMP TASK SHARED(...) FIRSTPRIVATE(...) ! variables are either shared or firstprivate here
      <embarrassingly parallel code>
!$OMP END TASK
    ENDDO
  ENDDO
ENDDO
!$OMP TASKWAIT
END SUBROUTINE test
The code runs indefinitely. For this type of code structure, do you have any advice on avoiding scoping-related pitfalls? Thank you.