I am trying to use OpenMP to speed up a subroutine. In the profile, two _mp routines take up considerable time:
_mp_p2 28% (81.66 seconds)
_mp_barrier 3% (9.33 seconds)
I am guessing _mp_barrier shows considerable time because I have a critical region: some threads have to wait their turn to execute it, and that waiting time accumulates as _mp_barrier.
However, I do not know what _mp_p2 is. Usually a lot of time is spent in the routine grid, which is the one I am parallelizing with OpenMP, but it seems that much of grid's time has been transferred to this _mp_p2.
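
To make the critical-region situation concrete, here is a minimal C sketch of the kind of pattern I mean. It is not my actual code: `cell_work` and `sum_with_critical` are made-up names standing in for the work done inside grid, and the real routine may be structured differently.

```c
#include <stddef.h>

/* Hypothetical stand-in for the per-iteration work done inside grid. */
static double cell_work(long i)
{
    return (double)i * 0.5;
}

/* Parallel loop whose shared update is protected by a critical region.
 * Only one thread at a time may execute the critical statement, so the
 * other threads wait there. Threads that finish their iterations early
 * also wait at the implicit barrier at the end of the parallel loop. */
double sum_with_critical(long n)
{
    double total = 0.0;

    #pragma omp parallel for
    for (long i = 0; i < n; ++i) {
        double v = cell_work(i);   /* independent work, runs in parallel */

        #pragma omp critical
        total += v;                /* serialized shared update */
    }

    return total;
}
```

Built with OpenMP enabled (for example `-fopenmp` with GCC or `-mp` with the PGI compilers), a call such as `sum_with_critical(1000)` runs the loop across threads. Without the flag the pragmas are ignored and the loop runs serially with the same result.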