About compiler parallelization strategy/info


Is there a way I could access details about the loops parallelized by the compiler? For instance, the instruction count in a given parallel loop/section?

It may be asking too much and is possibly not available, but it would also be useful to understand the expected performance of a section in general. Does the compiler create performance models for that during compilation in order to decide among parallelization strategies? Is any of this information available?


Hi George,

Are you asking about CPU code instructions or GPU?

For the CPU, you can use the PGI utility pgcollect to perform sample-based profiling and then use the PGI profiler pgprof to drill down into the assembly. You can also instrument your code using the flag “-Mprof=lines”. It’s slower to run and reports at the line level rather than the assembly level, but it is more accurate than sample-based profiling. Other useful third-party profilers are OProfile and TAU.
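As a sketch, a typical workflow with these tools might look like the following. The program and source names (myprog, myprog.c) are placeholders, and the exact options may vary by PGI release, so check the PGPROF documentation for your version:

```shell
# Instrumented profiling: line-level, slower, but more accurate
pgcc -Mprof=lines -o myprog myprog.c
./myprog            # writes profile data (e.g. pgprof.out) on exit

# Sample-based profiling: no recompile needed, can drill to assembly
pgcc -o myprog myprog.c
pgcollect ./myprog

# View the collected profile in pgprof
pgprof -exe ./myprog
```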

For the GPU, we provide basic profiling information (data movement and kernel time). For more in-depth profiling, you can use NVIDIA’s CUDA Profiler. It doesn’t give instruction counts, but it does give a lot of useful information.
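For illustration, both the PGI runtime’s basic timing output and the command-line CUDA profiler of that era were typically enabled through environment variables along these lines (variable names assumed from the PGI and CUDA documentation of the time; verify them against your installed versions):

```shell
# PGI runtime: print data-movement and kernel times at program exit
export PGI_ACC_TIME=1
./myprog

# Command-line CUDA profiler: log per-kernel timing information
export COMPUTE_PROFILE=1
export COMPUTE_PROFILE_LOG=cuda_profile.log
./myprog
```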

For complete details about PGI’s profiling tools, please see the PGPROF User’s Guide.

Hope this helps,