profiling mpi programs

mahmood.nt · March 24, 2018, 6:52pm

It seems that mpi programs (even with one core) are not compatible with the hardware event monitor of the visual profiler. Is that correct? Is there any workaround on that? Any alternative?

Robert_Crovella · March 24, 2018, 6:59pm

typically, in my experience, people profile MPI CUDA activity using nvprof, and then pull the results into visual profiler.

There are instructions in the profiler user guide.

[url]Profiler :: CUDA Toolkit Documentation

I haven’t looked at trying to do this only from the visual profiler. The visual profiler has ability to profile multiple processes, so it should be possible to do it directly:

[url]Profiler :: CUDA Toolkit Documentation

mahmood.nt · March 24, 2018, 7:07pm

The visual profiler itself has no problem running the program. The problem is when you want to configure events. I mean [1]…
Is nvprof able to measure them with mpi programs?

[1] Profiler :: CUDA Toolkit Documentation

Robert_Crovella · March 24, 2018, 7:25pm

Both nvprof and nvvp are able to collect metric data. Events are less typically used. I’d probably need to try a specific example. nvprof can certainly show you any queryable event. Whether or not those are trivially displayable in nvvp is something I would have to take a look at.

mahmood.nt · March 24, 2018, 7:32pm

I am referring to this message

Metric/event collection failed:
Events/metrics cannot be collected for multi-process applicaiton

mahmood.nt · March 25, 2018, 12:13pm

@txbob:

So, I tested with two scenarios with visual profiler:

Selecting Profile child processes and then:
File = mpirun
Working directory = /home/mahmood/lammps/eam
Arguments = -n 2 /opt/lammps-11Aug17/src/lmp_mpi -sf gpu -pk gpu 1 -in in.eam

Then select Next and then Finish. The program runs in the profiler and I can see the same output as the linux terminal. So, it is fine. After run, when I select Run->Configure Metrics and Events, I get the following error:

Metric/event collection failed:
Events/metrics cannot be collected for multi-process applicaiton

Selecting Profile current process only and then:
File = mpirun
Working directory = /home/mahmood/lammps/eam
Arguments = -n 2 /opt/lammps-11Aug17/src/lmp_mpi -sf gpu -pk gpu 1 -in in.eam

Then select Next and then Finish. The program runs in the profiler and I can see the same output as the linux terminal. So, it is fine. After run, when I select Run->Configure Metrics and Events, I can see the metric window where I can select which metric to monitor. I select one of them and then I press Apply and Run. The program runs two times (!) and the run time is the same as first scenario which is odd. Usually by selecting a metric, the runtime becomes slower. But here, I didn’t see any slow runtime.

Any comment?

mahmood.nt · March 26, 2018, 8:38am

Hello again
While visual profiler is not able to measure the metric when “profile child processes” is selected, the nvprof command is able to do that!

Do you have any comment?

Topic		Replies	Views
Profiling MPI processes on windows Visual Profiler and nvprof	0	1650	March 31, 2013
CUDA visual profiler using mpi? CUDA Programming and Performance	1	1233	November 9, 2009
Problem with cudaprof when executing a multi process program CUDA Programming and Performance	1	7194	March 29, 2010
Profiling MPI+Cuda CUDA Programming and Performance	1	1741	December 19, 2013
nvprof is too slow Visual Profiler and nvprof	12	4965	January 25, 2022
visual profiler with MPI CUDA Programming and Performance	3	6317	December 31, 2008
Visual Prifiler (64bit) 4.1 rc "unable to collect events and metrics" null CUDA Programming and Performance	0	1462	November 15, 2011
Profiling MPI + CUDA CUDA Programming and Performance	0	828	September 15, 2011
nvprof for gmx_mpi CUDA Programming and Performance	0	499	July 9, 2018
Profiling application with CUPTI in a separate process? CUDA Programming and Performance	2	918	July 6, 2017

profiling mpi programs

Related topics