How to observe the behavior of NVLINK by NVVP and nvprof?

Lennoxwu · July 8, 2019, 8:29pm

I am profiling a deep learning model; the framework is TensorFlow with NCCL.
Checking nvidia-smi, I am sure there is a lot of traffic on NVLink.
The ncclAllReduce calls should generate a lot of traffic.
However, I cannot see any traffic in NVVP, and the NVLink analysis is almost empty (I attached a screen capture).
Will transfers over NVLink be shown in the timeline as memcpy[D2D]?

I profile the model with the following command:

mpiexec --allow-run-as-root --bind-to socket -np 2 -x CUDA_VISIBLE_DEVICES=0,1 numactl -N 0 -m 0 nvprof -f -o /dev/shm/lennox/timeline.%q{OMPI_COMM_WORLD_RANK}.nvprof python vgg.py --layers 16 -b 32 -u batch -i 200 --log_dir=/data/learning/tmp/ --data_dir=/data/learning/tf/models/research/inception/inception/data/ILSVRC2012/

Sanjiv.Satoor · July 10, 2019, 9:00am

You need to collect the nvlink metrics using nvprof to see them under NVLink analysis in NVVP.

Use the following nvprof options:

nvprof --aggregate-mode off --event-collection-mode continuous -m nvlink_total_data_transmitted,nvlink_total_data_received,nvlink_transmit_throughput,nvlink_receive_throughput -o
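
For reference, here is a rough sketch of how these options might be merged into the launch line from the original question. It only prints the combined command so it can be checked before running. The build_cmd helper and the METRICS and OUT variables are made up for this illustration, and it assumes the rest of the mpiexec/numactl line stays exactly as posted.

```shell
#!/bin/sh
# NVLink metrics named in the reply, joined into one comma-separated list.
METRICS="nvlink_total_data_transmitted,nvlink_total_data_received"
METRICS="$METRICS,nvlink_transmit_throughput,nvlink_receive_throughput"

# One output file per MPI rank, same path pattern as in the question.
# Single quotes keep the shell from touching %q{...}.
OUT='/dev/shm/lennox/timeline.%q{OMPI_COMM_WORLD_RANK}.nvprof'

# Hypothetical helper: prints the merged command instead of executing it.
# Assumption: only the nvprof options change; everything else is as posted.
build_cmd() {
    printf '%s ' mpiexec --allow-run-as-root --bind-to socket -np 2 \
        -x CUDA_VISIBLE_DEVICES=0,1 numactl -N 0 -m 0 \
        nvprof --aggregate-mode off --event-collection-mode continuous \
        -m "$METRICS" -f -o "$OUT" \
        python vgg.py --layers 16 -b 32 -u batch -i 200 \
        --log_dir=/data/learning/tmp/ \
        --data_dir=/data/learning/tf/models/research/inception/inception/data/ILSVRC2012/
    printf '\n'
}

build_cmd
```

If the printed line looks right, it can be pasted into the terminal as is; the resulting per-rank .nvprof files are then the ones to import into NVVP.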
