Nsys or nsight-cu-cli, how to get metrics

belegkarnil · April 2, 2020, 2:49pm

Hi!

I would like to profile the CPU utilization and GPUs of a python MPI application.
I need to measure at least

the initialization time
the transfer from CPU to GPU
the time needed to allocate memory on GPU
the overlap between transfer and compute (if the compute start before the whole data are sent to the GPU)
the freeing time at the end of the execution
the memory used by the CPU and the GPU

From what I read, I thought that nsys is the best tool but I cannot extract these metrics.
Can you tell me what I have to use?

Thanks!

hwilper · May 20, 2020, 5:31pm

Sorry for the delay. What you will want to do is use the “nsys stats” command to extract statistics from a sqlite representation of the data. You will need to be using Nsys 2020.2 (or 2020.3 when it is released), please check “nsys stats --help” for details.

Topic		Replies	Views
Dump --gpu-metrics-device output to console/file using nsys Profiling Linux Targets	3	159	October 25, 2024
Error Collecting Nsys Profile Metrics Profiling Linux Targets nsight	3	816	April 18, 2024
Runtime too long with ncu, would like real-time profiling Profiling Linux Targets	8	1148	July 1, 2022
How to figure out CPU and GPU activity parallelism using Nsight Systems or Nsight Compute? Profiling Linux Targets	3	1120	December 19, 2019
Nsys for multi GPU apps Profiling Linux Targets	1	1430	September 10, 2018
Nsys profile doesn't collect tensor core utilization and the metrics about tensor active/SM instructions are not shown in the GUI Profiling Linux Targets cuda	6	800	February 24, 2024
Running nsys profiling for GPU memory data on python Profiling Linux Targets	5	1133	July 9, 2024
Any tool to show execution time DRIVE AGX Orin General drive-devtools	22	348	August 21, 2024
Nsys profiling MPI jobs Profiling Linux Targets nsight , hpc	1	2600	November 7, 2020
How is the CPU utilization computed? Profiling Linux Targets nsight	0	492	May 29, 2021

Nsys or nsight-cu-cli, how to get metrics

Related topics