If nsys has an option similar to ‘–profile-all-processes’?(Not getting cuda information from child processes on Linux

The_C_odore · November 1, 2021, 2:33am

Hello!

I try to use nsys to record a CUDA execution information which produced by a gpu plugin of PostgreSQL.

nsys profile --stats=true --force-overwrite true --gpu-metrics-device=all --trace-fork-before-exec=true  -o outputstrom1  psql -d postgres --command 'explain analyze SELECT *  FROM ar4, ar2 where ar4.key=ar2.key;'

(When I use ‘psql -d postgres …’, PostgreSQL will fork a progres to receive my command.
And the progres use a gpu plugin which will create multiple worker progress by ‘pthread_create’. The worker progress will invoke cuda kernel function.)

But I only got the CPU information like this.

And I can get the CUDA execution information by nvprof. I start nvprof before launching ‘psql’.

nvprof --profile-all-processes -s -o strom.%p.nvvp

So I’m wondering if nsys has an option similar to ‘–profile-all-processes’?
Or how can I use nsys to get the cuda information?

Thanks!

ztasoulas · November 2, 2021, 1:28pm

Which version of nsys are you using (nsys --version)? Also, have you taken a look at the warnings on the report? They appear by clicking on the top right corner.

The_C_odore · November 3, 2021, 2:12am

Sorry for the late reply.

$ nsys --version
NVIDIA Nsight Systems version 2021.2.1.58-642947b

NVIDIA-SMI 440.95.01    Driver Version: 440.95.01    CUDA Version: 11.4
Docker Image: nvidia/cuda:11.4.1-devel-centos8

The report shows that no CUDA event was collected. And child progress used CUDA actually.

Looking forward to your reply!

Ziqi · January 7, 2022, 7:41pm

Did you find an answer? Same problem here.

The_C_odore · January 9, 2022, 8:42am

Sorry, I don’t solve it. So I use nvprof…

If you have the answer, remember answer the question.
Thank you

Ziqi · January 9, 2022, 12:12pm

I don’t have an answer either. We can’t use nvprof, as nvprof stopped working on new GPU generations. It is my understanding that Nvidia should provide an answer as to whether such problems do exist with current Nsight Systems, and the plan to fix the existing bugs. Only in this way can we as users, and more importantly, as consumers (we bought many A100, A40 and A10 types), know if the problem is from our use or is from the product itself, and find alternative solutions if necessary. I saw several complaints about nsys missing multi-process CUDA info, and unfortunately, there is no clear answer from Nvidia yet.

Ziqi · January 10, 2022, 9:20pm

I got an answer. You need to add “–trace=cuda” manually although in the document, this flag is set by default. After adding this flag, cuda info is available in all processes. Let me know if it works for you.

logg72 · July 12, 2024, 2:24am

Excuse me, did you get all gpu information(e.g. l2 cache hit rate) for each processes?

hwilper · July 12, 2024, 1:49pm

The GPU information isn’t part of the -trace=cuda information. You will need to turn GPU metrics for that (see doc - User Guide — nsight-systems 2024.4 documentation)

Topic		Replies	Views
Nsight Systems Missing CUDA Info in Multi-Process Profiling Profiling Linux Targets cuda , nsight	2	2222	March 30, 2023
How to get full profiling with Nsight system for a particular process Profiling Linux Targets cudnn	8	1276	September 23, 2024
NSight Systems does not profile subprocess(via fork in unistd or Process in python.multiprocess) CUDA_API Profiling Linux Targets	6	1257	September 23, 2024
NSIGHT SYSTEM: Runtime Error and reported QuadDCommon::NotFoundException Profiling Linux Targets nsight	13	6190	September 8, 2023
Unable to install / locate Nvidia Nsight Systems CLI Profiling Linux Targets	4	3778	December 19, 2019
Not getting NVTX events from child processes on Linux Profiling Linux Targets	8	1224	June 23, 2021
Nsys Does not Track CUDA Api events Profiling Linux Targets	5	1063	December 22, 2022
Nsys crashes on first memory access Profiling Linux Targets cuda	4	69	August 14, 2024
How to profile all CUDA activity on a system Profiling x86 Windows Targets	6	1587	November 1, 2022
Nsys doesn't show cuda kernel and memory data Profiling Linux Targets cuda , kernel	10	165	December 7, 2024

If nsys has an option similar to ‘–profile-all-processes’?(Not getting cuda information from child processes on Linux

Related topics