Nsight Systems Missing CUDA Info in Multi-Process Profiling

Ziqi · January 7, 2022, 3:30pm

I profiled a multi-process application running in a docker container. The command I used to profile such an application is “nsys profile -o /home/zfan/sandbox/virgo_algo_preq/docker/profile_results/smaq_2c_1h ./LeafStandAlone.x86-64 -noForcedPatches /home/zfan/sandbox/virgo_algo_preq/data/JobInfo_108”. I didn’t find any CUDA information such as CUDA HW in my profiling result (the same happened for a singularity container), and I have long been troubled by profiling multi-process applications with nsys. On the other hand, there is no problem in profiling single-process applications. Both screenshots of reports for the single-process and the multiple-process applications are attached. (Note the CUDA HW in the single-process report.) The report for the multi-process itself is also attached. Can someone point out how to profile a multi-process application to capture all CUDA behaviors? In particular, I’d appreciate it if someone from the Nsight team helps me figure out if multi-process is good to profile in Nsight Systems (any known bugs?) and what is the correct way to use it. It is very miserable to have no available profiler for my multi-process application (nvprof doesn’t work on new generations of GPUs).

multi-process:
Capture

single-process:

smaq_2c_1h.qdrep (363.2 KB)

slava4 · May 12, 2022, 9:21pm

Have the same problem. Tried to workaround this by running multiple nsys profiling sessions simultaneously and it causes the other processes to often crash.

r05943077 · March 30, 2023, 2:45am

I think the problem is resolved.
Try this:

Topic		Replies	Views
If nsys has an option similar to ‘–profile-all-processes’?(Not getting cuda information from child processes on Linux Profiling Linux Targets nsight	8	2158	July 12, 2024
How to profile all CUDA activity on a system Profiling x86 Windows Targets	6	2020	November 1, 2022
Nsys hangs when profiling any cuda process Profiling Linux Targets cuda	1	326	August 11, 2025
NSight Systems does not profile subprocess(via fork in unistd or Process in python.multiprocess) CUDA_API Profiling Linux Targets	6	1503	September 23, 2024
Nsight Systems Missed Information in a Singularity Container Profiling Linux Targets cuda , nsight	2	891	January 4, 2022
I want to profile multiprocess at once Profiling Linux Targets nsight	1	964	January 7, 2022
nsys CUDA trace works for threads, but not for subprocesses Profiling Linux Targets	3	2466	May 13, 2019
How to get full profiling with Nsight system for a particular process Profiling Linux Targets cudnn	8	2355	September 23, 2024
Nsys for multi GPU apps Profiling Linux Targets	1	1430	September 10, 2018
Nsight-Compute returns "No kernels were profiled" for multi-process profiling Nsight Compute	2	2640	July 21, 2022

Nsight Systems Missing CUDA Info in Multi-Process Profiling

Related topics