Running nsys profiling for GPU memory data on python

vallabhnadgir · June 20, 2024, 10:07pm

I want to get GPU memory data for a Python file, but all I’m getting when I run nsys profile is the os runtime data and timeline. This is what I’m running:

Once I get the nsys-rep file, I make a report.txt using nsys stats and it says that the sqlite file contains no GPU memory data:

How do I get the GPU memory usage data?
Here is my report file:
report.txt (12.1 KB)

Thanks

mhallock · June 22, 2024, 3:38pm

Greetings,

Out of curiosity, if you open the nsys-rep file in the GUI, do you see the GPU memory usage on the timeline?

vallabhnadgir · June 23, 2024, 8:46pm

Nope, I can’t see it on the timeline. Here’s the nsys-rep file for your reference.

historamnsys-rep.zip (8.1 MB)

mhallock · June 24, 2024, 3:12pm

@vallabhnadgir Thank you for sharing the output!

Ok, so either your program is not actually using the GPU, or we are not managing to capture the GPU activity. I suggest the latter as an option because I do see one python process (pid 1376787) that there is no python sampling info, but there is a “pt_main_thread” which is where I assume your actual pytorch workload must be happening.

I will suggest you try the following:

Try again with the newest version of nsys. You are using 2023.4, newest is 2024.4.
Try again with only nsys progile -t cuda python3 perform_reconstruction.py to see if you capture cuda events. It would be interesting to know if you get any different behavior.
If possible, try without multiprocessing so there is only one python process to profile, and see if that works. It is my understanding that what you are doing should work, but lets verify that the program is in fact using the GPU and that we are able to capture the cuda trace.
If you still are unable to observe any cuda events in the profiler, check the output of nvidia-smi while the program is running, and see if it lists python as a process with an active cuda context.

vallabhnadgir · June 25, 2024, 10:40pm

Thanks a lot! Using nsys 2024.2.3 worked!

system · July 9, 2024, 10:40pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Nsys measure memory Profiling Linux Targets	7	1937	April 15, 2022
Nsys cannot capture cuda information Profiling DRIVE Targets	9	75	April 21, 2025
GPU related information missing when using nsys profile Profiling Linux Targets	3	835	May 26, 2023
Nsys doesn't show cuda kernel and memory data Profiling Linux Targets cuda , kernel	10	240	December 7, 2024
Discrepiances with memory profiling Jetson Xavier NX cuda	2	816	October 18, 2021
Profiling Python code using sudo Profiling Linux Targets nsight , python , profiling	8	2160	March 10, 2022
Nsys or nsight-cu-cli, how to get metrics Profiling Linux Targets	1	632	May 20, 2020
Can not get CUDA python backtrace Profiling Linux Targets	12	2008	May 7, 2023
Nsys not collecting python backtrace with --python-backtrace=cuda Profiling Linux Targets cuda , python , cudnn	4	99	October 9, 2024
How to output uvm page fault memory address to the terminal via using nsys 2024.1.? Profiling Linux Targets	8	643	February 15, 2024

Running nsys profiling for GPU memory data on python

Related topics