Profile pytorch model using NCU

x-wang20 · June 30, 2022, 4:59am

Description

I have established a pytorch model and wanna to profile each layer or operator to get memory occupation, pcie bandwidth and GPU utils and so on when making inference. How can I do that using Nsight Compute or is there some available methods?

Environment

TensorRT Version:
GPU Type:
Nvidia Driver Version:
CUDA Version:
CUDNN Version:
Operating System + Version:
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

Relevant Files

Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)

Steps To Reproduce

Please include:

Exact steps/commands to build your repro
Exact steps/commands to run your repro
Full traceback of errors encountered

spolisetty · July 1, 2022, 12:46pm

Hi,

Please refer the following docs,

https://developer.nvidia.com/nvidia-visual-profiler

If you still need further assistance we will move this post to Nsight related forum.

Thank you.

Topic		Replies	Views
Nsight Compute Error Jetson AGX Orin tensorrt , nsight , pytorch	12	1080	December 21, 2022
How to measure Tensor core utilization using NVIDIA profiling tools such as Nsight System, DLProf, nvprof etc TensorRT cudnn	4	1374	January 31, 2024
Cannot connect to process in nsight compute Nsight Compute deep-learning-profiler	5	836	April 29, 2024
TensorRT TensorRT tensorrt	5	652	January 19, 2022
TensorRT TensorRT	1	353	August 26, 2021
Nsight Compute with Pytorch Nsight Compute pytorch , profiling	4	321	August 23, 2024
NVTX Filtering TensorRT tensorrt , nsight	1	807	May 6, 2021
TensorRT TensorRT	1	443	August 26, 2021
TensorRT - NVTX Filtering using Nsight Systems Nsight Systems	1	795	June 12, 2021
TensorRT Algorithm selector TensorRT tensorrt	3	496	September 28, 2021

Profile pytorch model using NCU

Description

Environment

Relevant Files

Steps To Reproduce

Related topics