Nsight Computer with PyTorch

youcandoit · December 23, 2020, 2:00am

Hi,

I am using Nsight Compute and System to profile BERT in PyTorch. I want to know whether we need to package the interesting code using “torch.cuda.synchronize()” as follows. Is it ok to use only range_push/pop when I use Nisght Compute and System?

torch.cuda.synchronize()
torch.cuda.nvtx.range_push(“test”)
with torch.cuda.profiler.profile():
with torch.autograd.profiler.emit_nvtx():
interesting code
torch.cuda.synchronize()
torch.cuda.nvtx.range_pop()

As I know, if I want to measure time, it needs to be packaged with synchronization. (link)

It would be helpful if you could answer me.
Thank you!

Topic		Replies	Views
NSight Compute vs. NSight Systems vs. PyTorch Profiler Nsight Compute	2	2661	March 23, 2024
Nsight compute failed to profile with nvtx ranges in pytorch Nsight Compute pytorch , profiling	4	1012	September 19, 2023
Why not employ asynchronous techniques in deep learning models? cuDNN deep-learning	3	927	December 31, 2023
Nsight Compute with Pytorch Nsight Compute pytorch , profiling	4	357	August 23, 2024
Nsight-compute print "the application returned an error code (249)" Nsight Compute	5	1431	February 13, 2023
OptiX profiling? Nsight Compute cuda , optix	8	1015	November 27, 2023
Could not correctly profile python tasks [No CUDA events collected. Does the process use CUDA?] Profiling x86 Windows Targets cuda	1	1032	June 28, 2021
Profiler stuck while profiling a range Nsight Compute	1	1813	November 20, 2023
Kernel time of Nsight system is larger than nsight compute Profiling Linux Targets	11	840	April 3, 2024
How to Display Source Code Alongside SASS in Nsight Compute for Custom CUDA Kernels called from Pytorch/Python Autograd? Nsight Compute pytorch	0	311	November 6, 2024

Nsight Computer with PyTorch

Related topics