Understanding the Visualization of Overhead and Latency in NVIDIA Nsight Systems

Originally published at: https://developer.nvidia.com/blog/understanding-the-visualization-of-overhead-and-latency-in-nsight-systems/

Recently, a user came to us in the forums. They sent a screenshot of a profiling result using NVIDIA Nsight Systems on a PyTorch program. A single launch of an element-wise operation gave way to questions about the overheads and latencies in CUDA code, and how they are visualized with the Nsight Systems GUI. This…

This article is very helpful. However, the image quality is very low, making it harder to follow.
Could you please update the article with clearer images?

Thanks.

Thanks for the feedback! I re-uploaded the images and made them larger. I also made them clickable so that you could expand them as needed. I hope that helps! Please feel free to give us more feedback on this and other posts at any time.

1 Like