I can't get the profiler to show GPU memory throughput details for a kernel

ho126jin · July 21, 2021, 5:09pm

Here is my profiling result:

In the picture above, the throughput for ‘Memcpy’ is written, but the throughput for one kernel is not written.

What do I have to do to get the throughput of the kernel?

I also have two related questions:

Does ‘throughput’ here mean global memory throughput?
If so, is global memory throughput the same thing as the effective global memory bandwidth described in https://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html#effective-bandwidth-calculation ?
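
For reference, my understanding of the effective bandwidth calculation in that guide is ((bytes read + bytes written) / 10^9) / time in seconds, which gives GB/s. Below is a minimal sketch of that arithmetic in Python. The function name and the 1 ms kernel time are assumptions made up for illustration, not values from my profile.

```python
def effective_bandwidth_gbs(bytes_read, bytes_written, seconds):
    """Effective bandwidth in GB/s: ((Br + Bw) / 10^9) / time."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return (bytes_read + bytes_written) / 1e9 / seconds


# Example: copying a 2048 x 2048 matrix of 4-byte floats,
# so the kernel reads and writes the same number of bytes.
n = 2048
bytes_each_way = n * n * 4

# Hypothetical kernel time of 1 ms (an assumption, not a measurement).
kernel_seconds = 1e-3

bw = effective_bandwidth_gbs(bytes_each_way, bytes_each_way, kernel_seconds)
print(f"effective bandwidth: {bw:.2f} GB/s")
```

If this is right, I could compute the number by hand from the kernel duration, but I would still like to know whether the profiler can report it directly.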
