Calculating utilization (core load) of Tensor Cores and CUDA cores separately

Hi,
Architecture: Turing
DL: using TensorRT
I know that with the tensor_precision_fu_utilization and tensor_int_fu_utilization metrics, Tensor Core utilization can be found per kernel on a scale of 0 to 10. Is there a convenient way to find the total utilization of the Tensor Cores over, say, 1 second, without going into each kernel's utilization? I want to use the CUPTI APIs to segregate the Tensor Core and CUDA core utilization of a complete deep learning network over a period of time. Right now we are using the nvmlDeviceGetUtilizationRates() NVML function to get the GPU utilization, but I think this API returns only the total GPU core load, with no bifurcation between Tensor and CUDA cores.
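For reference, the NVML call mentioned above reports only an aggregate busy percentage for the whole GPU. A minimal sketch of how it is typically polled (device index 0 and a single sample are assumptions; real monitoring would sample in a loop):

```c
#include <stdio.h>
#include <nvml.h>  /* link with -lnvidia-ml */

int main(void) {
    nvmlReturn_t rc = nvmlInit();
    if (rc != NVML_SUCCESS) {
        fprintf(stderr, "nvmlInit failed: %s\n", nvmlErrorString(rc));
        return 1;
    }

    nvmlDevice_t dev;
    rc = nvmlDeviceGetHandleByIndex(0, &dev);  /* device 0: assumption */
    if (rc == NVML_SUCCESS) {
        nvmlUtilization_t util;
        rc = nvmlDeviceGetUtilizationRates(dev, &util);
        if (rc == NVML_SUCCESS) {
            /* util.gpu is the percent of time any kernel was executing;
               it does not distinguish Tensor Core from CUDA core work. */
            printf("GPU busy: %u%%, memory busy: %u%%\n", util.gpu, util.memory);
        }
    }

    nvmlShutdown();
    return 0;
}
```

This illustrates why the API is insufficient for the use case: `util.gpu` counts time any kernel was resident, regardless of which functional units it exercised.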

Thanks

Hi Vivek,

If I understand the use case, you want to capture the Tensor Core usage data without serializing the kernels in the application, is that correct? I think a tool like DCGM (Data Center GPU Manager) is better suited for this, as it can provide a set of device-level metrics continuously with low performance overhead. I assume "Tensor Activity" is the metric you are interested in. More details can be found at https://docs.nvidia.com/datacenter/dcgm/latest/dcgm-user-guide/feature-overview.html#profiling
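As a concrete starting point, the dcgmi CLI that ships with DCGM can stream these profiling metrics per device at a fixed interval. A sketch (the field IDs shown are the commonly documented ones for these metrics, but please verify them against the field identifier list for your DCGM version):

```shell
# Stream profiling metrics once per second (-d is the interval in ms):
#   1002 = DCGM_FI_PROF_SM_ACTIVE           (fraction of time SMs had work)
#   1004 = DCGM_FI_PROF_PIPE_TENSOR_ACTIVE  ("Tensor Activity")
dcgmi dmon -e 1002,1004 -d 1000
```

Comparing the tensor-pipe activity against overall SM activity over your 1-second window gives the kind of Tensor Core vs. general CUDA core breakdown you described, without per-kernel profiling.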