Ensuring the execution of GEMM done by Tensor Core

uniadam · August 18, 2022, 9:18pm

Hi,

I want to be sure that my GEMM operation done by Tensor Core. By profiling I am seeing this kernel.

By checking the kernel name I am not seeing any thing about Tensor Core. As I remember for the double version it was visable.

I have doubt, because I expected better performance.

I am using V100S GPU with Cuda 11 and I have set:

cublasGemmAlgo_t ALGO = CUBLAS_GEMM_DEFAULT;

Robert_Crovella · August 19, 2022, 2:41pm

that is a tensor core kernel

you can also use the profiler (e.g. Nsight Compute) to verify tensor core activity.

Topic		Replies	Views
cublasGemmEx is a Tensor Core operation or CUDA core? GPU-Accelerated Libraries cublas	3	907	October 3, 2021
Run Parallel Tensor Cores GEMM and Cuda GEMM GPU-Accelerated Libraries cuda , cublas	9	2460	August 14, 2022
How to enable Tensor core for cublasSgemmBatched on H100? GPU-Accelerated Libraries cuda , kernel , cublas , cutlass	5	761	November 17, 2023
How to confirm Tensor Core is working or not in CuSPARSE GPU-Accelerated Libraries cuda	4	832	May 12, 2023
Am I using Tensor Core? CUDA Programming and Performance	3	702	August 4, 2021
Is CUBLAS_GEMM_DEFAULT_TENSOR_OP in cublasGemmEX no longer supported? GPU-Accelerated Libraries cublas , cutensor	3	1217	September 6, 2023
Multiple Streams on Tensor Cores CUDA Programming and Performance	4	647	February 14, 2019
Parallel execution on tensor cores and cuda cores on the same SM Jetson AGX Xavier tensorrt	4	1208	October 18, 2021
How to confirm whether Tensor Core is working or not. Jetson AGX Xavier	8	10723	October 18, 2021
complex FP16 tensor core GEMMs CUDA Programming and Performance	3	900	February 5, 2020