I have 3 matrix A, B & C. A & B are in Half data type and B is Single. The operation is like this:
A = A + BxC.
Now I am using “cublasGemmEx” for this operation, but I want to do this operation in Half and TF32. Can I do this operation without converting from FP32 to TF32? Do we have any replacment for “cublasGemmEx” which accept FP32 and do GEMM in TF32?
I am not sure if “cublasGemmEx” is doing the same now or not!