How does parameter computeType affect the computation?

user46403 · February 7, 2023, 3:39pm

Hi,
I’m a little confused about how the parameter computeType affects the calculation.

For example, in cusparseSpMV, the types of A, X and Y are 16f and computeType is CUDA_R_32F. Does it mean that in the computation, the types of A, X and Y will be converted to f32 first, then they will be calculated, and finally the result Y will be converted to f16?

Thank you.

Robert_Crovella · February 7, 2023, 4:08pm

This would be the typical, original tensor core calculation. The calculation is a 16-bit float by a 16-bit float, yielding a 32-bit float result. Corresponding results (within a single tensor core op) are accumulated in 32-bit float. The accumulated result is converted back to 16-bit float upon storage, i.e. on completion of the underlying sass tensor core operation. You can refer to diagrams 7 and 8 here, where the only deviation from diagram 8 in this case is that the FP32 result gets converted to FP16 at the point of storage of that result.

system · February 21, 2023, 4:09pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What is the compute type in cublasGemmEx when compute with alpha, beta? GPU-Accelerated Libraries cublas	2	1027	February 28, 2023
cusparseCsrmvEx with half I/O and float calculation. GPU-Accelerated Libraries	5	914	October 30, 2019
Multiplication in Half and Accumulation in Single CUDA NVCC Compiler	0	574	July 3, 2022
Fp16 or fp32 Accumulate? CUDA Developer Tools	0	861	April 12, 2020
cuspraseCsrmvEx half i/o and float calculation Deep Learning (Training & Inference) mixed-precision	0	585	October 8, 2019
Some results in A100 with cuBLAS and cuBLASLt GPU-Accelerated Libraries cublas	1	176	January 9, 2025
Numerics of tensor core instructions GPU-Accelerated Libraries	3	832	March 3, 2023
Why does cublasSgemm uses `f16` for `float`? GPU-Accelerated Libraries cublas	7	1444	March 8, 2023
Errors about cuSPARSE blocked ellpack format with fp16 compute type GPU-Accelerated Libraries cuda , cusparse	14	672	January 12, 2024
Just Released: cuSPARSELt v0.3 Technical Blog	0	288	June 21, 2022

How does parameter computeType affect the computation?

Related topics