FP16 add Arithmetic Function Variety

uniadam · June 30, 2022, 1:57pm

Hi,

I have a question about 3 kind of add that we have for FP16.

device __half __hadd ( const __half a, const __half b )
Performs half addition in round-to-nearest-even mode.
device __half __hadd_rn ( const __half a, const __half b )
Performs half addition in round-to-nearest-even mode.
device __half __hadd_sat ( const __half a, const __half b )
Performs half addition in round-to-nearest-even mode, with saturation to [0.0, 1.0].

I do not understand the meaning of following phrases correctly. What exactly is happening?

Prevents floating-point contractions of mul+add into fma.

for the second type of add are we doing multiplication operation also? I was thinking that fma is related to r=(a*b)+c but here we are just doing add.

Do we have any example or some documents for more information?

Best regards,

rs277 · July 1, 2022, 7:02pm

There are a few more details here: PTX ISA 8.3

Topic		Replies	Views
How to cuda half and half functions CUDA Programming and Performance	5	4140	January 10, 2019
Precision is be influenced when adopting the __half(fp16) dataType CUDA Programming and Performance cuda , programming	2	474	July 6, 2023
__half and standard operators + * / - CUDA Programming and Performance	5	606	February 7, 2023
__hadd not working correctly CUDA Programming and Performance cuda	3	401	October 19, 2023
Atomic operation in FP16 CUDA Programming and Performance	2	2105	February 22, 2017
AtomicAdd not overloaded for c10::Half CUDA Programming and Performance cuda	5	3615	March 5, 2022
CUDA __half atomicAdd Poor computing time CUDA NVCC Compiler cuda	3	507	February 2, 2024
Two expressions of same mathematical semantic give different results CUDA Programming and Performance	4	346	July 6, 2023
error when trying to use half (fp16) CUDA Programming and Performance	16	20464	October 13, 2015
Half2 atomics generate unused code CUDA Programming and Performance	13	363	August 8, 2024

FP16 add Arithmetic Function Variety

Related topics