Mixed Precision Processing

Hello everyone,
I was reading through the Turing and Volta architecture whitepapers, and was curious whether a GPU can perform multiple different operations simultaneously if they are of different precisions…

Could you be a little more specific?

Looking at the GV100 whitepaper…

The GV100 SM is partitioned into four processing blocks, each with 16 FP32
Cores, 8 FP64 Cores, 16 INT32 Cores, two of the new mixed-precision Tensor Cores for deep
learning matrix arithmetic, a new L0 instruction cache, one warp scheduler, one dispatch unit,
and a 64 KB Register File.

Therefore, since the FP32 and INT32 cores are separate execution units within each processing block, integer and floating-point instructions can execute concurrently.
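
To make that concrete, here is a minimal CUDA sketch of a kernel that gives each thread two independent dependency chains, one integer and one FP32, inside the same loop. All names (`int_step`, `fp_step`, `mixed_kernel`, `run_mixed_demo`) are made up for illustration and do not come from the whitepaper. The code only checks that the results are correct. It does not measure or prove overlap, and whether the two chains actually run concurrently depends on the compiler's instruction scheduling and on the hardware.

```cuda
#include <cstdint>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical integer step: one multiply-add on the INT32 datapath (wraps mod 2^32).
__host__ __device__ inline uint32_t int_step(uint32_t x) {
    return x * 1664525u + 1013904223u;
}

// Hypothetical FP32 step: one multiply-add on the FP32 datapath.
__host__ __device__ inline float fp_step(float x) {
    return x * 0.5f + 1.0f;
}

// Each thread advances an integer chain and a float chain in the same loop.
// The chains do not depend on each other, so the hardware is free to overlap them.
__global__ void mixed_kernel(const uint32_t* in_i, const float* in_f,
                             uint32_t* out_i, float* out_f, int n, int iters) {
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx >= n) return;
    uint32_t a = in_i[idx];
    float b = in_f[idx];
    for (int k = 0; k < iters; ++k) {
        a = int_step(a);  // integer work
        b = fp_step(b);   // floating-point work
    }
    out_i[idx] = a;
    out_f[idx] = b;
}

// Runs the kernel and compares against a host-side reference.
// Returns true only if every CUDA call succeeded and all results match.
bool run_mixed_demo(int n, int iters) {
    std::vector<uint32_t> h_i(n), r_i(n);
    std::vector<float> h_f(n), r_f(n);
    for (int i = 0; i < n; ++i) {
        h_i[i] = static_cast<uint32_t>(i);
        h_f[i] = static_cast<float>(i);
    }

    uint32_t *d_in_i = nullptr, *d_out_i = nullptr;
    float *d_in_f = nullptr, *d_out_f = nullptr;
    size_t bytes_i = n * sizeof(uint32_t), bytes_f = n * sizeof(float);
    bool ok = cudaMalloc(&d_in_i, bytes_i) == cudaSuccess
           && cudaMalloc(&d_out_i, bytes_i) == cudaSuccess
           && cudaMalloc(&d_in_f, bytes_f) == cudaSuccess
           && cudaMalloc(&d_out_f, bytes_f) == cudaSuccess;

    if (ok) {
        ok = cudaMemcpy(d_in_i, h_i.data(), bytes_i, cudaMemcpyHostToDevice) == cudaSuccess
          && cudaMemcpy(d_in_f, h_f.data(), bytes_f, cudaMemcpyHostToDevice) == cudaSuccess;
    }
    if (ok) {
        mixed_kernel<<<(n + 255) / 256, 256>>>(d_in_i, d_in_f, d_out_i, d_out_f, n, iters);
        ok = cudaGetLastError() == cudaSuccess
          && cudaMemcpy(r_i.data(), d_out_i, bytes_i, cudaMemcpyDeviceToHost) == cudaSuccess
          && cudaMemcpy(r_f.data(), d_out_f, bytes_f, cudaMemcpyDeviceToHost) == cudaSuccess;
    }

    // Host reference: x * 0.5f is exact for these inputs, so the float results
    // match bit-for-bit whether or not the device fuses the multiply-add.
    for (int i = 0; ok && i < n; ++i) {
        uint32_t a = h_i[i];
        float b = h_f[i];
        for (int k = 0; k < iters; ++k) { a = int_step(a); b = fp_step(b); }
        ok = (a == r_i[i]) && (b == r_f[i]);
    }

    cudaFree(d_in_i); cudaFree(d_out_i); cudaFree(d_in_f); cudaFree(d_out_f);
    return ok;
}
```

Calling `run_mixed_demo(1 << 20, 256)` from `main` and building with `nvcc` should be enough to try it. To see whether the two instruction types really overlap on a given GPU, you would have to time this kernel against integer-only and float-only variants, or look at it in a profiler.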