Was reading through the turing and volta architecture whitepapers, and was curious if a GPU could simultaneously perform multiple different operations https://syncnet.onl/telegram-web/ if they were of different precisions… https://snaptube.cam/ https://9apps.cam/
Could you be a little more specific?
Looking at the GV100 whitepaper…
The GV100 SM is partitioned into four processing blocks, each with 16 FP32
Cores, 8 FP64 Cores, 16 INT32 Cores, two of the new mixed-precision Tensor Cores for deep
learning matrix arithmetic, a new L0 instruction cache, one warp scheduler, one dispatch unit,
and a 64 KB Register File.
Therefore, integer and float instructions can be launched in parallel.