CUDA 9 FP16

“INCREASE APPLICATION THROUGHPUT WITH FP16 AND INT8 SUPPORT”

Can someone please outline the level of support for FP16 in CUDA 9 on Pascal GeForces and Titans?

You’re misreading that page: those comments refer to CUDA 8. Read the page from top to bottom and notice where the CUDA 9 section starts, where the CUDA 8 section starts, and where that quote falls relative to the start of the CUDA 8 section.

FP16 and INT8 support both appeared in CUDA 8, and for applications that can take advantage of them, they can increase throughput. This is not news.

Not a single Pascal GeForce card supports FP16 (Titan X (Pascal) and Titan Xp are still GeForce).

Out of all Pascal GeForce cards, only Titan X (Pascal), Titan Xp and 1080 Ti support INT8 inference.

Not at all, or at very low throughput? I thought it was the latter.

The latter. Native FP16 runs at 1/64 the FP32 rate on consumer Pascal cards, so there might as well be none: it’s much slower than “simulated” FP16 (FP16 storage but FP32 compute).
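To illustrate what “FP16 storage but FP32 compute” means, here is a minimal host-side sketch (plain Python, not CUDA code) using the standard library’s IEEE-754 half-precision pack format as a stand-in for FP16 storage; the `fp16_round` helper is hypothetical, named here only for illustration:

```python
import struct

def fp16_round(x: float) -> float:
    """Round x to the nearest IEEE-754 half-precision value.

    Packing with format 'e' quantizes to FP16 (the *storage* step);
    unpacking hands back a regular Python float, so any arithmetic
    afterwards runs at full precision (the *compute* step), which is
    the "simulated FP16" scheme: FP16 in memory, FP32 in the ALUs.
    """
    return struct.unpack('<e', struct.pack('<e', x))[0]

# Store a vector at FP16 precision, then accumulate at full precision.
data = [fp16_round(0.1 * i) for i in range(8)]
total = sum(data)  # the accumulation never drops back to FP16
```

Values exactly representable in FP16 (like 1.0 or 2.0) survive the round trip unchanged; something like 0.1 does not, which is the precision cost you pay for the halved memory footprint and bandwidth.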

Not sure what you mean by this. Every Pascal GeForce card supports the dp4a instruction at an effective 4x FP32 math throughput. The only Pascal chips that don’t support it are GP100 and Parker, but those have never been deployed in GeForce boards.
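For readers unfamiliar with dp4a (exposed in CUDA as the `__dp4a` intrinsic): it computes a four-way dot product of packed signed 8-bit integers, accumulated into a 32-bit integer. A host-side Python model of those semantics, assuming signed int8 lanes in little-endian byte order:

```python
import struct

def dp4a(a: int, b: int, c: int) -> int:
    """Model of dp4a: treat a and b as four packed signed int8
    lanes each, multiply lane-wise, and accumulate the four
    products into the 32-bit integer c."""
    a_lanes = struct.unpack('<4b', struct.pack('<I', a & 0xFFFFFFFF))
    b_lanes = struct.unpack('<4b', struct.pack('<I', b & 0xFFFFFFFF))
    for ai, bi in zip(a_lanes, b_lanes):
        c += ai * bi
    return c
```

For example, `dp4a(0x01020304, 0x01010101, 0)` sums the four bytes 4, 3, 2, 1 against all-ones lanes. One instruction doing four multiplies plus an add is where the effective 4x-over-FP32 throughput for INT8 inference comes from.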