CUDA and double-precision floating-point arithmetic

Brano_2014 · March 27, 2012, 10:48am

This is what I found: "Cores perform only single-precision floating-point arithmetics. There is 1 double-precision floating-point unit."

Is this true for all compute capabilities (versions)?

The NVIDIA CUDA C Programming Guide 4.1, section F.4.1 (p. 144), says “32 CUDA cores for integer and floating-point arithmetic operations”. By “floating-point”, do they mean both single and double precision?

cgorac · March 27, 2012, 11:42am

In the CUDA 4.2 Programming Guide, the sentence you cited contains a pointer to section 5.4.1. That section is in the 4.1 Programming Guide too, and it details the throughput of specific arithmetic instructions. There you can see, for example, that with CC 2.0 double-precision operations have half the throughput of single-precision operations, and so on.

Brano_2014 · March 27, 2012, 12:07pm

Great :)

Thank you

parallelis · March 28, 2012, 7:03pm

The ratio between SP and DP throughput varies widely depending on the generation of the GPU and on whether you are using a professional Tesla GPU or a consumer card. For example, the GTX 680 has a ratio of 1/16 (1 DP operation for every 16 SP operations), while you may expect a 1/2 ratio on Tesla cards (1 DP operation for every 2 SP operations).
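
To make the ratio arithmetic concrete, here is a minimal Python sketch that turns a DP:SP ratio into an estimated double-precision throughput. The ratios are the ones quoted in this thread and are not verified here; the function name `dp_throughput` and the single-precision peak of 1000 GFLOP/s are hypothetical values chosen only for illustration, not specifications of any real card.

```python
from fractions import Fraction


def dp_throughput(sp_throughput, dp_to_sp_ratio):
    """Estimate DP throughput from SP throughput and a DP:SP ratio."""
    return sp_throughput * dp_to_sp_ratio


# Ratios as quoted in this thread (assumptions, not checked against specs).
RATIOS = {
    "CC 2.0 / Tesla (as quoted)": Fraction(1, 2),
    "GTX 680 (as quoted)": Fraction(1, 16),
}

# Hypothetical single-precision peak in GFLOP/s, for illustration only.
SP_PEAK_GFLOPS = 1000

for name, ratio in RATIOS.items():
    dp = dp_throughput(SP_PEAK_GFLOPS, ratio)
    # 1 / ratio is how many SP operations fit in the time of one DP operation.
    print(f"{name}: about {float(dp):.1f} GFLOP/s DP, {1 / ratio}x slower than SP")
```

The same function works with per-clock figures instead of GFLOP/s: with 32 single-precision operations per clock per multiprocessor and a 1/2 ratio, it gives 16 double-precision operations per clock.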
