Floating Point Precision of GPU

Hi,

I am using a 9800 GT with the latest driver and CUDA version.
I am doing some floating-point additions, multiplications, and divisions on both the CPU and the GPU.

But my GPU results differ from my CPU results.

Here is what the output (floating-point values) looks like on the CPU and the GPU:

On the CPU:

23.975410 26.500000 27.000000
24.000000 26.434782 27.000000
23.929729 27.000000 27.000000

23.500000 26.212872 27.000000
23.841270 26.000000 27.000000
23.000000 26.235556 27.000000

23.687500 26.000000 26.500000
23.489796 26.000000 26.000000
23.841270 26.000000 27.000000

On the GPU:

23.991804 26.500000 27.000000
24.000000 26.478260 27.000000
23.935135 27.000000 27.000000

23.500000 26.202971 27.000000
23.825397 26.000000 27.000000
23.000000 26.231112 27.000000

23.669643 26.000000 26.500000
23.469387 26.000000 26.000000
23.825397 26.000000 27.000000

What are my options for correcting this on my 9800 GT?

Is double precision supported on my card? (According to Wikipedia, it seems it isn't.) If not, what are my options?

This is normal. See the appendix in the programming guide describing the accuracy of floating-point operations on GPUs.

What makes you sure that they need correcting? Have you compared SSE vs x87 results on the CPU side?
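
For example, with gcc on x86 the same source line can produce different answers depending on whether intermediates are kept in 80-bit x87 registers or rounded to 32 bits in SSE registers. A minimal host-side sketch (the file name, function name, and input values are just illustrative; -mfpmath is a gcc option):

    /* excess.c -- compile twice and compare the output:
         gcc -O2 -mfpmath=387 excess.c -o x87
         gcc -O2 -mfpmath=sse excess.c -o sse
       With x87 math the product a*b may be held at 80-bit precision
       before the add; with SSE it is rounded to 32 bits first. */
    #include <stdio.h>

    float mul_add(float a, float b, float c)
    {
        return a * b + c;
    }

    int main(void)
    {
        /* volatile keeps the compiler from folding this at compile time */
        volatile float a = 1.0000001f, b = 0.9999999f, c = -1.0f;
        printf("%.9g\n", mul_add(a, b, c));
        return 0;
    }

With SSE math the product typically rounds to exactly 1.0f and the program prints 0; with x87 excess precision a tiny residual can survive the add. The exact output depends on compiler version and flags, which is rather the point.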

If you actually need double precision (think carefully about this), your options are:

  1. Get a new GPU. The GTX 200 and 400 series do native double precision. You still probably won’t get the same answer as the CPU, due to a number of factors (the order of operations changes in a parallel calculation, the CPU may use extended 80-bit precision in some cases, etc.). Also be prepared for a performance hit, although a GTX 470 working in double precision might be competitive with a 9800 GT in single precision…

  2. Emulate near-double precision using techniques ported from the dsfun90 library (a sketch of the basic add follows after this list). The double-single format used in those algorithms has 48 bits of mantissa, rather than the 53 bits of true double precision. This is more than 10x slower than single precision. (The best case is addition, at around 11x; everything else is much slower.)

  3. Use a technique like Kahan summation to limit round-off error in long sums (see the second sketch below). If that isn’t the reason you need double precision, then obviously Kahan summation won’t help. :)
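
To illustrate option 2, below is a sketch of the basic double-single addition in the dsfun90 style, ported to CUDA. The name ds_add is mine, not necessarily the library’s; a value is carried as a float2 whose .x holds the leading word and .y the trailing error term:

    __device__ float2 ds_add(float2 a, float2 b)
    {
        /* Knuth two-sum: s + e is exactly a.x + b.x */
        float s = a.x + b.x;
        float v = s - a.x;
        float e = (a.x - (s - v)) + (b.x - v);

        /* fold in the low-order words, then renormalize */
        e += a.y + b.y;
        float2 r;
        r.x = s + e;
        r.y = e - (r.x - s);
        return r;
    }

This relies on every add and subtract rounding exactly as written; nvcc does not reassociate floating-point expressions by default, but keep that requirement in mind if you experiment with optimization flags.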

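For option 3, a minimal Kahan (compensated) summation sketch; kahan_sum is an illustrative name:

    __device__ float kahan_sum(const float *x, int n)
    {
        float sum = 0.0f;
        float c   = 0.0f;          /* running compensation */
        for (int i = 0; i < n; ++i) {
            float y = x[i] - c;    /* apply the correction from the last step */
            float t = sum + y;     /* low-order bits of y are lost here...    */
            c = (t - sum) - y;     /* ...and recovered into c                 */
            sum = t;
        }
        return sum;
    }

The same caveat applies: compensated summation only works if the compiler does not reassociate the additions.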