floating point operations

gibo · May 15, 2010, 10:23pm

hi everybody

i’m just passing some C functions of my program to CUDA kernels.

When i compare 2 results calculated in these 2 different ways to control if parallel kernels do their jobs well and both calculations match, i find errors on floating point operations (i’m working in single precision).
These errors are around 10^(-5) and i don’t think that it’s a gpu fault, since i know that it supports IEEE 754 (my graphic card is GTX295).

Assuming that my algorithm is right (i’m still controlling everything…) is it normal to have this kind of errors for some other reasons?!
Do they occur in some in particular operations?

i call device functions from kernel, just internal function i had in C language, adapted in cuda and transformed in device functions.
Can it influence anything?

thanks

Lev · May 15, 2010, 11:57pm

Do you use something like exp sincos etc?

gibo · May 16, 2010, 8:51am

Yes i do…is there any problem involving SFUs?

avidday · May 16, 2010, 8:56am

There is no IEEE 754 single precision conformance on the GT200. Double precision is IEEE754 compliant. If you are seeing 5 or 6 decimal places of agreement in single precision, then that is about as good as you can reasonably expect using single precision arithmetic anyway.

gibo · May 16, 2010, 9:07am

thanks, i didn’t know about this feature of GT200.

Last thing i haven’t clear: using double precision involves SFUs instead of SPs, isn’t it?

gibo · May 16, 2010, 9:39am

are u sure about that??? and i’m using a GTX…

Lev · May 16, 2010, 9:44am

You need to read programming developing guide, everything about precision is described there.

avidday · May 16, 2010, 9:45am

The GT200 is the name of the underlying GPU used in the GTX260/275/280/285/295, Telsa C1060/M1060/S1070, Quadro FX4800/5800 and Quadro Plex 2200 series. It has IEEE754 compliant double precision, but not single precision. IEEE754-2008 compliant single precision was only introduced with the new GF100 “Fermi” family of parts.

gibo · May 16, 2010, 9:52am

you’re right, i jumped appendixes, where everything about precision is explained.

from the table, it seems full IEEE 754 is in add and multiply only.

Thanks your for your time

_Big_Mac · May 16, 2010, 9:58am

ieee754 compliance or not, you can never expect two different pieces of hardware return the same exact results. 754 compliance in Fermi means specified error bounds for some operations (that had non-754 bounds before) and some rounding stuff. Note that even the specification itself includes error ranges, you have to expect different results.

The discrepancy you see, around E-5, is normal. Single precision floats can only represent about 6 significant digits (in decimal notation) and any kind of computation is likely to introduce errors to the least significant ones. Functions such as exp, sin etc. will introduce bigger errors. You will see this kind of small differences even while running your code on the CPU with different compile options.

gibo · May 16, 2010, 10:19am

thank for your complete explanation

now i feel myself lucky for other cases in which i’ve found a total match doing an if == compare between floats
only sin effects made results unmatched

Ailleur · May 16, 2010, 11:41am

== shoud never be used with floats, no matter the platform.
Always check if the two values are inside some epsilon.

gibo · May 16, 2010, 1:00pm

Yes of course i’ve usually checked within a epsilon.

But when i started to find errors where they shouldn’t be, i did some checks more

seibert · May 16, 2010, 5:46pm

It’s also worth pointing out that IEEE-754 does not specify the precise rounding behavior of transcendental functions. (This is due to the “Table Maker’s Dilemma.”) Two IEEE-754 compliant implementations have to give the the same answer for the same sequence of basic floating point operations, but once you throw a sin() in there, things can be different.

Topic		Replies	Views
Floating Point Accuracy CUDA Programming and Performance	11	30438	April 6, 2013
Precision in Tesla Suitability of GPUs for some applications CUDA Programming and Performance	17	5623	January 12, 2009
IEEE-754 compliant division CUDA Programming and Performance	5	10124	November 26, 2008
Float precision error in matrix multiplication application. CUDA Programming and Performance	14	3609	February 27, 2014
Double precision Accuracy with sqrt, log math functions Results on CPU & GPU are not exactly sam CUDA Programming and Performance	9	5449	April 12, 2012
floating point processor of GPUs CUDA Programming and Performance	7	4424	August 28, 2015
CUDA different results when running on different driver? CUDA Programming and Performance	6	2082	March 6, 2010
More information about double precision in Guide? CUDA Programming and Performance	4	4052	May 30, 2008
Float accuracy CUDA Programming and Performance	16	9386	July 22, 2010
FMA precision issue CUDA Programming and Performance	9	19380	November 21, 2010

floating point operations

Related topics