Hi,
The following kernel seems to run into trouble on the Tesla T10 GPU:
[codebox]
/*--------------------------------------*/
#define COUNT 256
__global__ void findSum(numType *cu_result)
{
    numType temp = 0;
    for (int i = 0; i < COUNT; i++)
    {
        temp += 80189;
    }
    *cu_result = (temp / COUNT);
    return;
}
/*-----------------------------*/[/codebox]
When numType is “float”, output is: 80188.8
When numType is “double”, output is: -2.22507e-308
In both cases I call the kernel in the following manner:
[codebox]
numType *cu_result;
cudaMalloc((void**)&cu_result, sizeof(numType));
findSum<<<1,1>>>(cu_result);
[/codebox]
So there is just 1 block with 1 thread.
Am I doing something wrong?
Best,
Raghavan
I’ll bet you’re not compiling with the -arch sm_13 flag to enable double precision.
Indeed I was not! This takes care of the case where “numType” is “double”.
But the single precision problem still exists. I guess this really is a bug
in the hardware??
This seems to be a single-precision issue rather than a CUDA-specific
issue. Never mind…
Nope, your Tesla (and your floats) are working just like they should be.
A float can’t represent every integer above 2^24 exactly, so your running sum accumulates rounding losses.
If you do the same accumulation loop on the CPU, you’ll get the identical 80188.8 result as the Tesla.
Floating point seems so simple, but it’s sometimes fiendish! A long but useful guide:
http://citeseerx.ist.psu.edu/viewdoc/summa…=10.1.1.22.6768
You can almost summarize it by saying “be careful adding numbers of different magnitudes, and don’t trust the difference between two numbers of the same magnitude.”
In this case, you’re doing the former by adding values like 80189.0 to numbers that are two orders of magnitude larger, like 20528330.0.