Simple division operation is different in CPU and GPU, why?

CUDAkk · June 8, 2009, 11:25am

Hello All,

simple division operation is differing in CPU and GPU…

double op = 1080/(double)600;

the op value is 1.80000000 in CPU
and in GPU , it is 1.799999952.

When we use calculator for 1080/600, it gives exact 1.8
then why is the difference in GPU?
how can we set both GPU and CPU are same?

shifter1 · June 8, 2009, 12:29pm

You need to read the documentation. CPUs use 80 bit temporary registers to do floating point work, where the the GPU (9000 series and lower) uses 32 bit floats.

_Big_Mac · June 8, 2009, 1:39pm

It’s a problem of using floats vs doubles here. On my machine:

#include <cstdio>

int main() {

	float fa = 1080/(float)600;

	double da = 1080/(double)600;

	printf("%.10f \n%.10f \n",fa, da);

	return 0;

}

prints:

1.7999999523

1.8000000000

What you should also know is that such differences are expected in numerical computations. One thing is whether the numbers are stored as floats or as doubles, another is how the computations are implemented. IEEE754 doesn’t define the exact results of various functions, only the bit representation of numbers. Such small differences are exactly why floating point numbers should always be measured with a certain epsilon.

Additionally, CUDA implements division in a non-standard way using fast reciprocal.

Floating point computations carried out by programs compiled with different compilers (not to mention running on radically different hardware) will come out slightly different and one has to live with it.

Romant · June 8, 2009, 6:54pm

Hi,

I’ve also faced such a problem some time ago. Reasons of this issue are explained in previous posts, possible solution is to use doubles for division (of course you should have G200-bases hardware for it), but this approach also won’t fix the case when the result of division simply can’t be exactly represented in float due to 4 bytes limitation.

Hope this helps.

cvnguyen · June 8, 2009, 7:04pm

By default, the compiler generates code for 1.0 hardware which does not support double-precision floating-point. Did you put in the option -arch=sm13 ?

Manjunath_Gudisi · June 9, 2009, 5:17am

Not only 1.3 architectute and also 1.3 GPU Hardware.

CUDAkk · June 9, 2009, 5:43am

Thanks for valuable hints. We have to set not only arch but also hardware.

Topic		Replies	Views
Floating points CUDA Programming and Performance	3	2060	October 28, 2010
discrepancy between CPU and GPU after a division (accuracy issue) CUDA Programming and Performance	3	1494	June 10, 2015
Floats and floats... difference between CPU and GPU? CUDA Programming and Performance	12	14075	February 2, 2010
floating point error Error with floating point division CUDA Programming and Performance	9	8377	November 30, 2007
Accuracy problem I'd even say inaccuracy ... CUDA Programming and Performance	6	2783	June 28, 2008
Why are the calculations different between CPU and GPU? CUDA Programming and Performance	2	840	February 7, 2020
Floating point operations difference between CPU and GPU CUDA Programming and Performance	5	16841	November 16, 2012
floating point precision CUDA Programming and Performance	3	1462	April 10, 2009
cuda and double-precision floating-point arithmetics CUDA Programming and Performance	3	1881	March 28, 2012
I have two question. CUDA Programming and Performance	11	6914	December 2, 2007

Simple division operation is different in CPU and GPU, why?

Related topics