NVIDIA Developer Forums

Wrong results for double precision calculations Not setting arch=sm_13 causes incorrect results (onl

Accelerated Computing CUDA CUDA Programming and Performance

sWienke October 26, 2010, 3:20pm 1

Hi,

I have a CUDA program where the kernel uses doubles, e.g.

__global__ void saxpy_parallel(int n, double a, double *x, double *y)

{

	int i = blockIdx.x*blockDim.x + threadIdx.x;

	if (i < n){

	  y[i] = a*x[i] + y[i];

	 }

}

If I set the GPU Architecture to sm_13 everything is fine. However, using the default sm_10 architecture (where double is not supported) I get incorrect results when running the application, even if I don’t use the double precision!

The compiler just prompts a warning: “warning : Double is not supported. Demoting to float”

Can anyone reproduce the problem?

The funny thing is, that (using the same GPU hardware and CUDA program) I also get the warning, but the application produces correct(!) results. Why not on windows?

Greets, Sandra

My system:

Windows Server 2008, 64-bit

Cuda Toolkit 3.2 (version included in Parallel NSight installation)

2 GPUs of Tesla S1070 (cc13)

Visual Studio 2008

Driver 260.93

sWienke October 26, 2010, 3:20pm 2

Hi,

I have a CUDA program where the kernel uses doubles, e.g.

__global__ void saxpy_parallel(int n, double a, double *x, double *y)

{

	int i = blockIdx.x*blockDim.x + threadIdx.x;

	if (i < n){

	  y[i] = a*x[i] + y[i];

	 }

}

If I set the GPU Architecture to sm_13 everything is fine. However, using the default sm_10 architecture (where double is not supported) I get incorrect results when running the application, even if I don’t use the double precision!

The compiler just prompts a warning: “warning : Double is not supported. Demoting to float”

Can anyone reproduce the problem?

The funny thing is, that (using the same GPU hardware and CUDA program) I also get the warning, but the application produces correct(!) results. Why not on windows?

Greets, Sandra

My system:

Windows Server 2008, 64-bit

Cuda Toolkit 3.2 (version included in Parallel NSight installation)

2 GPUs of Tesla S1070 (cc13)

Visual Studio 2008

Driver 260.93

Topic		Replies	Views	Activity
Lack double support? CUDA Programming and Performance	2	1364	May 5, 2010
Double precision in CUDA 2.3 CUDA Programming and Performance	5	38199	March 5, 2010
Using double precision in CUDA how to turn on double precision in CUDA CUDA Programming and Performance	2	3056	July 27, 2008
How to activate double-precision computation CUDA Programming and Performance	4	30385	September 14, 2009
Problem with running code with double precision values Double precision gives wrong result CUDA Programming and Performance	2	1219	August 28, 2009
Double doesnt work even with -arch=sm_13 CUDA Programming and Performance	0	744	June 10, 2011
Unable to do double precision calcs CUDA Programming and Performance	4	2165	April 7, 2009
worked fine for "int" "float" but NOT "double" CUDA Programming and Performance	13	5027	March 9, 2009
Using double precision CUDA Programming and Performance	4	3409	September 15, 2009
-arch sm_13 business CUDA Programming and Performance	2	544	March 30, 2019