Switching kernel from single to double precision execution fails.

Skybuck · January 16, 2014, 6:22am

// typedef float TFloatingPointPrecision;
typedef double TFloatingPointPrecision;

struct TParticle
{
    TFloatingPointPrecision mStartX;
    TFloatingPointPrecision mStartY;

    TFloatingPointPrecision mStopX;
    TFloatingPointPrecision mStopY;

    TFloatingPointPrecision mDirectionX;
    TFloatingPointPrecision mDirectionY;

    TFloatingPointPrecision mSpeed;

    TFloatingPointPrecision mCurrentX;
    TFloatingPointPrecision mCurrentY;

    int mColor;
};

If I switch this kernel from single to double precision, the kernel fails.

Any idea what could be wrong?

All code is basically very simple.

Only thing I can think of is that the kernel is somehow running out of memory.

But with 1 GB of RAM that shouldn’t be happening…
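A rough host-side way to check the memory theory is to compare the struct size at both precisions. The sketch below is only an illustration: `TParticleT` and `ParticleBytes` are hypothetical names, not part of the kernel, and it assumes a typical ABI with 4-byte `float` and `int`, 8-byte `double`, and 8-byte alignment for `double`.

```cpp
#include <cstddef>

// Hypothetical templated copy of TParticle, so both precisions can be
// compared side by side. The field order matches the struct in the post.
template <typename T>
struct TParticleT
{
    T mStartX, mStartY;
    T mStopX, mStopY;
    T mDirectionX, mDirectionY;
    T mSpeed;
    T mCurrentX, mCurrentY;
    int mColor;
};

// Bytes needed to store `count` particles at precision T,
// including any padding the compiler adds to each element.
template <typename T>
std::size_t ParticleBytes(std::size_t count)
{
    return count * sizeof(TParticleT<T>);
}
```

Under those assumptions the float version takes 40 bytes per particle and the double version 80 (76 bytes of fields plus 4 bytes of tail padding). For example, `ParticleBytes<double>(1000000)` would be about 80 MB, so the particle array alone is unlikely to exhaust 1 GB unless the particle count is very large.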

Compiled from TextPad with these parameters:

cuda toolkit 5.5:
$File --ptx --device-debug --machine 32 -arch sm_20

cuda toolkit 4.2:
$File --ptx -G0 --machine 32 -arch sm_20 -Xptxas -v

No luck with double precision.

The graphics card is a GT 520 with compute capability 2.1 support?!

(I can’t debug it at the moment, since VS2010 with CUDA toolkit 5.5 is giving me the craps.)

Skybuck · January 16, 2014, 6:35am

If I change the structure to this, cuModuleLoad fails completely: it crashes with an invalid floating point operation exception. Perhaps the structure is not packed properly?

It’s starting to look like a CUDA compiler or driver API loader issue. The file being loaded is PTX.

typedef float TFloatingPointPrecision;

struct TParticle
{
    TFloatingPointPrecision mStartX;
    TFloatingPointPrecision mStartY;

    TFloatingPointPrecision mStopX;
    TFloatingPointPrecision mStopY;

    TFloatingPointPrecision mDirectionX;
    TFloatingPointPrecision mDirectionY;

    TFloatingPointPrecision mSpeed;

    TFloatingPointPrecision mCurrentX;
    double mCurrentY; // changed 1 field to see what effect it has ?!? crashes the load ?!

    unsigned int mColor;
};
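To test the packing question, the mixed struct can be laid out twice on the host, once with the compiler's natural alignment and once with padding disabled, and the two compared. This is a sketch under stated assumptions: `TParticleNatural`, `TParticlePacked` and `StrideMismatch` are hypothetical names, `#pragma pack` is assumed to be supported by the host compiler (GCC, Clang and MSVC accept it), and `double` is assumed to be 8-byte aligned.

```cpp
#include <cstddef>

// Hypothetical copy of the mixed struct, using the compiler's natural alignment.
struct TParticleNatural
{
    float mStartX, mStartY;
    float mStopX, mStopY;
    float mDirectionX, mDirectionY;
    float mSpeed;
    float mCurrentX;
    double mCurrentY;
    unsigned int mColor;
};

// Same fields with padding disabled, as a packed host-side record would have.
#pragma pack(push, 1)
struct TParticlePacked
{
    float mStartX, mStartY;
    float mStopX, mStopY;
    float mDirectionX, mDirectionY;
    float mSpeed;
    float mCurrentX;
    double mCurrentY;
    unsigned int mColor;
};
#pragma pack(pop)

// Difference in bytes between the element size of the two layouts.
// A non-zero value means arrays of particles would not line up.
constexpr std::size_t StrideMismatch()
{
    return sizeof(TParticleNatural) - sizeof(TParticlePacked);
}
```

With 8-byte alignment for `double`, both layouts place `mCurrentY` at offset 32 and `mColor` at offset 40, because the eight preceding floats already fill 32 bytes. The naturally aligned struct is padded to 48 bytes while the packed one is 44, so a mismatch would only show up from the second array element onward. A layout difference like this would corrupt data at run time; it would not by itself explain a crash inside the module load.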

I’ll try a graphics driver update to see if that helps…

Updating the drivers did not help, unfortunately :( Loading kernel.ptx still crashes…

Skybuck · January 16, 2014, 7:14am

Interestingly enough, the module load function now completely fails, even if all fields are floats.

Perhaps the driver API changed in 5.5, or the issue is more severe?! Hmm…
