CUDA has turned into CRAP (cuModuleLoad fails, bug in compiler/driver ?)

Skybuck · April 24, 2014, 3:43pm

Hello,

CUDA Driver API cuModuleLoad fails as shown in this video:

cuModuleLoad raises a floating point exception which prevents my application from running.

What is even more weird is that depending on which cuda toolkit compiler version and settings was used the 32 bit floating point version might or might not load/run or the 64 bit floating point version might or might not run.

The following winrar file contains (two versions) 32 bit and 64 bit floating point kernels and matching executables (Float vs Double):

http://www.skybuck.org/CUDA/Cuda5And6HasTurnedIntoCrap.rar

If compiled with cuda toolkit 4.2 in debug mode the 32 bit floating point kernel will load and run.

If compiled with cuda toolkit 6.0 in release mode the 64 bit floating point kernel will load and run.

All others will seem to fail.

Try compiling the kernel yourself into a ptx file on your system and see the results for yourself !

If it runs it would be nice to get a video of it as proof just for the fun it !

Bye,
Skybuck.

vacaloca · April 24, 2014, 4:23pm

I would suggest making the most isolated case possible. As can be seen by your video, there is quite a lot of external code that goes into calling the kernel, and the bug is probably elsewhere.

For what it’s worth, I compiled your kernel as nvcc CudaOpenGLKernelDouble.cu -arch=sm_35 --ptx
(with and without -G) flag.

For -G flag I get an access violation of nvcuda.dll, a division by zero error, the app opens to a black screen, and some CUDA_SUCCESS messages, although clearly nothing is working.

Without the -G flag I get a green screen and a bunch of CUDA launch fails.

Skybuck · April 24, 2014, 7:03pm

The division by zero is caused because there is no kernel loaded.

The code assumes the kernel was loaded and so forth.

If the kernel loaded properly there will be no division by zero.

I am debugging the Delphi code right now… it seems the mHandle is nil… (in the call to cuModuleLoad) that’s clearly a problem… investigating.

mCudaErrorCode := cuDeviceGet( mHandle, mNumber );

^ This API call is supposed to set the mHandle to something.

It’s returning nil/0 if mNumber is zero ? WTF ?!

Skybuck · June 20, 2014, 10:06pm

According to the cuda.h header cuDeviceGet returns handles within legal range 0 to N-1.

So it returning a handle of zero seems to be valid, so that’s probably not the problem.

Thus I now believe the problem is caused by the cuda compiler and/or cuda driver generating/and/or loading the ptx instructions.

Perhaps it’s a “just-in-time bug” lol, like a “just-in-time comepiler” :)

Testing ptx instructions, floats, doubles, and just in time compiling inside the driver is well beyond “my scope”. I have no tools available to investigate problems like these ?

Perhaps nvidia also has no or limited tools available to diagnose these kinds of problems.

Any help/suggestions/tools how to debug “ptx issues” and or “just-in-time compiler/driver/bugs” is welcomed.

Skybuck · July 20, 2014, 3:28am

Little possibly insignificant update to this problem:

The test application creates a special cuda context with cuGLCtxCreate instead of cuCtxCreate. cuGLCtxCreate has been deprecated, I am not sure if using this special context is creating the problem.

Bye,
Skybuck.

Topic		Replies	Views
CUDA_ERROR_NO_BINARY_FOR_GPU loading PTX ? Can't load PTX 'image' no matter what I do.. CUDA Programming and Performance	2	7362	April 5, 2012
Some troubles with cuda driver api in 64-bit mode CUDA Programming and Performance	2	831	June 23, 2011
Error compiling test program CUDA Programming and Performance	1	5629	July 16, 2009
Kernel fails to load "invalid floating point operation" CUDA Programming and Performance	2	681	March 27, 2015
Problem with cuModuleLoadDataEx CUDA Programming and Performance	1	1367	June 14, 2011
Help!! I can't get my NVidia GeForce GT 525M to load in a single CUDA PTX kernel!! CUDA Programming and Performance	11	5944	November 16, 2012
cuModuleLoad fails to load PTX for higher architectures than gpu. (documentation issue or so). CUDA Programming and Performance	3	1078	February 12, 2015
cuda .Net. LoadModule raise GASS.CUDA.CUDAException CUDA Programming and Performance	1	2298	August 5, 2011
cuModuleLoad seg fault with cubin file CUDA Programming and Performance	0	890	September 16, 2010
CUDA 4.0 + VS2010 + CUDA.net CUDA Programming and Performance	0	633	December 25, 2011

CUDA has turned into CRAP (cuModuleLoad fails, bug in compiler/driver ?)

Related topics