Unknown Error

Pittsburgh · October 16, 2008, 2:53pm

I get an “unknown error” on any CUDA code line following my kernel function call. For example, if the first CUDA code following my kernel call is:

CUT_CHECK_ERROR("Kernel execution failed");

I get the error:

“Cuda error: Kernel execution failed in file ‘template.cu’ in line 124 : unknown error.”

If I instead replace the CUT_CHECK_ERROR with:

CUDA_SAFE_CALL(cudaMemcpy(h_num_c, d_num_c, sizeof(unsigned int), cudaMemcpyDeviceToHost));

I get a similar error:

“Cuda error in file ‘template.cu’ in line 132 : unknown error.”

My code is attempting to launch the kernel with 118 blocks with 256 threads each. I have used cubin to make sure that I am not exceeding shared memory or the amount of registers. (smem = 40, reg = 8)

DeviceEmu gives me the result I expect, so I do not believe there is some problem in the kernel code (such as a hang or indexing an array out of bounds).

Has anyone else seen a similar “unknown error”?

Thanks.

Reimar · October 17, 2008, 6:07am

DeviceEmu unless combined with valgrind is useless to check for out-of-bounds access. On the CPU most out-of-bound accesses does not lead to a visible error.

Pittsburgh · October 27, 2008, 1:55am

Thanks for the information Reimar.

This error has returned for me. I was getting an error with the 177.84 driver that told me my kernel execution had timed out and was terminated, so I tried installing the latest driver for 2.0 (178.08). Now I again get the “Unknown Error”: Cuda error: Kernel execution failed in file ‘template.cu’ in line 76 : unknown error. (line 76 is the kernel launch line) The screen also flickers when this occurs and then goes back to normal (this is when I see the error printed).

From the .cubin file that I have generated I see no reason why the kernel launch should fail. In the above code, the number of blocks is 1101 and block size is 128. Shared memory usage is 3240 per block and only 13 registers per block are used (my GPU is an 8800 GTS).

Also, the amount of memory for the verticies passed to the device is roughly 35 MB, hardly enough to cause a problem. Moreover, I normally see a memory exceeded error if this was truly the problem.

My kernel is a graph coloring algorithm and I do not get the error for all inputs to my kernel. There are some graphs that I can input without issue. (DebugEmu and ReleaseEmu, as mentioned, also always yield an appropriate result.)

Is there anything I may be missing regarding why my kernel is failing to launch or timing out during execution? What are some clues I should be looking for to rootcause the issue?

Thanks for any help.

Malar · December 1, 2012, 7:44am

Hai,
I am having an issue in my cuda programming. When i run my code the output window closes with unknown error at the line where i did memcpy. But when i run in nsight-> start cuda debugging it works. Can any one help me please?

bw727 · October 17, 2018, 10:34am

when I rollback to previous driver: 9/1/2018

cudaErrorInsufficientDriver(35)

Topic		Replies	Views
Display driver crashes & "unknown error" on cudaMemcpyDeviceToHost CUDA Programming and Performance	0	3376	December 11, 2009
cutilCheckMsg("kernel launch failure"); unknown error. CUDA Programming and Performance	1	1348	October 27, 2010
kernel not executed, profiler reports all-zeros CUDA Programming and Performance	18	11151	December 2, 2008
Getting around apparent CUDA bugs CUDA Programming and Performance	5	1075	September 20, 2011
cudaMemcpy unknown error CUDA Programming and Performance	2	835	February 8, 2012
unknown error from cudaMemCpy Get cuda unknown error for unknown reason CUDA Programming and Performance	9	6356	December 3, 2010
WEIRD cudaMemcpy error CUDA Programming and Performance	2	4426	November 15, 2011
random kernel execution failure with unknown error CUDA programming on Linux CUDA Programming and Performance	9	8744	June 11, 2008
Emulation works, GPU doesn't. Newbie question. CUDA Programming and Performance	2	2788	September 3, 2009
Unspecified launch failure 4 kernel calls CUDA Programming and Performance	11	5247	April 2, 2008

Unknown Error

Related topics