Kernel execution failed: Too many resources..

Hello people,

I’m sorry if this is a repeat question; I tried to search but didn’t find anything relevant. I just wanted to ask: in what situations can a CUDA kernel exit with the following error?

“Kernel execution failed: too many resources requested for launch”

I have a kernel that allocates a large amount of data in global memory, and it works correctly. When I try to access the same data through textures backed by CUDA arrays (textures are limited to 65536 in the x dimension, so I wrap the bytes around), I get the above failure.
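Roughly, the wrap-around indexing I mean looks like this (just a minimal sketch using the texture-reference API; the texture name, width, and kernel are placeholders, not my actual code):

    // Large linear data stored in a 2D CUDA array of width TEX_WIDTH;
    // each thread converts its linear index into (x, y) texture coordinates.
    #define TEX_WIDTH 16384   // kept under the 65536 x-dimension limit

    texture<float, 2, cudaReadModeElementType> dataTex;

    __global__ void readWrapped(float *out, int n)
    {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) {
            int x = i % TEX_WIDTH;   // column within the texture row
            int y = i / TEX_WIDTH;   // row the element wrapped into
            // +0.5f fetches the texel centre with unnormalized coordinates
            out[i] = tex2D(dataTex, x + 0.5f, y + 0.5f);
        }
    }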

Anjul

I’ve only seen that message when requesting a block dimension with more threads than can run with the given register usage. There are 8192 registers per multiprocessor, so the largest block you can run is 8192 divided by the registers per thread (reported in the cubin), up to the device maximum of 512 threads.

It might also show up if you request more shared memory than is available (16 KB).
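As a back-of-the-envelope check (the per-thread register count below is only an example; plug in whatever your cubin reports):

    #include <cstdio>

    #define REGS_PER_THREAD   17     // example value, taken from the cubin
    #define REGS_PER_MP       8192   // register file per multiprocessor (G80)
    #define MAX_BLOCK_THREADS 512    // hardware limit per block

    int maxLaunchableBlockSize(void)
    {
        int byRegisters = REGS_PER_MP / REGS_PER_THREAD;   // 8192 / 17 = 481
        return byRegisters < MAX_BLOCK_THREADS ? byRegisters : MAX_BLOCK_THREADS;
    }

    int main(void)
    {
        printf("largest block that should launch: %d threads\n",
               maxLaunchableBlockSize());
        return 0;
    }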

Thanks! I do find that I’m exceeding 8192.

I too am now getting the “too many resources requested for launch” error. It is occurring in a kernel that worked before I installed 1.1. The kernel uses 17 registers/thread and 64 threads/block, so it is well under the 8192 limit, and the shared memory usage is 8336 bytes. I have not changed any of the code around the kernel call, so in theory it is using the default stream 0, and I have also tried running the kernel with the stream set explicitly to 0. The guide says that this should cause kernel launches and memory copies to wait for all preceding operations, but is it possible that the (similar) shared memory use from a preceding or following kernel could be causing this error?
Any help is appreciated.

To be exact: the maximum is 512 threads per block and 768 threads per multiprocessor.

This seems odd. Are you requesting any “dynamic” shared memory in the kernel invocation? Perhaps it is being set to an uninitialized variable or something.
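I mean the optional third argument in the launch configuration; the kernel and names below are only an example, but a garbage value in that slot can blow past the 16 KB limit and give exactly this error:

    // Dynamic shared memory is requested by the third <<< >>> argument.
    __global__ void useDynamicShared(float *out)
    {
        extern __shared__ float dynBuf[];   // backed by the dynamic allocation
        dynBuf[threadIdx.x] = (float)threadIdx.x;
        __syncthreads();
        out[threadIdx.x] = dynBuf[threadIdx.x];
    }

    void launch(float *d_out, int threads)
    {
        size_t dynBytes = threads * sizeof(float);   // must be set deliberately;
                                                     // an uninitialized value here
                                                     // can exceed the 16 KB limit
        useDynamicShared<<<1, threads, dynBytes>>>(d_out);
    }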

No, the shared memory has a constant size, allocated in the kernel. I’ve tried running the kernel by itself, and also without using shared memory (it goes to local memory instead), and I get the same result. I’m going to keep fiddling; maybe I can isolate the cause.

Okay, the kernel has a line:
nCands[candIdx] = Min(Min(tempCands[0], tempCands[1]), Min(tempCands[2], tempCands[3]));

where

__device__ float3 Min(float3 A, float3 B)
{
    if (A.z < B.z)
        return A;
    else
        return B;
}

nCands and tempCands[i] are float3. The kernel still fails when I change this to

float3 tempF = Min(Min(tempCands[0], tempCands[1]), Min(tempCands[2], tempCands[3]));

nCands[candIdx].x = tempF.x;
nCands[candIdx].y = tempF.y;
nCands[candIdx].z = tempF.z;

However, if I comment out either the .x assignment or the .y assignment, the kernel runs with no problem. It also works if nCands[candIdx] is assigned constants using make_float3, and I can assign tempF.x and tempF.y to float variables.

This kernel did actually work under CUDA 1.0.

Edit: Definitely a register problem. I think I was sidetracked by the fact that the kernel had once been working. When I got my register use down by one (by using separate float arrays instead of one float3 array), my kernel started working again.
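Roughly what the change looked like (names simplified here; the real kernel is larger), in case anyone hits the same thing:

    // Before: one array of float3 temporaries
    //   float3 tempCands[4];
    //   nCands[candIdx] = Min(Min(tempCands[0], tempCands[1]),
    //                         Min(tempCands[2], tempCands[3]));

    // After: separate component arrays, which cost one register less here
    __global__ void candidateKernel(float *nCandsX, float *nCandsY,
                                    float *nCandsZ, int candIdx)
    {
        float tx[4], ty[4], tz[4];
        for (int i = 0; i < 4; ++i) {          // placeholder values for the sketch
            tx[i] = (float)i; ty[i] = (float)i; tz[i] = 4.0f - i;
        }

        int best = 0;                          // pick the entry with the smallest .z,
        for (int i = 1; i < 4; ++i)            // as Min() did on the float3s
            if (tz[i] < tz[best]) best = i;

        nCandsX[candIdx] = tx[best];
        nCandsY[candIdx] = ty[best];
        nCandsZ[candIdx] = tz[best];
    }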

Are you a registered developer? If you are, can you please file a bug through the registered developer site and attach the original kernel in the form where it worked on CUDA 1.0 but failed on CUDA 1.1?

We want to catch this sort of register allocation regression and fix it if possible.

Thanks,
Mark