I have a CUDA kernel that breaks with the error:
too many resources requested for launch
when I run it with “/=” in the loop in that kernel. If I change the “/=” to “+=” then the kernel runs fine.
Please see the attached file for a test case.
test_code2.cu (5.1 KB)
The code tries to launch 160 x 6 x 2 threads (xthreads x ythreads x blocks). If I reduce that to 128 (but no more) x 6 x 2 threads it runs fine.
Compile it with:
/usr/local/cuda-10.2/bin/nvcc -gencode arch=compute_53,code=sm_53 test_code2.cu
Thanks. I’ve been looking at this for 4 days and cannot see why it is not working.
This thread was started over here
https://forums.developer.nvidia.com/t/weird-cuda-problem-changing-to-in-a-loop-causes-a-variable-not-to-be-set/188871 but could not continue as the expert did not have a Jetson Nano in their possession.