My code works well in cuda 5.0 debug mode(with -G option).
But when I update cuda to 5.5, the error in debug mode is “too many resources requested for launch”
The code can run correctly when I reduce the size of block form 512 to 256.
Obviously, the gpu kernel use too many resigters.
I try to compile the code without -G option and set the block size to 512. I find the error disappears.
Does the “-G” option in CUDA 5.5 will limit the number of registers used in app.
My env is K20m + SLES11SP2 + CUDA 5.5