Hello guyz,
Now, I am struggling with several cuda usage issue but I’ve solved some of them well. However, I am using a while loop inside a 3D (x,y,z) kernel and whenever this loop is added into it, i see that my BlockIdx.y is limited up to 5 instead of 44. Is there a limitation of while loop usage inside a cuda kernel?
gridDim.x = 30,
gridDim.x = 44,
gridDim.x = 2,
BlockDim.x = 1,
BlockDim.y = 1,
BlockDim.z = 32,