I have a kernel that i want to call, but for some reason it doesnt get called if i dim my thread config to greater than 15x15
This kernel works fine on a execution configuration of 10x10 threads and 10x10 blocks:
dim3 dimBlock(10,10);
dim3 dimGrid(10,10);
The problem happens when i try to use more threads:
dim3 dimBlock(30,30);
dim3 dimGrid(10,10);
I can dim as many blocks as i want, i just cant dim the threads past 15
The funny thing is that i can configure the kernel a different way, using only the x dimensions, e.g. i can do <<<300,300>>> with no worries, but i have to use the dim3 variables because my algorithm depends on two dimensional threads and two dimensional blocks.
Can anyone think as to why associating more threads to a kernel would make it cease executing?
Ive spent hours on this and have gotten no-where.
Your help would be much appreciated