I am getting a kernel call timed out error while using a Geforce 8800 but no error while using a tesla
I’ve checked that i’m not exceeding the register limit and the kernel execution is rather short. What could be the possible cause??
I have a large array of data in global memory. I have a for loop surrounding the kernel, iteratively calling it and operating on different parts of the data. Is it something to do with accessing too much global memory or something?
what could be the possible cause?