I get the error below when executing a kernel with 256 threads per block and more than 302 blocks; with fewer than 302 blocks it runs without the error. The ptxas info for the kernel is:
ptxas info : Used 32 registers, 80+80 bytes smem, 48 bytes cmem[1]
cutilCheckMsg cudaThreadSynchronize error: Kernel execution failed in file <template.cu>, line 193 : unknown error
The kernel is 30 lines long and calls 13 device functions totaling approximately 286 lines.
Edit:
Sorry, I forgot to mention my hardware: an Elitegroup GeForce 9800 GT (compute capability 1.1), a dual-core 3.2 GHz CPU, and 2 GB of RAM.
I hope that helps.
Thanks in advance.
I tried the 181.20, 182.08, and 182.50 drivers.
I also tried on a laptop with an 8600M GS and the 181.22 beta drivers.
Same error; any help is really appreciated.