I have a Tesla C2050 that usually appears to work fine: my kernels run without errors, cuda-memcheck reports no errors, and the results are correct.
However, every now and then the card produces incorrect results and/or reports kernel execution times that are nearly zero, all without any errors reported by CUDA. Once this starts, it can take a while before the card behaves normally again.
The GPU temperature is 74 °C. The card is installed in an Alienware Area-51 with a 1 kW PSU, an i7-980X, OpenSuSE 11.2, and a GTX-480 for display (the computer is rated for dual GTX-480s).
When this happens, the GTX-480 still seems to be working fine.
Does anyone know what might be going on? Do I have a bad Tesla?