I have been running my CUDA-based program for the last 6 months using CUDA 1.0 on a GeForce 8600M GT in my laptop under OpenSUSE 10.2. It worked perfectly until last week, when I changed platforms, and now I am having problems. My new platform is an Intel Quad Core 2.4GHz with OpenSUSE 10.2 and CUDA 1.1. My program now runs on a Tesla C870, and the pseudocode is basically as follows:
while (n < 1000)
    H_kernel<<<configurationH>>>(paramsH);
    E_kernel<<<configurationE>>>(paramsE);
    updateSource();
    n = n + 1;
end
The problem is that the while loop never completes all 1000 iterations. It stops after a seemingly random number of iterations, typically somewhere between 50 and 600, though sometimes more and sometimes fewer. I thought that it might be related to the 5-second running time restriction for kernels, but the C870 is not connected to an X display, and I am quite sure that the running time of these kernels is on the order of milliseconds.
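One way to narrow this down would be to check the CUDA error status after every launch, so that the first failing iteration and kernel are reported instead of the loop stopping silently. The following is only a minimal sketch under stated assumptions: the kernel bodies, their parameters, the launch configuration, and the helper names stepOk and runLoop are hypothetical placeholders, not the real code, and updateSource() is left out. It uses cudaThreadSynchronize(), which is the call available in CUDA 1.x; later toolkits provide cudaDeviceSynchronize() for the same purpose.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Placeholder kernels: the real H_kernel and E_kernel are not shown in this post.
__global__ void H_kernel(float *h, int len)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < len) h[i] += 1.0f;
}

__global__ void E_kernel(float *e, const float *h, int len)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < len) e[i] += 0.5f * h[i];
}

// Hypothetical helper: reports the first error seen after a launch.
// cudaGetLastError() catches launch-time failures; the synchronize call
// waits for the kernel to finish and returns any execution-time failure.
static bool stepOk(const char *what, int iter)
{
    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess)
        err = cudaThreadSynchronize();  // cudaDeviceSynchronize() on newer toolkits
    if (err != cudaSuccess) {
        fprintf(stderr, "iteration %d, %s: %s\n",
                iter, what, cudaGetErrorString(err));
        return false;
    }
    return true;
}

// Runs up to maxIter iterations and returns how many completed without error.
int runLoop(int maxIter)
{
    const int len = 1 << 16;                            // assumed problem size
    const int threads = 256;                            // assumed block size
    const int blocks = (len + threads - 1) / threads;
    const size_t bytes = len * sizeof(float);

    float *d_h = NULL;
    float *d_e = NULL;
    if (cudaMalloc((void **)&d_h, bytes) != cudaSuccess) return 0;
    if (cudaMalloc((void **)&d_e, bytes) != cudaSuccess) {
        cudaFree(d_h);
        return 0;
    }
    cudaMemset(d_h, 0, bytes);
    cudaMemset(d_e, 0, bytes);

    int n = 0;
    while (n < maxIter) {
        H_kernel<<<blocks, threads>>>(d_h, len);
        if (!stepOk("H_kernel", n)) break;
        E_kernel<<<blocks, threads>>>(d_e, d_h, len);
        if (!stepOk("E_kernel", n)) break;
        ++n;  // count only iterations in which both kernels succeeded
    }

    cudaFree(d_h);
    cudaFree(d_e);
    return n;
}

int main()
{
    int done = runLoop(1000);
    printf("completed %d of 1000 iterations\n", done);
    return done == 1000 ? 0 : 1;
}
```

Synchronizing after every launch slows the loop down, so this is meant for diagnosis only. If an error string shows up at the iteration where the loop stops, it should at least indicate whether the failure happens at launch time or during kernel execution.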
Does anyone know which changes from CUDA 1.0 to CUDA 1.1 could produce such behaviour?
Thank you in advance for your help.