Results wrong in GTX 280 but ok in GeForce 8600 GT CUDA 2.1 (64 bits)

I have a program that produces the correct results in a GeForce 8600 GT, but produces wrong results in a GTX 280 (with random behaviour).

Similar problems may be found here [url=“http://forums.nvidia.com/index.php?showtopic=95813&pid=537419&mode=threaded&start=#entry537419”]http://forums.nvidia.com/index.php?showtop...rt=#entry537419[/url]

People are blaming sync problems, so I tryed to place __syncthread() after each instruction of the kernels (to make sure it wasn’t a sync problem) - that still didn’t help.

Any ideas on what the problem may be?

thanks.

I already found the problem in my kernel (it was a syncronization problem).