Hi everybody and happy new year!
I am working on a particle simulation code. It is based on the discrete element method. I’ve finished with programming in CUDA and tested my program in debug mode. Now, I changed the mode into the release and did the same calculations. However, the same calculation goes wrong in release mode. In the release mode, the kernel can not detect the wall and particles and hence memory accesses go out of range. Execution fails.
Now my main questions are:
- is there any differences between computational accuracy of GPU in debug or release mode? I dont have the same problem with CPU since my previous work was discrete element method on CPU.
- do I have to consider any special situation when I want to use release mode, i.e. refusing nested ifs or etc.!!!?
I use cuda 5.0 integrated into Visual Studio 2010. The GPU is geforce GTX 660 Ti.