I am trying to divide two floating type numbers simply by / (division) in kernel. But the result always returns the MAX_NUM of floating data type.
In debugging mode, I set breakpoint on the line contains /(division), but program does not stop at breakpoint. No debugging msg feedback too.
My cuda computing ability is only 1.1. Does anyone meet similar problem? Any input is welcome,
Could you show a self-contained small example where floating-point division does not work? Note that with compute capability 1.1 you are limited to operations on “float”, as “double” requires compute capability >= 1.3 . What CUDA version are you using?
I guess I should have expressed myself more clearly. By “self-contained example code”, I meant code that one can compile with nvcc without needing any additional files and then run the resulting executable, also without needing any additional files.
Please note that CUDA 4.0 requires a driver r270 or higher. If indeed you are really running with r260 drivers as indicated, CUDA 4.0 should fail to even initialize. Please make sure your program checks the return code of every API call and every kernel launch.