Should I use the release compile mode in configuration solution and use compute_10,sm_10? But when I switch to this mode, the results are wrong. Although the speed seems to be increased.
Thanks.
Should I use the release compile mode in configuration solution and use compute_10,sm_10? But when I switch to this mode, the results are wrong. Although the speed seems to be increased.
Thanks.
There is a post on the NVIDIA blog on this topic.
The CUDA Compiler Driver NVCC doc provides even more detail.