How to create a portable cuda .exe file?

Should I use the release compile mode in configuration solution and use compute_10,sm_10? But when I switch to this mode, the results are wrong. Although the speed seems to be increased.

Thanks.

There is a post on the NVIDIA blog on this topic.

The CUDA Compiler Driver NVCC doc provides even more detail.