Suppose I have a big program coding in C++, I know the main cost of this program is a big for loop, I want to compute the for loop on GPU, but don’t change other source files. How can I do it?
I use visual 2005 with CUDA integrated in. Is there anyway I can realize my goal? Or If I have to develop the whole project in CUDA and rewrite all the source file?