Hello to everyone,
Me and my colleagues have a simulation program that requires much process power and that must be run each time for each unit of the whole simulation. A complete simulation has about a million units, i.e, the program runs about a million times.
Recently we thought about moving this project to the GPU world, beginning with CUDA. But as the program is really big, we were wondering if (not as a final solution, of course) there is the possibility of sending the code to the GPU for execution in every core. The code is in Fortran and takes about 2h in an average CPU to run each time.
That being said, I would like to know if anyone is available to help me find some references that can give me information about the possibility (or impossibility) of doing this.
I hope I explained everything right.