I am very new to CUDA programming. I have an opensource code, DualSPHysics, which can run only on single-GPU. DualSPHysics is written in C++ and the CPU version can run only on single-node (OpenMP) and the GPU version only on single-GPU (CUDA). But, the node I use for computing has two GPU cards (K20’s).
I want to convert this DualSPHysics code to run on the two GPU’s. Please enlighten me on what should I do to make this happen.
The basic ideea is to divide the work in 2 equal parts (if the cards are identical) have them perform the calcuations, then collect the data. The biggest problem is the communication between the cards.