Data region for 2 GPUs

Gfzhang · July 14, 2016, 10:03pm

Hi,

I have accelerated my Fortran code using OpenACC. For K20 with 4GB memory, I can simulate ~ 1 million particles.
My problem now is how to simulate more particles using two K20.

In my code, I have a big matrix, the matrix is calculated before OpenACC parallelization, and the matrix remains constant till the end of the simulation.

I copyin the matrix to device and used data region to store it in device.

To use two GPUs, I plan to transfer first half of the matrix the first GPU and last half matrix to the second GPU. No data exchange is needed between two GPUs. I have read several tutorials on multiple GPUs, but I didn’t find examples to show how to use data region for multi-GPUs. Does anyone know how to do this ?

Thanks,
GZ

MatColgrove · July 15, 2016, 5:55pm

Hi GZ,

A data region cannot span across multiple accelerators.

For multi-GPU programming you need to use a host-side parallel model like MPI or OpenMP, to first do the host domain decomposition, assign the host thread to an accelerator, then enter a data region to do the mapping between each thread’s host data with the mirrored device data.

Mat