OpenACC and OpenMP data interoperability

MatColgrove · June 25, 2021, 6:13pm

Hi iomagkanaris,

I don’t believe we claim interoperability between OpenACC and OpenMP Target to GPUs, but the two models do share some of runtime, in particular data management, so this aspect should work ok. Though we haven’t thoroughly tested it so there may be issues we’re unaware. Normally I’d recommend sticking to one model or the other, but it sounds like you’re wanting to port an existing code from OpenACC to OpenMP and do it incrementally.

Is there really only one copy done from the Host to the Device?

It appears so. The models share the same runtime data management so the device copy of “x” would be visible when the compiler does the present check upon entering the compute region.

Where is the Unified Memory CPU page fault coming from?

Sorry, no idea. I don’t see it when I profile the code, but I’m using Nsight-Systems which doesn’t have the print-gpu-trace option. Possibly an artifact of the profiling?

Is again only one HtoD copy of the x array really?

Since x and y both point to the same device memory, the present check will pass in the same device pointer for both. The copies only occur when you call “acc_copyin” and “exit data copyout”

Does OpenMP figure out automatically that the pointer x is associated to the x_dev pointer and is already present in the GPU memory using the OpenACC present table?

They share the same present table so should work as expected in this case.

Have I understood correctly that this is the proper usage and benefit of omp_target_associate_ptr , meaning that it’s used to associate another pointer to the same data existing on the GPU? It also seems to me that this is not needed for the x array pointer. Am I right?

I wouldn’t necessarily recommend mapping two host pointers to the same device address in the same kernel, as you do here, since it has the potential to introduce bugs, but it is a use case. The typical use case is to re-use device memory, i.e. create some device memory, map it to some host pointer, use it in a kernel, then map it to a different host pointer for another kernel thus re-using the device memory.

No, it’s not needed for “x” since this is already implicitly mapped as part of the acc_copyin call.

-Mat

Topic		Replies	Views
OpenMP + OpenACC model Legacy PGI Compilers	3	2642	September 18, 2018
Call to collective mpi subroutine with openacc host_data directive Legacy PGI Compilers	8	1021	March 26, 2021
Pointer and OpenACC in Fortran Legacy PGI Compilers	1	3828	November 16, 2012
Data copies of the same size vary greatly in different program times nvc, nvc++ and nvfortran	2	332	July 7, 2023
Combine OpenACC and Unified Memory for Productivity and Performance Technical Blog	0	339	August 25, 2020
OPENACC changes value of array Legacy PGI Compilers	12	9712	May 17, 2016
Deviceptr vs present OpenACC directives nvc, nvc++ and nvfortran	10	646	March 12, 2024
Different GPU memory usage between OpenACC and OpenMP Offload nvc, nvc++ and nvfortran	10	896	April 28, 2023
Questions about omp offload and memory transfer nvc, nvc++ and nvfortran	13	1468	October 15, 2021
Direct GPU-to-GPU data transfer with OpenACC+managed+MPI nvc, nvc++ and nvfortran	4	1192	April 12, 2022

OpenACC and OpenMP data interoperability

Related topics