Hello everyone,
I’m using Cuda Fortran to accelerate an existing application written in Fortran. I would like to use the API function cudaMemcpy3DAsync
to asynchronously copy part of a 3D array. I see in the Cuda Fortran programming guide that this function uses a data structure called cudaMemcpy3DParms
, unlike cudaMemcpy2D
/cudaMemcpy2DAsync
which are more straightforward. However, I haven’t been able to find any documentation on this data structure in Cuda Fortran nor any examples of anyone using cudaMemcpy3DAsync
in Cuda Fortran; the page in the API documentation is in C/C++ only. Am I missing something in the docs about how the API data structures work in Fortran? And does anyone have any examples/snippets on how to use this function/data structure in Fortran, or suggestions for another way to asynchronously copy part of a 3D array?
Thanks!
Cooper