How to Optimize Data Transfers in CUDA Fortran

Hi, what is the syntax for calling cudaMemcpy3DAsync? I can't work out from the documentation how to use the cudaMemcpy3DParms derived type.
Thanks in advance.