Using Shared Memory in CUDA Fortran

Originally published at: https://developer.nvidia.com/blog/using-shared-memory-cuda-fortran/

CUDA Fortran for Scientists and Engineers shows how high-performance application developers can leverage the power of GPUs using Fortran. In the previous post, I looked at how global memory accesses by a group of threads can be coalesced into a single transaction, and how alignment and stride affect coalescing for various generations of CUDA hardware. For…