How to Overlap Data Transfers in CUDA C/C++

Originally published at: https://developer.nvidia.com/blog/how-overlap-data-transfers-cuda-cc/

In our last CUDA C/C++ post we discussed how to transfer data efficiently between the host and device. In this post, we discuss how to overlap data transfers with computation on the host, computation on the device, and in some cases other data transfers between the host and device. Achieving overlap between data transfers and other…