Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 2

Originally published at: Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 2 | NVIDIA Developer Blog

In part 1 of this series, we introduced new API functions, cudaMallocAsync and cudaFreeAsync, that enable memory allocation and deallocation to be stream-ordered operations. In this post, we highlight the benefits of this new capability by sharing some big data benchmark results and provide a code migration guide for modifying your existing applications. We also…