Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 1

Originally published at: Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 1 | NVIDIA Developer Blog

Most CUDA developers are familiar with the cudaMalloc and cudaFree API functions for allocating GPU-accessible memory. However, these API functions have a long-standing limitation: they aren’t stream ordered. In this post, we introduce new API functions, cudaMallocAsync and cudaFreeAsync, that enable memory allocation and deallocation to be stream-ordered operations. In part…
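To make the idea concrete, here is a minimal sketch of stream-ordered allocation, assuming CUDA 11.2 or later and a device that supports memory pools. The kernel `scaleKernel` and the helper `scaleOnStream` are hypothetical names introduced for illustration and do not come from the original post.

```cuda
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: multiplies each element by `factor`.
__global__ void scaleKernel(float* data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// Hypothetical helper: scales `host` in place on the GPU, using one stream
// for the allocation, copies, kernel, and free. Returns true on success.
bool scaleOnStream(std::vector<float>& host, float factor) {
    const int n = static_cast<int>(host.size());
    if (n == 0) return true;
    const size_t bytes = host.size() * sizeof(float);

    cudaStream_t stream = nullptr;
    float* dev = nullptr;
    bool ok = cudaStreamCreate(&stream) == cudaSuccess;

    // The allocation is enqueued on `stream`; work submitted to the same
    // stream afterwards may use `dev` without extra synchronization.
    ok = ok && cudaMallocAsync(reinterpret_cast<void**>(&dev), bytes, stream) == cudaSuccess;
    ok = ok && cudaMemcpyAsync(dev, host.data(), bytes,
                               cudaMemcpyHostToDevice, stream) == cudaSuccess;
    if (ok) {
        const int threads = 256;
        const int blocks = (n + threads - 1) / threads;
        scaleKernel<<<blocks, threads, 0, stream>>>(dev, n, factor);
        ok = cudaGetLastError() == cudaSuccess;
    }
    ok = ok && cudaMemcpyAsync(host.data(), dev, bytes,
                               cudaMemcpyDeviceToHost, stream) == cudaSuccess;

    // The free is also stream ordered: it takes effect only after the
    // preceding work on `stream` has completed.
    if (dev) ok = (cudaFreeAsync(dev, stream) == cudaSuccess) && ok;

    if (stream) {
        // Wait for the copy back to finish before the caller reads `host`.
        ok = (cudaStreamSynchronize(stream) == cudaSuccess) && ok;
        cudaStreamDestroy(stream);
    }
    return ok;
}
```

Calling `scaleOnStream(values, 2.0f)` on a `std::vector<float>` should double each element. The point of the sketch is that the allocation and the free sit in the same stream as the work that uses the buffer, so the only explicit synchronization is the final one that protects the host-side read.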