How can I turn an already allocated host memory buffer into page-locked memory using the CUDA driver API? Is there an equivalent of the CUDA runtime's cudaHostRegister?
Maybe cuMemHostRegister()?
CUresult cuMemHostRegister(void* p, size_t bytesize, unsigned int Flags)
“Registers an existing host memory range for use by CUDA.”
Page-locks the memory range specified by p and bytesize and maps it for the device(s) as specified by Flags. This memory range also is added to the same tracking mechanism as cuMemHostAlloc to automatically accelerate calls to functions such as cuMemcpyHtoD(). […]