cudaHostRegister and cudaHostRegisterPortable, cudaHostRegisterDefault

Ken_Domino · January 13, 2012, 1:39pm

I’m trying to understand the function [i]cudaHostRegister/i.

In the reference manual, cudaHostRegisterPortable can be used for the third parameter of [i]cudaHostRegister/i (see related discussion here). But, I can’t figure out how to get cudaHostRegisterPortable pointers to work. If I pass an aligned host pointer from [i]malloc/i, then register the pointer with cudaHostRegister(host_ptr, …, cudaHostRegisterPortable), that “works”, but cudaHostGetDevicePointer(…, host_ptr, 0) fails. If I try to pass the host pointer directly to kernel code after registering using cudaHostRegister(host_ptr, …, cudaHostRegisterPortable), that fails, as I would expect. If I first allocate a device pointer by cudaMalloc(&dev_ptr, …), then pass the device pointer to cudaHostRegister(dev_ptr, …, cudaHostRegisterPortable), that fails, as I would expect. An example using cudaHostRegisterMapped, based on the simpleZeroCopy.cu in the CUDA GPU Toolkit examples, works fine, but that offers no help in understanding how cudaHostRegisterPortable pointers are allocated and used. And, the previous discussion does not show an example, nor can I find any examples online or in the GPU Toolkit samples.

The manual says: “cudaHostRegisterPortable: The memory returned by this call will be considered as pinned memory by all CUDA contexts, not just the one that performed the allocation.” This doesn’t quite make sense because there are no call-by-reference parameters to the function. Further, the symbol cudaHostRegisterDefault is defined in the CUDA header file driver_types.h, but how it is used is not documented for [i]cudaHostRegister/i. How does it work?

Ken

Ken_Domino · January 13, 2012, 1:39pm

I’m trying to understand the function [i]cudaHostRegister/i.

In the reference manual, cudaHostRegisterPortable can be used for the third parameter of [i]cudaHostRegister/i (see related discussion here). But, I can’t figure out how to get cudaHostRegisterPortable pointers to work. If I pass an aligned host pointer from [i]malloc/i, then register the pointer with cudaHostRegister(host_ptr, …, cudaHostRegisterPortable), that “works”, but cudaHostGetDevicePointer(…, host_ptr, 0) fails. If I try to pass the host pointer directly to kernel code after registering using cudaHostRegister(host_ptr, …, cudaHostRegisterPortable), that fails, as I would expect. If I first allocate a device pointer by cudaMalloc(&dev_ptr, …), then pass the device pointer to cudaHostRegister(dev_ptr, …, cudaHostRegisterPortable), that fails, as I would expect. An example using cudaHostRegisterMapped, based on the simpleZeroCopy.cu in the CUDA GPU Toolkit examples, works fine, but that offers no help in understanding how cudaHostRegisterPortable pointers are allocated and used. And, the previous discussion does not show an example, nor can I find any examples online or in the GPU Toolkit samples.

The manual says: “cudaHostRegisterPortable: The memory returned by this call will be considered as pinned memory by all CUDA contexts, not just the one that performed the allocation.” This doesn’t quite make sense because there are no call-by-reference parameters to the function. Further, the symbol cudaHostRegisterDefault is defined in the CUDA header file driver_types.h, but how it is used is not documented for [i]cudaHostRegister/i. How does it work?

Ken

tera · January 13, 2012, 4:08pm

What happens if you use [font=“Courier New”]cudaHostRegisterMapped|cudaHostRegisterPortable[/font] for flags? You need [font=“Courier New”]cudaHostRegisterMapped[/font] in order to be able to map the memory into the device address space.

tera · January 13, 2012, 4:08pm

What happens if you use [font=“Courier New”]cudaHostRegisterMapped|cudaHostRegisterPortable[/font] for flags? You need [font=“Courier New”]cudaHostRegisterMapped[/font] in order to be able to map the memory into the device address space.

Ken_Domino · January 13, 2012, 5:07pm

OK, thanks.

cudaHostRegister(…, …, cudaHostRegisterMapped|cudaHostRegisterPortable) seems to work. I will have to test this further. It would be nice to say in the doc that cudaHostRegisterPortable must also be used with cudaHostRegisterMapped.

What does cudaHostRegisterDefault do? Why is it even defined?

Ken

Ken_Domino · January 13, 2012, 5:07pm

OK, thanks.

cudaHostRegister(…, …, cudaHostRegisterMapped|cudaHostRegisterPortable) seems to work. I will have to test this further. It would be nice to say in the doc that cudaHostRegisterPortable must also be used with cudaHostRegisterMapped.

What does cudaHostRegisterDefault do? Why is it even defined?

Ken

tmurray · January 13, 2012, 7:13pm

is this in a non-UVA environment?

tmurray · January 13, 2012, 7:13pm

is this in a non-UVA environment?

Topic		Replies	Views
[UVA + HostPinnedMem + HostRegistered] Clarify device properties params CUDA Programming and Performance cuda	3	787	June 16, 2022
cudaHostRegister overload for const void * CUDA Programming and Performance	2	898	April 22, 2018
cudaMallocHost() vs cudaHostAlloc(cudaHostAllocPortable) CUDA Programming and Performance	1	4915	August 22, 2013
does anybody have experience on cudaHostRegister zero copy memory CUDA Programming and Performance	8	14558	May 21, 2011
cudaHostRegister on multiple threads CUDA Programming and Performance	15	557	June 18, 2024
Strange behaviour of cudaHostRegister CUDA Programming and Performance	16	1083	July 11, 2024
cudaHostRegister alternative for already pinned memory CUDA Programming and Performance cuda	0	390	September 5, 2022
Does Jetson TK1 support cudaHostRegister ? Jetson TK1	0	1242	May 26, 2014
cudaHostRegister and Fortran Legacy PGI Compilers	4	8303	February 8, 2013
cudaHostRegister breaks data CUDA Programming and Performance	0	467	April 8, 2019

cudaHostRegister and cudaHostRegisterPortable, cudaHostRegisterDefault

Related topics