How to pass two flags to cudaHostAlloc()?

snowlxm · June 17, 2009, 4:57pm

I want the page-locked memory to be portable and mapped in my multithread program.
so, can i do like this cudaHostAlloc((void **)&address,size,cudaHostAllocPortable|cudaHostAllocMapped) ?
but when I do the cudaHostAlloc in the main thread, and do cudaHostGetDevicePointer() in the children thread, I am failed.

by the way, i used a GTX295 with 2 GPU
Who know how to do that?
Thanks.

Geka · June 17, 2009, 5:40pm

CUDA contexts are attached to a given thread, so you probably cannot do that in two different threads.

snowlxm · June 17, 2009, 6:01pm

but the section 3.2.5.1 in the manual said that the portable page-locked memory can be shared and used by all the thread.

Because I will process many big data set which bigger than 512Mb or 1Gb, it is impossible to copy them to device or create some copies of them for each thread.

MisterAnderson42 · June 17, 2009, 6:03pm

But the use of the cudaHostAllocPortable flag is to make the memory pinned in all contexts :)

Interestingly, this mention about error messages in the CUDA reference manual seems to indicate that you should be able to map memory allocated in a different thread:

(emphasis added)

I unfortunately don’t have access to a system with both a G200 board and CUDA 2.2, so I can’t test this out for myself.

navier-stokes · June 17, 2009, 6:17pm

Section 3.2.5.3 of the Programming guide states

So do not forget to call the

cudaSetDeviceFlags(cudaDeviceMapHost)

function!

Regards

Navier

snowlxm · June 17, 2009, 6:53pm

you are right.

I only call cudaSetDeviceFlags in the children thread , but not in the main thread.

After i added it in the main thread, it work well.

Thank you .

Another questions is that the manual mentioned that the use of page-locked memory will influence the performance of the computer.

I know little about the page-locked memory how to work, I only feel if have enough memory, it doesn’t matter to use the page-locked memory.

Am I right?

if I have 8Gb or 16Gb memory, is there influence when i allocate about 4Gb page-locked memory?

Topic		Replies	Views
Questions for multiple GPUs CUDA Programming and Performance	8	7162	April 20, 2009
Data transfer between multiple GPUs How to do it fast ? CUDA Programming and Performance	4	2541	January 21, 2010
memory allocated by cannot be accessed by parent function pinned memory, mapped host memory, no-zer CUDA Programming and Performance	2	4767	May 20, 2011
cudaMalloc and threads "invalid device pointer" error CUDA Programming and Performance	4	5446	June 26, 2007
cudaMallocHost() vs cudaHostAlloc(cudaHostAllocPortable) CUDA Programming and Performance	1	4795	August 22, 2013
CUDAFreeHost() not clearing allocated host memory, when multiple devices are used. CUDA Programming and Performance	2	1180	November 13, 2019
cudaHostAlloc and thread safety problems with pinned, portable memory CUDA Programming and Performance	2	1811	April 8, 2011
Mapped memory across multiple GPUs CUDA Programming and Performance	3	8738	October 28, 2010
Pinned memory size problem CUDA Programming and Performance	4	3895	December 11, 2009
MultiGPUs newbie question Data transformation problem CUDA Programming and Performance	12	5152	March 18, 2008

How to pass two flags to cudaHostAlloc()?

Related topics