cudaMapAddress() has been removed from the API because it is slow and uses the bus very inefficiently. Memory copying (as opposed to memory mapping) is the best way to read from or write to device memory.
To answer your first point below: cudaMapAddress() didn’t enable writing to main memory from a kernel; you cannot call cudaMapAddress(), or any other function from the host runtime component for that matter, from a kernel (see section 4.5 of the programming guide).
Although I cannot call cudaMapAddress() from a kernel, it seems that if a kernel writes to an address that has been mapped to some address in CPU memory, that write might be redirected to the CPU memory.
Here is another question: does CUDA not support writing to CPU memory from a kernel? This ability seems very important for combining the CPU and GPU, especially when using the GPU as a parallel co-processor.
As Cyril pointed out above, you have the cudaMemcpy functions instead. Indeed, I prefer the copy approach to the mapping approach: with a copy you have fewer concurrent memory accesses to worry about, i.e. you are sure not to have race conditions with CPU threads accessing memory that is currently mapped. Otherwise, the synchronization between the CUDA threads would need an extension to also lock CPU access, which would be a very slow implementation.
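To make the copy approach concrete, here is a minimal sketch of the pattern: allocate device memory, run a kernel, and pull the results back with an explicit cudaMemcpy. The kernel and sizes are hypothetical placeholders, not anything from the thread above.

```cuda
// Sketch: explicit device-to-host copying instead of memory mapping.
// The kernel, array size, and variable names here are illustrative.
#include <cuda_runtime.h>
#include <stdio.h>
#include <stdlib.h>

__global__ void compute(float *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = 2.0f * i;   // placeholder computation
}

int main(void) {
    const int n = 1024;
    const size_t bytes = n * sizeof(float);

    float *h_out = (float *)malloc(bytes);
    float *d_out;
    cudaMalloc((void **)&d_out, bytes);

    compute<<<(n + 255) / 256, 256>>>(d_out, n);

    // Explicit copy back to the host. No CPU thread can race on d_out,
    // because the only host-side access to it goes through cudaMemcpy.
    cudaMemcpy(h_out, d_out, bytes, cudaMemcpyDeviceToHost);

    printf("h_out[10] = %f\n", h_out[10]);

    cudaFree(d_out);
    free(h_out);
    return 0;
}
```

Because the host only ever sees a private copy of the data, there is nothing for other CPU threads to synchronize against while the kernel runs.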
But cudaMemcpy is inappropriate in my situation. I use the GPU as a co-processor to the CPU.
Initially, the CPU assigns a computational task to the GPU; later, the CPU wants to check whether the GPU has obtained some interesting results. I hope there is a very cheap way to query the status of the data on the GPU. In OpenGL it would be an occlusion query, which is a pipelined operation; but in CUDA I can only use cudaMemcpy, which, I presume, will flush the GPU pipeline and slow down the CPU.
I think you misunderstand how CUDA works. There is no such thing as a display context, so there is no asynchronous command submission (currently). In other words, your CUDA kernel call will block until it has completed.
So if you want parallel work to be done on CPU and GPU, you need a separate (CPU) thread for CUDA anyway. It can download a small status array at the right moment and place it in CPU shared memory for the other CPU threads to read. So this should be perfectly parallel.
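The dedicated-thread pattern above can be sketched roughly as follows, assuming pthreads on the host. The kernel name (step_kernel), the status flag, and the loop structure are all hypothetical; the point is that one CPU thread owns every CUDA call, and after each (blocking) kernel launch it copies a small status word back and publishes it for the other CPU threads.

```cuda
// Sketch of a dedicated CUDA thread publishing GPU status to other
// CPU threads. step_kernel and g_status are hypothetical names.
#include <cuda_runtime.h>
#include <pthread.h>

volatile int     g_status = 0;   // status visible to the other CPU threads
pthread_mutex_t  g_lock   = PTHREAD_MUTEX_INITIALIZER;

// Hypothetical kernel that writes a nonzero flag when it finds a result.
__global__ void step_kernel(int *d_status) {
    // ... do a slice of work; set *d_status = 1 on success ...
}

void *cuda_thread(void *arg) {
    int *d_status;
    cudaMalloc((void **)&d_status, sizeof(int));
    cudaMemset(d_status, 0, sizeof(int));

    for (;;) {
        // Kernel call blocks until complete (per the behaviour
        // described above), so this thread alone absorbs the wait.
        step_kernel<<<1, 64>>>(d_status);

        int h_status;
        cudaMemcpy(&h_status, d_status, sizeof(int),
                   cudaMemcpyDeviceToHost);

        pthread_mutex_lock(&g_lock);
        g_status = h_status;          // publish for other CPU threads
        pthread_mutex_unlock(&g_lock);

        if (h_status != 0) break;     // interesting result found
    }
    cudaFree(d_status);
    return NULL;
}
```

The other CPU threads only ever read g_status under the mutex; they never touch the CUDA runtime, so the CPU-side work stays fully parallel with the GPU.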
Be warned, however, if you run CUDA and rendering on the same card; see other discussions in this forum for why.