Could you guys please bring back cudaMapAddress? a few words to cuda developer

ymzhang · March 14, 2007, 7:06pm

The function “cudaMapAddress” is available in prior version of CUDA, but missing in 0.8BETA version. This function is very important for my task, could you please consider including it in CUDA?

Hers is my situation. As a co-processor to CPU, I wish GPU would bring as little overhead as possible to CPU.

While using cudacpymem will slow down CPU a lot, the cudaMapAddress is a good function for this. To my understanding, it establishes a link between specific part of CPU and GPU memory. if I write to that part of GPU memory, this writing operation is redirected to CPU memory. This is really great, since CPU can only read that part of memory to decide whether GPU has found something important. Since GPU won’t alway find useful information, there won’t be a lot of memory writing from GPU to CPU.

Sorry to bring this topic since similar one is here:
[url=“The Official NVIDIA Forums | NVIDIA”]http://forums.nvidia.com/index.php?showtopic=30349[/url]

paulius · March 14, 2007, 11:11pm

Are you trying to have CPU read some data written by the GPU while the CUDA kernel is still running? If so, I’m not sure you can do that reliably since there’s no way for the CPU to know where the GPU is in its computation.

If you are trying to find out whether GPU has come up with “useful” data after the GPU kernel has completed, why not try this:

have the CUDA kernel write a “flag” to global memory indicating whether the computation lead to “useful” data.
use memCopy to read the flag, if it indicates that computation was “useful,” then do a large memCopy of the data computed by the GPU.

You have to realize that whatever method you use to get the data to the CPU (whether its mapping or copying), the data still has to travel across the bus from the device to the host memory. If that host memory area is cached, then CPU has to get involved at one point or another, to avoid outdated cache lines. So, I’m not sure you would really get improved performance with mapping. Is there a slowdown in your app when you change it to use memcopies?

Paulius

ymzhang · March 23, 2007, 11:44pm

I test the speed of cudamemcopy and cudamapaddress in the previous version. cudamemcopy is much slower than cudamapaddress.

Topic		Replies	Views
any way to write to CPU memory in kernel? why function cudaMapAddress is missing ? CUDA Programming and Performance	5	8120	March 12, 2007
mapping vs. copying CPU memory CUDA Programming and Performance	4	7249	April 20, 2011
mmap()ing device memory CUDA Programming and Performance	1	1269	September 23, 2009
disk access CUDA Programming and Performance	1	1267	August 10, 2009
Copying data in a specific address device memory CUDA Programming and Performance	2	3434	November 26, 2009
Mapping PCIe memory in user-space Mapping video memory in user-space to avoid DMA transfers CUDA Programming and Performance	3	16399	December 14, 2009
list of addresses GPU vorious CUDA Programming and Performance	3	1895	July 7, 2009
gpu memory address in c CUDA Programming and Performance	5	3675	December 13, 2008
How does Memcpy work ? CUDA Programming and Performance	1	6858	December 8, 2007
Implicit mapped memory access is there any way to access device mem in implicitly? CUDA Programming and Performance	1	7856	October 16, 2009

Could you guys please bring back cudaMapAddress? a few words to cuda developer

Related topics