Explicitly changing context when using cuMemcpyDtoH

randomx80 · January 30, 2018, 8:35pm

I’m using the CUDA Driver API to run a multi-GPU application. Currently, I have 2 contexts, one for each device. I’m doing a simple transfer from the first GPU to the second GPU, and then I transfer data from the second GPU to the host. When I allocate memory for each GPU (via cuMemAlloc), I make sure to call cuCtxPushCurrent/cuCtxPopCurrent to get the right context. When I schedule the transfer between the first GPU to the second one, I call the memcopy in between the corresponding cuCtxPushCurrent/cuCtxPopCurrent calls. However, to transfer data from the second GPU, do I need to use cuCtxPushCurrent/cuCtxPopCurrent to get the right context? I tried removing them and the application still worked. Are memcopies independent of the context of the involved buffers?

Here is the pseudo code of what I’m talking about?

//Transfer from GPU1 to GPU2
cuCtxPushCurrent(cudaContexts[0]);
// Do work and transfer data
cuCtxPopCurrent(cudaContexts[0]);

//Transfer from GPU2 to host
cuCtxPushCurrent(cudaContexts[1]);
cuMemcpyDtoH(hostBuffer, devBuffer, bufSize)
cuCtxPopCurrent(cudaContexts[1]);

If I remove cuCtxPushCurrent/cuCtxPopCurrent with cudaContexts[1], the program still works. Is this because of UVA?

Thanks!

Topic		Replies	Views
How can I pass data across two contexts cuMemcpyPeer across contexts CUDA Programming and Performance	1	3178	June 28, 2011
Threaded CUDA application Using threads in application that utilizes CUDA api CUDA Programming and Performance	3	697	February 8, 2011
Contexts: Performance question overhead by switching the context CUDA Programming and Performance	3	2791	February 6, 2009
CUDA context and multi-threading CUDA Programming and Performance	0	2691	June 17, 2009
Why do we need cuCtxPushCurrent? CUDA Programming and Performance	8	5499	January 18, 2018
CUDA inter-process data transfer and cuMemcpyPeer seg fault CUDA Programming and Performance	4	1419	July 27, 2011
How to copy data between 2 contexts inside GPU? CUDA Programming and Performance	3	1140	November 12, 2019
What's the use of driver API "cuMemcpyDtoDAsync()"? CUDA Programming and Performance	1	1457	April 15, 2015
multi-gpu and cudamemcpyasync CUDA Programming and Performance	12	10859	April 15, 2010
Using multiple GPU devices from a single host thread CUDA Programming and Performance	1	855	November 7, 2010

Explicitly changing context when using cuMemcpyDtoH

Related topics