A simple threading question Do memory copies have to occur in the device thread?

TL1 · March 23, 2009, 3:57pm

Here’s a simple question:

When you invoke kernels you have to be in the right host thread. But what about when doing memory copy operations, like cudaMemcpy2D and cudaMemcpy2DToArray? Is it necessary to make sure these get called in the same thread as the one that created the CUDA resources?

Thanks!

eyalhir74 · March 23, 2009, 4:39pm

Yes :)

Fugl · March 24, 2009, 1:38pm

This is also true for pinned (page-locked) host memory allocated with cudaMallocHost(). You have to call cudaMallocHost() and the cudaMemCpy*() functions from the same thread.

Ojiisan · March 26, 2009, 9:40am

or use context magic to get the context to your current thread.

Fugl · March 26, 2009, 1:36pm

Good point, though that would require him to use the Driver API exclusively.

I’m pretty much in the same situation and I’m stuck with a codebase written for the Runtime API. What I do is

to forward memory allocation requests and frees to the GPU thread via a message queue.