Data setup for a multi-GPU program: can't it be done outside the threads?

I have a program that works on a single GPU and I'm porting it to a multi-GPU version, using the multi-GPU project in the SDK as a reference. What I want to do is set up a bunch of global memory, textures, etc., store the handles in structures, then spawn threads to run the kernels. Inside each thread the structure would be used to access the per-GPU variables, upload data, run the kernel, and download the results.

The problem is that when I do things like cudaMalloc outside of the thread, I get garbage out of my kernel. When I do the cudaMalloc inside the thread, the kernel works fine. When I call cudaMalloc before spawning the thread and print the pointer returned by cudaMalloc both outside and inside the thread, the addresses are the same, yet something still breaks. Any ideas? :huh:

CUDA resources created by different host (CPU) threads cannot be shared (Programming Guide, section 4.5.1.1).

Paulius

I'd say this is due to CUDA contexts. Each host thread has its own context, and each context has its own memory space. If you allocate or transfer memory in your main thread, that memory belongs to the main thread's context; the newly created thread runs in a different context and therefore cannot access it. The pointer value may look identical when you print it, but it is only meaningful within the context that created it.
My guess is that there is no way around this, but maybe someone can prove me wrong.
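To illustrate the pattern that does work under this one-context-per-thread model: keep only host-side information (the device index and input data) in the shared structure, and do every CUDA call, including cudaMalloc, inside the worker thread that will use the memory. This is just a sketch; the `GpuTask` struct, `worker` function, and `scale` kernel are made-up names for illustration, not from the original program.

```cuda
#include <cuda_runtime.h>
#include <pthread.h>
#include <stddef.h>

/* Shared struct holds only host-side info; device pointers are
   created inside the thread that will use them, so they belong
   to that thread's context. */
typedef struct {
    int    device;    /* which GPU this thread drives */
    float *hostData;  /* input buffer prepared by the main thread */
    size_t n;         /* element count */
} GpuTask;

/* Trivial example kernel: double every element in place. */
__global__ void scale(float *d, size_t n)
{
    size_t i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) d[i] *= 2.0f;
}

static void *worker(void *arg)
{
    GpuTask *t = (GpuTask *)arg;
    float   *dData;
    size_t   bytes = t->n * sizeof(float);

    /* All CUDA calls happen in this thread, so allocation,
       kernel launch, and copies all share one context. */
    cudaSetDevice(t->device);
    cudaMalloc((void **)&dData, bytes);
    cudaMemcpy(dData, t->hostData, bytes, cudaMemcpyHostToDevice);
    scale<<<(unsigned)((t->n + 255) / 256), 256>>>(dData, t->n);
    cudaMemcpy(t->hostData, dData, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dData);
    return NULL;
}
```

The main thread would fill one `GpuTask` per GPU and spawn one pthread per task; anything allocated with cudaMalloc before `pthread_create` would land in the main thread's context and be unusable inside `worker`.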

Thanks, good to know.