Urgent. Need help MultiGPUs bug

Linh_Ha · March 19, 2008, 8:46pm

I’m trying to understand how to use multiple GPUs. I wrote a small program that performs a 3D gradient computation on two Quadro 5600 GPUs. I give them the same input and expect them to produce the same output.

However, when I run the code, the test fails most of the time. Can anyone try it and explain why that happens?

Thank you. I really need to understand why it fails; without that I cannot continue my research. I’m stuck.
multiGPU.tar.gz (2 KB)

jimh · March 19, 2008, 9:50pm

The first problem I see is:

cudaArray* d_array3D = NULL;
float* d_data;
float* d_grad;

are all global and therefore shared between threads. You need to have separate variables for each thread, otherwise one thread will overwrite the address placed there by another thread.
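To illustrate the overwrite described above, here is a minimal host-only sketch in C++. It does not use CUDA at all: `std::malloc` stands in for `cudaMalloc`, and `g_d_data` and `sharedGlobalKeepsOnlyOneAddress` are hypothetical names chosen for this example. It only shows that when two host threads store their allocation in the same file-scope pointer, one of the two addresses is no longer reachable through that pointer afterwards.

```cpp
#include <atomic>
#include <cassert>
#include <cstdlib>
#include <thread>

// File-scope pointer shared by every host thread, like d_data in the post.
// std::atomic only keeps this demo free of undefined behaviour; it does not
// make the shared-global design correct.
static std::atomic<float*> g_d_data{nullptr};

// Each thread allocates its own buffer, then publishes it in the global.
// std::malloc is a stand-in for cudaMalloc (assumption: no GPU required).
static void allocIntoGlobal(float** mine) {
    *mine = static_cast<float*>(std::malloc(16 * sizeof(float)));
    g_d_data.store(*mine);
}

// Returns true when the two threads got different buffers but the global
// ends up holding only one of them.
bool sharedGlobalKeepsOnlyOneAddress() {
    float* a = nullptr;
    float* b = nullptr;
    std::thread t0(allocIntoGlobal, &a);
    std::thread t1(allocIntoGlobal, &b);
    t0.join();
    t1.join();
    assert(a != nullptr && b != nullptr);

    float* g = g_d_data.load();
    bool onlyOne = (a != b) && (g == a || g == b);

    std::free(a);
    std::free(b);
    g_d_data.store(nullptr);
    return onlyOne;
}
```

Which thread "wins" depends on scheduling, so in a real multi-GPU program a thread may end up using an address that was allocated in another thread's context.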

Linh_Ha · March 19, 2008, 11:30pm

I thought these variables would be created per context, but it seems I was wrong.

Thank you, you saved me.

By the way, I don’t have a separate variable for the cudaArray, but it still works. Do I really need a separate variable for the cudaArray?

seb · March 19, 2008, 11:42pm

You definitely need them for pointers to global memory. You don’t need them for pointers to constant memory (Symbols). Not sure about arrays.

Linh_Ha · March 19, 2008, 11:51pm

Do you know where in the Programming Guide I can find all this information?

seb · March 20, 2008, 1:15am

The programming guide states that constant variables have to be defined at file scope. The implications are obvious.

jimh · March 20, 2008, 5:21pm

The programming guide isn’t going to tell you that you need separate CPU variables to avoid race conditions between threads. This is a basic multi-threading concept. (This sounds somewhat confusing, but remember the pointers to device memory actually exist in the CPU’s memory space.)

If it helps, I’ve learned you need separate variables for each thread/GPU/CUDA context for everything except texture references and constant symbols.
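One way to follow this rule is to bundle the pointers into a struct and give each host thread its own instance. The sketch below is host-only C++ and makes several assumptions: `GpuState`, `fakeDeviceAlloc`, `worker`, `runWorkers`, and `releaseStates` are hypothetical names, and `std::malloc` stands in for `cudaMalloc` so that it runs without a GPU. The comments note where real code would call `cudaSetDevice` and `cudaMalloc`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <thread>
#include <vector>

// Hypothetical per-GPU state: one instance per host thread, replacing
// file-scope globals such as d_data and d_grad.
struct GpuState {
    int    device = -1;       // device index this thread would bind to
    float* d_data = nullptr;  // stand-in for the device input pointer
    float* d_grad = nullptr;  // stand-in for the device output pointer
};

// Stand-in for cudaMalloc so the sketch runs on the host only (assumption).
static float* fakeDeviceAlloc(std::size_t n) {
    return static_cast<float*>(std::malloc(n * sizeof(float)));
}

// Worker: reads and writes only the GpuState it was handed.
static void worker(GpuState* s, int device, std::size_t n) {
    s->device = device;              // real code: cudaSetDevice(device)
    s->d_data = fakeDeviceAlloc(n);  // real code: cudaMalloc for d_data
    s->d_grad = fakeDeviceAlloc(n);  // real code: cudaMalloc for d_grad
    assert(s->d_data != nullptr && s->d_grad != nullptr);
    for (std::size_t i = 0; i < n; ++i) {
        s->d_data[i] = static_cast<float>(i);  // same input on every "GPU"
        s->d_grad[i] = 2.0f * s->d_data[i];    // placeholder for the kernel
    }
}

// Starts one host thread per simulated GPU and returns the per-thread states.
std::vector<GpuState> runWorkers(int numGpus, std::size_t n) {
    std::vector<GpuState> states(numGpus);
    std::vector<std::thread> threads;
    for (int i = 0; i < numGpus; ++i)
        threads.emplace_back(worker, &states[i], i, n);
    for (auto& t : threads) t.join();
    return states;
}

// Frees the buffers owned by each state.
void releaseStates(std::vector<GpuState>& states) {
    for (auto& s : states) {
        std::free(s.d_data);
        std::free(s.d_grad);
        s.d_data = s.d_grad = nullptr;
    }
}
```

Calling `runWorkers(2, 8)` gives two independent states, so neither thread can overwrite the other's pointers; call `releaseStates` on the result when done.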
