Matrix Addition

wolfshark · June 14, 2012, 2:42am

Apologies. (I posted in wrong section before)

Hi, I am very fresh in learning CUDA and I need some help adding matrices. So far i have this as my adding function:

#define N 3
const dim3 threadsPerBlock(N, N);
const dim3 numBlocks(N / threadsPerBlock.x, N / threadsPerBlock.y);

global void compute(int a[N][N], int b[N][N], int c[N][N])
{
int i = blockIdx.x * blockDim.x + threadIdx.x;
int j = blockIdx.y * blockDim.y + threadIdx.y;
if (i < N && j < N)
c[i][j] = a[i][j] + b[i][j];
}

It is very similar to the NVIDIA programming guide example.
From there i have:

int main(void)
{
int a[N][N], b[N][N], c[N][N];
int dev_a[N][N], dev_b[N][N], dev_c[N][N];

cudaMalloc( (void**)&dev_a, (NN)sizeof(int) );
cudaMalloc( (void*)&dev_b, (NN)sizeof(int) );
cudaMalloc( (void*)&dev_c, (N*N)*sizeof(int) );

//THEN I FILL THE MATRICES UP WITH RANDOM NUMBERS

and finish off with this:

cudaMemcpy(dev_a, a, (N*N)sizeof(int), cudaMemcpyHostToDevice);
cudaMemcpy(dev_b, b, (NN)*sizeof(int), cudaMemcpyHostToDevice);

compute<<<numBlocks,threadsPerBlock>>>(dev_a, dev_b,dev_c);

cudaMemcpy(c,dev_c, (N*N)*sizeof(int), cudaMemcpyDeviceToHost);

The addition is not happening. Can anyone see where i went wrong? Thanks.

Gilles_C · June 14, 2012, 7:35am

Hi,
Your kernel’s parameters are not to be define as 2D arrays, but rather as 1D pointers. To access the data at index (i,j), use “a[j*N+i]”.
In addition, I’m not too sure if your host’s 2D automatic arrays can be used directly for cudaMemcpy. I wouldn’t do like this, but my knowledge of the C standard dark corners is not sufficient to say if this is actually wrong, or just super counter-intuitive.

Topic		Replies	Views
Matrix Addition CUDA Programming and Performance	2	1716	June 14, 2012
CUDA Matrix Addition - 1D Memory, threads and blocks in 1D Matrix Addition in CUDA C using global m CUDA Programming and Performance	0	1069	November 26, 2011
basic matrix addition CUDA Programming and Performance	3	1859	March 9, 2012
2matrix addition CUDA Programming and Performance	3	895	April 28, 2010
matrixAddition a simple cuda program, not working. please help CUDA Programming and Performance	1	3569	August 18, 2009
Matrix Calculations/Manipulations CUDA Programming and Performance	1	405	March 20, 2017
CUDA Matrix Addition - 1D Memory, threads and blocks in 1D Matrix Addition in CUDA C using Texture a CUDA Programming and Performance	1	2343	November 26, 2011
2D matrix addition question CUDA Programming and Performance	7	12186	May 17, 2009
cudaMallocPitch CUDA Programming and Performance	5	4488	October 5, 2010
matMul in Guide CUDA Programming and Performance	1	2601	February 1, 2009

Matrix Addition

Related topics