What do I do wrong in my simple "sum two 3x3 matrices" program?

rabotavladoz · April 13, 2019, 6:59pm

I wrote this program:

#include "cuda_runtime.h"
#include "device_launch_parameters.h"

#include <stdio.h>
const int ARRAY_SIZE = 3;

__global__ void soberiMatricaKernel(float c[ARRAY_SIZE][ARRAY_SIZE], float a[ARRAY_SIZE][ARRAY_SIZE], float b[ARRAY_SIZE][ARRAY_SIZE])
{
	int i = threadIdx.x;
	int j = threadIdx.y;

	c[i][j] = a[i][j] + b[i][j];
}

int main()
{
	
	float d_a[ARRAY_SIZE][ARRAY_SIZE] = { {1, 2, 3}, {4, 5, 6}, {7, 8, 9} };
	float d_b[ARRAY_SIZE][ARRAY_SIZE] = { {9, 8, 7}, {6, 5, 4}, {3, 2, 1} };
	float d_c[ARRAY_SIZE][ARRAY_SIZE];

	cudaMalloc((void**)&d_a, ARRAY_SIZE * ARRAY_SIZE * sizeof(float));
	cudaMalloc((void**)&d_b, ARRAY_SIZE * ARRAY_SIZE * sizeof(float));
	cudaMalloc((void**)&d_c, ARRAY_SIZE * ARRAY_SIZE * sizeof(float));

	soberiMatricaKernel <<<1, ARRAY_SIZE * ARRAY_SIZE >>> (d_c, d_a, d_b);

	for (int i = 0; i < ARRAY_SIZE; i += 1)
	{
		for (int j = 0; j < ARRAY_SIZE; j += 1)
		{
			printf("%d\n", d_c[i][j]);
		}
	}

	cudaFree(d_a);
	cudaFree(d_b);
	cudaFree(d_c);

	return 0;
}

Although it compiles with no error, the execution doesn’t get inside my soberiMatricaKernel() CUDA function. Can somebody spot what do I do wrong?

Should I use cudaMemcpy() before soberiMatricaKernel() ? How it will look like?

Robert_Crovella · April 13, 2019, 8:50pm

You’ve made several errors. Study the cuda vectorAdd sample code. Follow a similar sequence.

Topic		Replies	Views
I got the wrong result from matrix summation CUDA Programming and Performance	2	573	June 1, 2011
Matrix Addition CUDA Programming and Performance	1	1187	June 14, 2012
CUDA Kernel seems not to be excecuted CUDA Programming and Performance	1	802	July 11, 2009
My first program it doesn't behave as expected CUDA Programming and Performance	2	2557	July 19, 2009
First CUDA program... looks good, executing wrong? CUDA Programming and Performance	3	1066	June 24, 2009
Matrix multiplcation peoblem CUDA Programming and Performance	2	1170	July 9, 2010
Newbie: Super simple first CUDA program what's wrong? CUDA Programming and Performance	4	3601	October 2, 2009
[Newbie] Getting "unspecified launch failure" errors CUDA Programming and Performance	2	640	September 7, 2014
Matrix Addition CUDA Programming and Performance	2	2113	June 14, 2012
CUDA kernel from matlab CUDA kernel for matrix operations from matlab CUDA Programming and Performance	1	1070	May 12, 2012

What do I do wrong in my simple "sum two 3x3 matrices" program?

Related topics