2D matrix through 1D array

qiangbo · July 25, 2011, 4:29am

Hi,

I am learning cuda and trying to implement a 2D matrix. However, my code doesn’t work so far and I couldn’t figure out the problem. So, I am posting my code here hoping to get some help.

The code is simple: it initiate a matrix and assign each element’s value according to its thread ID. The code compiles fine by hang the system whenever I run it.

Thanks!

#include "./common/book.h"

#define WIDTH	10

#define HEIGHT	10

typedef struct

{

	float *data;

	int *width;

} matrixStruct;

__device__ void SetMatElement(matrixStruct *mat, int x, int y, float val)

{

	int width;

	width = *(mat->width);

	*(mat->data+y*width+x) = val;

}

__global__ void SetMat(matrixStruct *mat)

{

	int x = threadIdx.x+blockIdx.x*blockDim.x;

	int y = threadIdx.y+blockIdx.y*blockDim.y;

	int offset = x+y*blockDim.x*gridDim.x;

	SetMatElement(mat, x, y, (float)offset);

}

int main(int argc, char *argv[])

{

	matrixStruct *mat;

	int width;

	float *data;

	data = (float*)malloc(10*10*sizeof(float));

	for(int i = 0; i < WIDTH*HEIGHT; i++)

	{

		data[i] = 0.0f;

	}

	width = WIDTH;

	HANDLE_ERROR(cudaMalloc((void**)&mat, sizeof(matrixStruct)));

	HANDLE_ERROR(cudaMalloc((void**)&(mat->data), WIDTH*HEIGHT*sizeof(float)));

	HANDLE_ERROR(cudaMemcpy(mat->data, data, WIDTH*HEIGHT*sizeof(float), cudaMemcpyHostToDevice));

	HANDLE_ERROR(cudaMalloc((void**)&(mat->width), sizeof(float)));

	HANDLE_ERROR(cudaMemcpy(mat->width, &width, sizeof(int), cudaMemcpyHostToDevice));

	dim3 grids(WIDTH/2,HEIGHT/2);

	dim3 threads(2,2);

	

	SetMat<<<grids,threads>>>(mat);

	HANDLE_ERROR(cudaMemcpy(&width, mat->width, sizeof(int), cudaMemcpyDeviceToHost));

	printf("width = %d\n", width);

	HANDLE_ERROR(cudaMemcpy(data, mat->data, WIDTH*HEIGHT*sizeof(float), cudaMemcpyDeviceToHost));

	for(int i = 0; i < WIDTH; i++)

	{

		printf("data[%d] = %f\n",i,data[i]);

	}

	HANDLE_ERROR(cudaFree(mat->data));

	HANDLE_ERROR(cudaFree(mat->width));

	HANDLE_ERROR(cudaFree(mat));

	free(data);

	system("pause");

	return(0);

}

MarkusM · July 25, 2011, 7:56am

You can’t access device memory from host code. Thus for mat malloced on the device mat->data can’t be modified in your main.

And before someone explains how to get this to work I’ll suggest a better alternative instead:

Use

typedef struct

{

        float *data;

        int width;

} matrixStruct;

and set it up via

matrixStruct mat;

mat.width = WIDTH;

HANDLE_ERROR(cudaMalloc((void**)&(mat.data), WIDTH*HEIGHT*sizeof(float)));

and pass it directly as a parameter instead of by it’s pointer

__global__ void SetMat(matrixStruct mat)

qiangbo · July 25, 2011, 3:02pm

Thanks! The solution actually works.

However, I still have the following confusion:

My understanding is that the variable ‘mat’ and ‘mat.width’ live in host memory. Can the function that runs on device access mat.width? Thanks.

Bo

MarkusM · July 26, 2011, 8:14am

Kernel parameters are automatically copied to the device. (Specifically to shared (on G80) or constant memory (on Fermi).)
Thus mat.width will also be a device copy of the host’s mat.width. (And of course any changes of the device copy won’t be reflected in the host original.)

qiangbo · July 26, 2011, 2:04pm

Thanks! That is insightful!

Topic		Replies	Views
Help with cuda 2d array CUDA Programming and Performance	6	7452	September 29, 2014
cudaMemcpy2D: What's wrong in this code? CUDA Programming and Performance	5	1436	December 21, 2011
2d matrix passing values help with this code CUDA Programming and Performance	4	3205	November 10, 2010
CUDA 2D Array Problem Need help to manipulate 2D arrays in CUDA CUDA Programming and Performance	4	26438	March 17, 2011
Padding in Pitch memory CUDA Programming and Performance	2	3963	October 16, 2009
2D matrix transfer and handling problem Help required CUDA Programming and Performance	7	1464	July 13, 2010
Problems with creating an array of Cuda pointers CUDA Programming and Performance	7	13609	April 20, 2009
3D matrix and 3D threads/blocks problem CUDA Programming and Performance	4	7198	June 16, 2011
structures, pointers, and cudaMalloc/Memcpy CUDA Programming and Performance	1	2808	August 3, 2011
How to convert a 2D Matrix to a 1D matrix in cuda device? CUDA Programming and Performance	10	1110	October 13, 2022

2D matrix through 1D array

Related topics