I know that allocating a 2D array on the GPU so that it can be addressed in the form array[i][j] isn’t fast due to the double pointers, but how do you actually allocate an array in such a manner? Searches on the forums have given the answer “flatten it to a 1D array”, but my supervisor wants to see the difference in access time between a double-pointered array and the flattened version. Does anyone have a simple example of how to allocate and copy to/from a 2D array in this manner? Thanks in advance.
This is a pretty trivial example, and I wouldn’t recommend it for benchmarking, but it at least shows one way to do it:
#include <assert.h>
#include <stdio.h>
#include <cuda_runtime.h>
#ifndef gpuAssert
#include <stdio.h>
#include <stdlib.h>
/* Checks the integer status returned by a CUDA runtime call and aborts on
   failure.  The argument is captured in a local so it is evaluated exactly
   once — the original form re-evaluated `condition` inside fprintf(), which
   would re-run the CUDA call on the failure path.  Wrapped in do/while(0)
   so the macro behaves as a single statement after `if`/`else`. */
#define gpuAssert( condition ) do { \
    int gpuAssert_err_ = (condition); \
    if( gpuAssert_err_ != 0 ) { \
        fprintf( stderr, "\n FAILURE %d in %s, line %d\n", gpuAssert_err_, __FILE__, __LINE__ ); \
        exit( 1 ); \
    } \
} while(0)
#endif
/* Number of float elements per array; also the thread count of the
   single-block kernel launch in main(). */
#define _DSIZE (32)
/* Device-resident table of three device pointers (the "double pointer"
   access pattern the question asks about).  main() fills the slots via
   cudaMemcpyToSymbol before launching testkernel2. */
__device__ float * ad[3];
/* Element-wise accumulate through the device-global pointer table:
   d[i] += ad[0][i] + ad[1][i] + ad[2][i].
   No bounds guard — the host must launch exactly one thread per element
   of d (main() uses <<<1, _DSIZE>>> with _DSIZE-element buffers). */
__global__ void testkernel2(float *d)
{
    const unsigned int gid = threadIdx.x + blockDim.x * blockIdx.x;
    const float sum = ad[0][gid] + ad[1][gid] + ad[2][gid];
    d[gid] += sum;
}
/* Demo driver: allocates three input arrays plus one in/out array, wires
   the device-side pointer table `ad` to the device buffers, launches one
   block of _DSIZE threads, and prints the result (expected: i + 15.0). */
int main()
{
    float *a, *b, *c, *d;     /* host buffers */
    float *_a, *_b, *_c, *_d; /* device buffers */

    /* Allocate host buffers first, THEN assert.  Putting the malloc inside
       assert() — as the original did — silently removes the allocation when
       the program is compiled with -DNDEBUG. */
    a = (float *)malloc(_DSIZE * sizeof(float));
    b = (float *)malloc(_DSIZE * sizeof(float));
    c = (float *)malloc(_DSIZE * sizeof(float));
    d = (float *)malloc(_DSIZE * sizeof(float));
    assert(a != NULL && b != NULL && c != NULL && d != NULL);

    gpuAssert( cudaMalloc( (void**)&_a, _DSIZE * sizeof(float) ) );
    gpuAssert( cudaMalloc( (void**)&_b, _DSIZE * sizeof(float) ) );
    gpuAssert( cudaMalloc( (void**)&_c, _DSIZE * sizeof(float) ) );
    gpuAssert( cudaMalloc( (void**)&_d, _DSIZE * sizeof(float) ) );

    /* d holds the running result; a/b/c hold the addends. */
    for(int i = 0; i < _DSIZE; i++) {
        a[i] = 3.f;
        b[i] = 5.f;
        c[i] = 7.f;
        d[i] = (float)i;
    }

    gpuAssert( cudaMemcpy(_a, a, _DSIZE * sizeof(float), cudaMemcpyHostToDevice) );
    gpuAssert( cudaMemcpy(_b, b, _DSIZE * sizeof(float), cudaMemcpyHostToDevice) );
    gpuAssert( cudaMemcpy(_c, c, _DSIZE * sizeof(float), cudaMemcpyHostToDevice) );
    gpuAssert( cudaMemcpy(_d, d, _DSIZE * sizeof(float), cudaMemcpyHostToDevice) );

    /* Point the device-side table ad[0..2] at the three device buffers.
       The 4th argument of cudaMemcpyToSymbol is the byte offset into the
       symbol, so each pointer lands in its own slot. */
    gpuAssert( cudaMemcpyToSymbol( ad, &_a, sizeof(float *), sizeof(float *) * (size_t)0, cudaMemcpyHostToDevice) );
    gpuAssert( cudaMemcpyToSymbol( ad, &_b, sizeof(float *), sizeof(float *) * (size_t)1, cudaMemcpyHostToDevice) );
    gpuAssert( cudaMemcpyToSymbol( ad, &_c, sizeof(float *), sizeof(float *) * (size_t)2, cudaMemcpyHostToDevice) );

    /* One block of _DSIZE threads: one thread per element. */
    testkernel2 <<< 1, _DSIZE >>> (_d);
    gpuAssert( cudaGetLastError() );      /* catches launch-configuration errors */
    gpuAssert( cudaDeviceSynchronize() ); /* cudaThreadSynchronize() is deprecated */

    gpuAssert( cudaMemcpy(d, _d, _DSIZE * sizeof(float), cudaMemcpyDeviceToHost) );
    for(int i = 0; i < _DSIZE; i++) {
        fprintf(stdout, "%2d %6.1f\n", i, d[i]); /* expected: i + 15.0 */
    }

    cudaFree(_a);
    cudaFree(_b);
    cudaFree(_c);
    cudaFree(_d);
    free(a);
    free(b);
    free(c);
    free(d);

    return cudaDeviceReset(); /* cudaThreadExit() is deprecated */
}
You might be able to turn it into something that suits your needs.