dynamically allocate array of structs

yamsha · September 28, 2008, 10:15pm

Given something along the lines of:

typedef struct
{
    int ival;
    float fval;
    char *str;
} Thing;

Thing *arrayOfThings;

How would I correctly allocate memory for ‘arrayOfThings’ at runtime once two arguments are specified:
int arrLen - size of ‘arrayOfThings’
int maxStrLen - maximum length of variable ‘str’ of structure Thing
I assume I would use cudaMalloc for ‘arrayOfThings’:

cudaMalloc( (void **)&arrayOfThings, arrLen*sizeof(Thing) );

But then how would I get a pointer to the ‘str’ member of every element of ‘arrayOfThings’ so that I could allocate ‘maxStrLen’ bytes for each ‘str’? Should I use cudaGetSymbolAddress()?

Also, as an expansion to this question, how would I allocate ‘arrayOfThings’ if it was a two-dimensional array: “Thing **arrayOfThings”?

-Andrey

tmurray · September 28, 2008, 10:30pm

Allocating pointer-based data structures on the GPU is an exercise in not-very-fun coding. The outline is something like:

allocate an array of Things on the host for marshalling to the GPU
fill in the other two members (ival and fval) from whatever structures you have on the CPU
allocate each string on the device
set the pointer str in each host-side Thing to the respective string you allocated on the device
allocate an array of Things on the device
copy the host array of Things to the device array
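The steps above could be sketched roughly as follows. This is only a minimal sketch, not a definitive implementation: the helper names uploadThings, freeThings and firstChars are made up for illustration, each device string is assumed to get maxStrLen + 1 bytes so there is room for a terminator, and error checking of the cuda* return values is left out for brevity.

```cuda
#include <cstdlib>
#include <cstring>
#include <cuda_runtime.h>

typedef struct
{
    int ival;
    float fval;
    char *str;
} Thing;

// Hypothetical helper: builds a device copy of hostThings[0..arrLen).
// Assumption: every device string buffer holds maxStrLen chars plus a '\0'.
Thing *uploadThings(const Thing *hostThings, int arrLen, int maxStrLen)
{
    // Host-side staging array used only for marshalling.
    Thing *staging = (Thing *)malloc(arrLen * sizeof(Thing));

    for (int i = 0; i < arrLen; ++i) {
        // Plain members are copied as they are.
        staging[i].ival = hostThings[i].ival;
        staging[i].fval = hostThings[i].fval;

        // One device allocation per string, zero-filled so it stays terminated.
        char *devStr = NULL;
        cudaMalloc((void **)&devStr, maxStrLen + 1);
        cudaMemset(devStr, 0, maxStrLen + 1);

        size_t len = strlen(hostThings[i].str);
        if (len > (size_t)maxStrLen) len = (size_t)maxStrLen;  // truncate
        cudaMemcpy(devStr, hostThings[i].str, len, cudaMemcpyHostToDevice);

        // The staging struct stores a device address; never dereference it on the host.
        staging[i].str = devStr;
    }

    // Device array of Things, then one copy of the staged structs.
    Thing *devThings = NULL;
    cudaMalloc((void **)&devThings, arrLen * sizeof(Thing));
    cudaMemcpy(devThings, staging, arrLen * sizeof(Thing), cudaMemcpyHostToDevice);

    free(staging);
    return devThings;
}

// Hypothetical helper: the string pointers live in device memory, so the
// structs are copied back first to find out what to free.
void freeThings(Thing *devThings, int arrLen)
{
    Thing *staging = (Thing *)malloc(arrLen * sizeof(Thing));
    cudaMemcpy(staging, devThings, arrLen * sizeof(Thing), cudaMemcpyDeviceToHost);
    for (int i = 0; i < arrLen; ++i) cudaFree(staging[i].str);
    cudaFree(devThings);
    free(staging);
}

// Small kernel that only shows the str pointers are usable on the device.
__global__ void firstChars(const Thing *things, int arrLen, char *out)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < arrLen) out[i] = things[i].str[0];
}
```

Note that this costs arrLen + 1 device allocations, and teardown needs the same pointer bookkeeping in reverse.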

If you have a maximum length for your strings (and it isn’t horribly inefficient, say 256 bytes when 99% of strings are 8 bytes or shorter), just using a fixed-length array inside the struct is a much easier alternative. That takes a single cudaMalloc instead of one per element plus one for the array itself.
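A rough sketch of the fixed-length alternative is below. The names FixedThing, MAX_STR_LEN and uploadFixedThings are invented for illustration, and the bound of 32 bytes is an arbitrary assumption. The catch is that the bound has to be a compile-time constant here, unlike the runtime maxStrLen in the original question.

```cuda
#include <cstring>
#include <cuda_runtime.h>

#define MAX_STR_LEN 32  // assumed compile-time bound, terminator included

typedef struct
{
    int ival;
    float fval;
    char str[MAX_STR_LEN];  // inline storage instead of a pointer
} FixedThing;

// Hypothetical helper: one allocation and one copy for the whole array.
// No per-element pointer fix-up is needed because the struct holds no pointers.
// Error checking is omitted for brevity.
FixedThing *uploadFixedThings(const FixedThing *hostThings, int arrLen)
{
    FixedThing *devThings = NULL;
    cudaMalloc((void **)&devThings, arrLen * sizeof(FixedThing));
    cudaMemcpy(devThings, hostThings, arrLen * sizeof(FixedThing),
               cudaMemcpyHostToDevice);
    return devThings;
}
```

Because the struct is self-contained, copying results back is a single cudaMemcpy in the other direction and cleanup is a single cudaFree.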

Doing a two-dimensional array is pretty much the same thing–lots of host-side marshalling. A one-dimensional array that you index into via your grid and block dimensions is much nicer (both for perf and for readability).
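As an illustration of the one-dimensional layout, here is a minimal sketch that stores a width x height grid of Things in one flat allocation and indexes it from the grid and block dimensions. The names fillGrid and allocGrid, the row-major layout and the 16x16 block size are assumptions made for the example; error checking is omitted.

```cuda
#include <cuda_runtime.h>

typedef struct
{
    int ival;
    float fval;
    char *str;
} Thing;

// Each thread handles one (row, col) cell of the logical 2D array.
__global__ void fillGrid(Thing *things, int width, int height)
{
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    if (col < width && row < height) {
        int idx = row * width + col;   // row-major flattening
        things[idx].ival = idx;
        things[idx].fval = (float)row;
        things[idx].str = NULL;        // strings are not used in this sketch
    }
}

// Hypothetical helper: a single cudaMalloc covers the whole 2D array.
Thing *allocGrid(int width, int height)
{
    Thing *devThings = NULL;
    cudaMalloc((void **)&devThings, (size_t)width * height * sizeof(Thing));

    dim3 block(16, 16);
    dim3 grid((width + block.x - 1) / block.x,
              (height + block.y - 1) / block.y);
    fillGrid<<<grid, block>>>(devThings, width, height);
    cudaDeviceSynchronize();
    return devThings;
}
```

With this layout there is no array of row pointers to marshal, so the host side stays at one allocation and one free.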

However, there’s one thing I’m not sure of, now that I think about it. What would happen if you stored the array of pointers to arrays of Things in constant memory but stored the Things themselves in global memory? Has anyone tried something like this? I’m curious as to what the perf would be like.

yamsha · September 29, 2008, 2:15pm

Thank you for the reply. That makes sense. I’ll give it a shot.

Also, I’ve wondered about this before: should I use cudaMallocArray()? And if not, when should that function be used?

tmurray · September 29, 2008, 3:12pm

cudaMallocArray is used for allocating a cudaArray, which is a type of its own, not an ordinary array of some type.

yamsha · September 29, 2008, 4:34pm

I understand that. What I meant was: what is the advantage of using cudaArrays, and why would someone use C-style arrays (float arr) rather than cudaArrays?

jack · September 29, 2008, 4:41pm

I believe the cudaArray type does something with reading from texture memory, which may be faster than normal memory (but it is also read-only).

tmurray · September 29, 2008, 5:12pm

cudaArrays are used with texture memory, yes.
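For reference, a minimal sketch of a cudaArray being read through a texture is below. It uses the texture object API, which was added to CUDA well after this thread was written, so it is not what anyone here would have typed in 2008. The names readTexture and copyThroughTexture are invented for illustration, point filtering with unnormalized coordinates is an assumption, and error checking is omitted.

```cuda
#include <cstring>
#include <cuda_runtime.h>

// Reads every texel through the texture unit and writes it to linear memory.
__global__ void readTexture(cudaTextureObject_t tex, int width, int height,
                            float *out)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    if (x < width && y < height) {
        // +0.5f addresses the texel centre with unnormalized coordinates.
        out[y * width + x] = tex2D<float>(tex, x + 0.5f, y + 0.5f);
    }
}

// Hypothetical helper: host data -> cudaArray -> texture fetch -> host data.
void copyThroughTexture(const float *hostIn, float *hostOut,
                        int width, int height)
{
    // cudaMallocArray allocates the opaque cudaArray storage.
    cudaChannelFormatDesc desc = cudaCreateChannelDesc<float>();
    cudaArray_t arr = NULL;
    cudaMallocArray(&arr, &desc, width, height);
    cudaMemcpy2DToArray(arr, 0, 0, hostIn, width * sizeof(float),
                        width * sizeof(float), height, cudaMemcpyHostToDevice);

    // Describe the array as the resource behind the texture.
    cudaResourceDesc resDesc;
    memset(&resDesc, 0, sizeof(resDesc));
    resDesc.resType = cudaResourceTypeArray;
    resDesc.res.array.array = arr;

    cudaTextureDesc texDesc;
    memset(&texDesc, 0, sizeof(texDesc));
    texDesc.addressMode[0] = cudaAddressModeClamp;
    texDesc.addressMode[1] = cudaAddressModeClamp;
    texDesc.filterMode = cudaFilterModePoint;       // no interpolation
    texDesc.readMode = cudaReadModeElementType;     // return raw floats
    texDesc.normalizedCoords = 0;

    cudaTextureObject_t tex = 0;
    cudaCreateTextureObject(&tex, &resDesc, &texDesc, NULL);

    float *devOut = NULL;
    cudaMalloc((void **)&devOut, (size_t)width * height * sizeof(float));

    dim3 block(16, 16);
    dim3 grid((width + block.x - 1) / block.x,
              (height + block.y - 1) / block.y);
    readTexture<<<grid, block>>>(tex, width, height, devOut);
    cudaMemcpy(hostOut, devOut, (size_t)width * height * sizeof(float),
               cudaMemcpyDeviceToHost);

    cudaDestroyTextureObject(tex);
    cudaFreeArray(arr);
    cudaFree(devOut);
}
```

The kernel never sees a raw pointer to the cudaArray; it only fetches through the texture, which is why a cudaArray is not a drop-in replacement for an ordinary array of structs.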
