Passing a dynamic array in C++ to CUDA kernel

owjian · December 2, 2013, 9:55pm

Hi all;

I’m having a problem in passing a dynamic array in C++ into the CUDA Kernel. Here is the snapshot of the program

char *pWordListArray[WORDLENGTH] ; // Dynamic allocated array with a very large number of WORDLENGTH
char *dev_pWordListArray; // Pointer to the GPU (device) memory

// Allocate memory
const int dev_pWordListArray_sizeof = (WORDLENGTH)sizeof(char);
cudaMalloc((void*)&(dev_pWordListArray), dev_pWordListArray_sizeof);

// Copy from Host to Device
cudaMemcpy(dev_pWordListArray, pWordListArray, dev_pWordListArray_sizeof, cudaMemcpyHostToDevice);

// Launch the kernel
insert <<< 1, 1>>>(dev_pWordListArray);

// Inside global function insert

global void insert(char *dev_pWordListArray) {

 for(int i=0; i < 5; i++) {
	 printf("%s\n", dev_pWordListArray[i]);
 }

When my program reach the function global, the program got killed unexpectedly and unable to print the string stored in the dev_pWordListArray.

This application has requested the Runtime to terminate it in an unusual way.
Please contact the application’s support team for more information.
Press any key to continue . . .

I would expect my program above to have the similar output like in Standard C++ as shown by the code below:

for(int i=0; i < 5; i++) {
printf(“%s\n”, pWordListArray[i]);
}

Ceq · December 3, 2013, 9:17am

Please check array types of your code:

pWordListArray is ‘char**’
pWordListArray[i] is ‘char*’
dev_pWordListArray is ‘char*’
dev_pWordListArray[i] is type ‘char’

Therefore, it looks like when you print ‘dev_pWordListArray[i]’ inside the kernel you are using a single char as a pointer parameter. Check also your cudaMemcpy because you are trying to copy an array of strings 'pWordListArray ’ (not their content) to a single string ‘dev_pWordListArray’. If your intention was to cudaMemcpy the pointer and not its content, the CPU memory has to be allocated as mapped memory to be accessible from the device.

owjian · December 3, 2013, 11:13am

my intention is to copy the content, in this case, what is the best way for me to solve this problem ??
Any input ?

owjian · December 3, 2013, 11:17am

Also, how can I allocate the CPU memory as mapped memory to be accessible from the device ?

owjian · December 4, 2013, 1:08pm

I made some modification to the code, but it still doesnt work …

char *pWordListArray [WORDLENGTH];
char *dev_pWordListArray[WORDLENGTH];

const int dev_pWordListArray_sizeof = (WORDLENGTH)sizeof(char);
cudaMalloc((void*)&(dev_pWordListArray), dev_pWordListArray_sizeof);

for(int i=0; i < 6; i++) {
cudaMemcpy(dev_pWordListArray[i], pWordListArray[i], (strlen(pWordListArray[i]) + 1), cudaMemcpyHostToDevice);
}// Launch the kernel
insert <<< 1, 1>>>(dev_pWordListArray);

// Inside global function insert

global void insert(char **dev_pWordListArray) {

for(int i=0; i < 6; i++) {
printf(“%s\n”, dev_pWordListArray[i]);
}

However;

It works on the standard C++ flow of passing a string of character in an array

// Call function printit
printit(pWordListArray);

// function printit

void printit(char **word) {

for(int i=0; i < 6; i++) {
	printf("%s, %d\n", word[i], (strlen(word[i]) + 1));
}

}

owjian · December 4, 2013, 1:19pm

Also,

by following the existing thread (cudaMemcpy seg fault Segmentation fault copying array - CUDA Programming and Performance - NVIDIA Developer Forums), by allocating memory as below :

for(int i=0; i < 6; i++) {
cudaMalloc((void**)&dev_pWordListArray[i], (strlen(pWordListArray[i]) + 1)*sizeof(char));
}

It still doesnt work and give me an error.

I appreciate that if someone could help in providing input to solve this problem …

Topic		Replies	Views
IS copying an array of character strings to device memory absolutely impossible? CUDA Programming and Performance	14	14914	March 24, 2011
How do I pass a double pointers array to the device? I'm getting cudaErrorIllegalAddress CUDA Programming and Performance	12	3740	January 17, 2024
how to create a dynamic array in the device function? CUDA Programming and Performance	4	14989	November 13, 2009
Is copying an array of character strings to device memory absolutely impossible? CUDA Programming and Performance	3	14815	June 26, 2010
Passing **char to kernel CUDA Programming and Performance	7	2278	February 26, 2018
passing an array to a kenel ? CUDA Programming and Performance	9	13731	June 10, 2009
copy string from host to string CUDA Programming and Performance	4	2924	August 1, 2008
Transfer string array to cuda kernel CUDA Programming and Performance	2	2024	October 7, 2013
Problems with creating an array of Cuda pointers CUDA Programming and Performance	7	13746	April 20, 2009
need some help with cudaMemcpy/cudamemcpy2D CUDA Programming and Performance	2	2052	June 9, 2010

Passing a dynamic array in C++ to CUDA kernel

Related topics