Problem with kernel launch too many paremeters, trying to use a struct

ShingleServ · July 1, 2011, 7:11pm

Hello,

I’m trying to convert a C++ program over to use CUDA for a research project.

I have all the code compiling and building but when running the program the kernel is never launched and exits with a “Invalid device function”

However since the kernel doesn’t launch i can’t debug what is wrong. So after digging around I found that there is a limit to the size of parameters you can pass into a kernel call. currently I have 46 double parameters for input (all are required for the program) yet the max limit is 256 byte maximum. I’m trying to implement a struct to pass the parameters since that was the option I found to be suggested by by others online.

However after implementing the struct I am still getting the same error message. I’m not sure if I have created the struct correctly for CUDA use though. I was wondering if someone could help with my struct and getting it to work the way I need it to.

I have been using the example at http://forums.nvidia.com/index.php?showtopic=106708 as a model of my own code. The code for it that I am using now is:

//structure

typedef struct inputStruct{

	double *dev_beta;											//temp K, beta=1/kT (kcal/mole)

	int *dev_nTotal;											//number of iterations

	double *dev_xVn, *dev_yVn, *dev_zVn;						//xyz values for n'th iteration

	double *dev_thetaXn, *dev_thetaYn, *dev_thetaZn;			//xyz angle values

	double *dev_dXn, *dev_dYn, *dev_dZn;						//dx, dy, dz values for n'th iteration

	double *dev_dthetaXn, *dev_dthetaYn, *dev_dthetaZn;			//angle values for n'th iteration

	double *dev_dXmax, *dev_dYmax, *dev_dZmax;					//max diff between xyz values for molecule 1 and 2

	double *dev_dthetaXmax, *dev_dthetaYmax, *dev_dthetaZmax;	//max diff between angles for molecule 1 and 2

	double *dev_dispMC, *dev_dispTheta, *dev_maxDistance;			

	//double *dev_results[blocks * threads];

	//electrostatic 

	double *dev_I1x, *dev_I1y, *dev_I1z;						//Isomer 1 xyz values 1

	double *dev_SP1x, *dev_SP1y, *dev_SP1z;						//Isomer 1 xyz values 2

	double *dev_I2x, *dev_I2y, *dev_I2z;						//Isomer 2 xyz values 1

	double *dev_SP2x, *dev_SP2y, *dev_SP2z;						//Isomer 2 xyz values 2

	double *dev_QI1, *dev_QSP1, *dev_QI2, *dev_QSP2;			//charges: Isomer 1 - 1, Isomer 1 - 2, Isomer 2 - 1, Isomer 2 - 2

	//vandervals

	double *dev_rStarI1, *dev_rStarI2;							//rStar for both isomers

	double *dev_epsilonI1, *dev_epsilonI2;						//epsilon for both isomers

	double *dev_seed;

	curandState *state;

	curandState *dev_state;

};

int main(){

	cudaSetDevice(0);

	check_cuda_errors(__FILE__, __LINE__);

	inputStruct host_struct;

	inputStruct *dev_struct;

	maxDistance = 20.01;						//max allowed distance between analyte and SP

	cudaMalloc((void**)&host_struct.dev_maxDistance, size * sizeof(double));

	cudaMemcpy(host_struct.dev_maxDistance, &maxDistance, size * sizeof(double), cudaMemcpyHostToDevice);

//other cudaMalloc/cudaMemcpy calls

	cudaMalloc((void**)&dev_struct, sizeof(inputStruct));

	cudaMemcpy(dev_struct, &host_struct, sizeof(inputStruct), cudaMemcpyHostToDevice);

	kernel<<<blocks,threads>>>(dev_struct);

//kernel

__global__ void kernel(inputStruct *dev_struct){

    //do stuff

}

I’m referencing the variables in the struct by calling *dev_struct->dev_maxDistance

It compiles and builds but still exits with the same error.

Any suggestions? I really need the help.

Thank you!

hyqneuron · July 2, 2011, 6:11am

You shouldn’t use pointers in your struct. Convert them all to double instead of double*

Skybuck · July 3, 2011, 6:39am

He could use pointers in his structure… but then he would need to initialize them inside the cuda kernel.

Perhaps it’s even possible to initialize the pointers on the host side if it’s possible to do “pointer math” on cuda pointers somehow… (?).

ShingleServ · July 5, 2011, 3:27pm

I’ll give this a try, removing the pointers.
Thanks for the help!

ShingleServ · July 11, 2011, 4:05pm

Hi again,

I got my program running and now I need to focus on input that is not hard coded.

I setup some code to read the data through .csv files an example is

#define col		10

#define row		100

ifstream Isomer1;

ifstream Isomer2;

ifstream SP;

ifstream Shift;

string value;

string first = "~";

double ioTemp;

double Isomer1_arr[row][col];

double Isomer2_arr[row][col];

double sp_arr[row][col];

double Shift_arr[row][col];

//opens isomer1 and reads data into array

void isomer1(){

	cout << "Opening and reading Isomer1.csv" << endl << endl;

	Isomer1.open("Isomer1.csv", ifstream::in);

	//input isomer 1 file into isomer1_array

	if (Isomer1.is_open())

  {

		while (Isomer1.good()){

			for(int y = 0; y < col; y++){

				for(int x = 0; x < row; x++){

					getline(Isomer1, value, ',');

					//check for end of row

					if(value.compare(first)==1){

						y = col;

						x = row;

					}

					

					else

						cout << value;

						ioTemp = atof(value.c_str());

						Isomer1_arr[x][y] = ioTemp;

				}

			}

		}

			for(int j = 0; j < col; j++){

				for(int i = 0; i < row; i++){

					cout << Isomer1_arr[i][j] << "\t";

				}

		}

		Isomer1.close();

	}

	else cout << "Cannot open file. Error num:" << errno << endl;

	

	cout << endl << "\n Press any button to exit...";

	_getch();

}

I have 4 statements like that to read in the values I need to pass into the kernel.

I need to use a struct to pass the values in since I’m short on variable space in the launch.

I’m having trouble with figuring out how to create the struct, fill it with values from .csv files, and load it into the kernel with those values.

Mostly how do I load the values in on the host then send it to the kernel?

All examples I can find send an empty array or one filled with static values.

Whats the best way to go about doing this?

struct input{

	//i/o arrays, all data is stored in these

	double Isomer1_arr[row][col];

	double Isomer2_arr[row][col];

	double sp_arr[row][col];

	double Shift_arr[row][col];

};

The problem is when I try to fill the arrays it can’t find the arrays since they are declared in the struct.

But if I change the variables to input. it gives me an error of “a nonstatic member reference must be relative to a specific object”

Topic		Replies	Views
Parameters passed to a CUDA kernel exceed 256 bytes. CUDA Programming and Performance	13	7014	September 21, 2009
Formal parameter space overflowed kernel launch error CUDA Programming and Performance	18	18511	May 2, 2010
Struct in CUDA can i use this struct in CUDA CUDA Programming and Performance	15	89518	June 26, 2009
Pointer as formal parameter in kernel call CUDA Programming and Performance	8	6413	March 13, 2009
How to pass large arguments in CUDA kernels Kernel arguments CUDA Programming and Performance	10	19102	December 18, 2009
Parameter Passing to Device CUDA Programming and Performance	6	4911	June 11, 2008
Kernel Formal parameter space overflowed CUDA Programming and Performance	4	2598	July 31, 2009
struct of arrays for parameters between host and device CUDA Programming and Performance	6	5118	September 21, 2009
Complex data structures CUDA Programming and Performance	3	4223	April 22, 2008
cudaMalloc structure CUDA Programming and Performance	5	8294	July 10, 2008

Problem with kernel launch too many paremeters, trying to use a struct

Related topics