Problem accessing global memory General protection fault

Hyena · October 30, 2007, 11:01am

Hello there.

I encountered a problem when I try to access a variable located in device memory. Here is the situation basically:

// ------ CODE -------

struct mystruct
{
int dummy;
};

#define ARRAY_SIZE 16

device struct mystruct* myarray;

void test()
{
// dynamically allocate the array of structs
cudaMalloc((void**)&myarray, ARRAY_SIZE*sizeof(struct mystruct));

  // this triggers an exception ?!?!?
  myarray[0].dummy = 1;

}

// ------- END CODE -------

An important thing to mention is that this happens only when running on the device. The code runs OK in emulation mode.
My GPU is 8600 GTS.

Anyone had the same problem? I appreciate any help!

Peter

prkipfer · October 30, 2007, 11:10am

cudaMalloc is an API call for host code. It allocs device mem and returns the start address. There is no need for declaring the device mem then.

The pointer to the pointer variable you pass into cudaMalloc therefore should be a pointer variable on the host, not on the device. (it sometimes works in emulation as everything is on the host then)

For accessing the mem, you can pass this pointer to the kernel as parameter.

Peter

Hyena · October 30, 2007, 12:22pm

Thanks for the quick response, but now there is another problem. Suppose the struct is defined as

struct mystruct

{

float* dynarray;

};

and I need to allocate dynamically the array of structs first, and then allocate an array of floats for each member of that array? I still get the unhandled exception…

MisterAnderson42 · October 30, 2007, 1:34pm

ANY memory allocated by cudaMalloc is DEVICE memory. You cannot dereference such a pointer anywhere except in a kernel. If you need to allocate a pointer inside an allocated structure, you are going to need to make a “mirror” structure on alocated on the host (with normal malloc/new/whatever), then cudaMalloc all of the pointers inside and cudaMemcpy the mirror structure to device memory.

prkipfer · October 31, 2007, 12:59pm

As MisterAnderson42 emphasized, you cannot dereference a device mem pointer on the host. So you probably need to “invert” you structures, ie. turn an array of structs into a struct of arrays. Then you alloc all the arrays, put their start pointers into the struct and upload the struct.

Peter

Topic		Replies	Views
question about memory allocation CUDA Programming and Performance	1	1637	October 16, 2007
Pointer in "complex" structure CUDA Programming and Performance	4	3188	March 8, 2009
Access violation in cudaMalloc CUDA Programming and Performance	4	7230	May 4, 2009
Allocating memory for structure of 2D arrays on device CUDA Programming and Performance	3	828	November 15, 2010
Allocate memory for object in cuda CUDA Programming and Performance	1	1942	May 2, 2011
C Structures CUDA Programming and Performance	1	4633	May 23, 2007
Multidimensional array, cudaMalloc CUDA Programming and Performance	1	7197	December 8, 2008
Is this memory allocation fine? CUDA Programming and Performance	2	921	March 26, 2009
Transfering struct with pointers to device memory Used for variable argument list CUDA Programming and Performance	11	27032	January 19, 2011
cudaMalloc structure CUDA Programming and Performance	5	8303	July 10, 2008

Problem accessing global memory General protection fault

Related topics