memcpy problem

riclas · May 9, 2008, 4:12pm

hi,

i have this function:

extern "C" void cuMagnitude(cufftComplex *src, int *dst,int len){

   CUT_DEVICE_INIT();

	

	cufftComplex *srcDevice;

	int *dstDevice;

	

	CUDA_SAFE_CALL(cudaMalloc((void**)&srcDevice, len*sizeof(cufftComplex)));

	CUDA_SAFE_CALL(cudaMalloc((void**)&dstDevice, len*sizeof(int)));

	

	CUDA_SAFE_CALL(cudaMemcpy(srcDevice, src, len*sizeof(cufftComplex), cudaMemcpyHostToDevice));

	

	for(int i=0;i<len;i++){

  dstDevice[i]=sqrt(powf(srcDevice[i].x,2)+powf(srcDevice[i].y,2))/32768+0.5;

	}

	

	CUDA_SAFE_CALL(cudaMemcpy(dst, dstDevice, len*sizeof(int), cudaMemcpyDeviceToHost));

	

	CUDA_SAFE_CALL(cudaFree(srcDevice));

	CUDA_SAFE_CALL(cudaFree(dstDevice));

	

}

the variables look allocated with valid adresses at the beggining of the function, so, when it gets to

CUDA_SAFE_CALL(cudaMemcpy(srcDevice, src, len*sizeof(cufftComplex), cudaMemcpyHostToDevice));

srcDevice becomes invalid… can’t access it …

what can be the problem???

EDIT: i changed the order of the variables and now the problem is in dstDevice but in the same line…

also, the program continues after this line but srcDevice is never set correctly.

i don’t understand these cuda problems :\

seibert · May 9, 2008, 4:58pm

You cannot directly read from or write to device memory from the host. Only memory copies to and from the device can be done.

riclas · May 9, 2008, 5:13pm

so i have to put all the math operations code in a global function, is that it?

is that referenced in the programming guide or somewhere?? i didn’t know that…
and i think this is the source of all my problems :S

thank you

seibert · May 9, 2008, 5:31pm

Yeah, this is pretty fundamental to CUDA. Device memory is on the graphics card, and separated by a PCI-Express bus from the CPU, so you have explicitly copy data to or from the device when you need it. All other operations on device memory happen in global functions.

riclas · May 10, 2008, 12:17pm

i understood the need to copy data from host to device from all the samples i saw, but i thought i could put the device code in the same function…

maybe the people who write the programming guide could put this explanation there?
or if it is there, point me to the right place, because i didn’t find it …

thanks again

seibert · May 10, 2008, 2:59pm

This is the first relevant quote I found, section 4.2.2.4 (in CUDA 2.0 guide, not sure what section # it is in earlier guides):

“Dereferencing a pointer either to global or shared memory in code that is executed
on the host or to host memory in code that is executed on the device results in an
undefined behavior, most often in a segmentation fault and application termination.”

Topic		Replies	Views
program crash when copying from device to host <br /> CUDA Programming and Performance	11	1977	March 31, 2009
Problems with cudaMemcpy. cudaErrorInvalidDevicePointer. CUDA Programming and Performance	1	3460	October 6, 2007
CUDA class - allocate memory using malloc (Dynamic Global Memory Allocation and Operations) CUDA Programming and Performance	3	3201	February 2, 2017
Newbie question about data transfer CUDA Programming and Performance	4	2756	July 25, 2008
Device Memeroy allocation and data transfer Data transfer between host and device CUDA Programming and Performance	5	2639	June 16, 2011
strange problem accessing device memory cudaMalloc and cudaMemcpy CUDA Programming and Performance	0	2320	April 2, 2010
global to global memory transfer problem CUDA Programming and Performance	3	4372	November 28, 2007
The most basic problem,ask for help CUDA Programming and Performance	5	2159	February 2, 2009
Can't copy struct data from host to device CUDA Programming and Performance	4	1032	July 27, 2013
Copying Data from host to Device and Back CUDA Programming and Performance	5	1528	August 14, 2015

memcpy problem

Related topics