cudaMemcpy slow down

catcool27 · May 8, 2009, 8:33pm

We are seeing some strange behavior when we allocate global memory and texture memory (a CUDA array) and then repeatedly copy from global to texture memory. The small test program below reproduces the behavior of our application.

We timed the cudaMemcpyToArray calls: for iterations 0-1134 the average time was ~10 microseconds, and after that it jumped to ~275 microseconds and stayed there.

We are using a Tesla C1060 with CUDA 2.1.

Does anyone have any insight into why this might be happening?

Thanks!

cudaArray *cuda_arr;
unsigned short *gpu_arr;
unsigned int size_y(1000), stride(1000);

// One 16-bit unsigned channel per element.
cudaChannelFormatDesc channelDesc = cudaCreateChannelDesc(16, 0, 0, 0, cudaChannelFormatKindUnsigned);

// Destination: 2D CUDA array (texture memory). Source: linear global memory.
cudaMallocArray(&cuda_arr, &channelDesc, stride, size_y);
cudaMalloc((void **)&gpu_arr, stride * size_y * sizeof(unsigned short));

// Repeat the same device-to-device copy; this is the call we timed.
for (int i = 0; i < 32000; i++) {
    cudaMemcpyToArray(cuda_arr, 0, 0, gpu_arr, stride * size_y * sizeof(unsigned short), cudaMemcpyDeviceToDevice);
}
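For anyone who wants to reproduce the measurement, here is a minimal sketch of how the loop above could be timed on the device with CUDA events instead of a host timer. It assumes that a device-to-device copy may return to the host before the copy has finished, in which case a host-side timer around the call could be measuring only the cost of queuing it. The helper names `copy_bytes` and `average_copy_ms` and the `CHECK` macro are made up for this example and are not part of the original program.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: exit with a message if a CUDA runtime call fails.
#define CHECK(call)                                                  \
    do {                                                             \
        cudaError_t err_ = (call);                                   \
        if (err_ != cudaSuccess) {                                   \
            std::fprintf(stderr, "%s failed: %s\n", #call,           \
                         cudaGetErrorString(err_));                  \
            std::exit(EXIT_FAILURE);                                 \
        }                                                            \
    } while (0)

// Hypothetical helper: bytes moved by one copy of a width x height
// image of unsigned short.
static size_t copy_bytes(unsigned int width, unsigned int height) {
    return static_cast<size_t>(width) * height * sizeof(unsigned short);
}

// Hypothetical helper: average device-side time in milliseconds of
// `iterations` copies from linear global memory into a CUDA array.
static float average_copy_ms(unsigned int width, unsigned int height,
                             int iterations) {
    cudaArray *cuda_arr = NULL;
    unsigned short *gpu_arr = NULL;
    const size_t bytes = copy_bytes(width, height);

    cudaChannelFormatDesc channelDesc =
        cudaCreateChannelDesc(16, 0, 0, 0, cudaChannelFormatKindUnsigned);
    CHECK(cudaMallocArray(&cuda_arr, &channelDesc, width, height));
    CHECK(cudaMalloc((void **)&gpu_arr, bytes));
    CHECK(cudaMemset(gpu_arr, 0, bytes));

    cudaEvent_t start, stop;
    CHECK(cudaEventCreate(&start));
    CHECK(cudaEventCreate(&stop));

    // The events are queued on the same (default) stream as the copies,
    // so the elapsed time covers the copies as executed by the device.
    CHECK(cudaEventRecord(start, 0));
    for (int i = 0; i < iterations; i++) {
        // Same call as in the test program. It is deprecated in newer
        // toolkits, where cudaMemcpy2DToArray is the replacement.
        CHECK(cudaMemcpyToArray(cuda_arr, 0, 0, gpu_arr, bytes,
                                cudaMemcpyDeviceToDevice));
    }
    CHECK(cudaEventRecord(stop, 0));
    CHECK(cudaEventSynchronize(stop));  // wait until all copies are done

    float total_ms = 0.0f;
    CHECK(cudaEventElapsedTime(&total_ms, start, stop));

    CHECK(cudaEventDestroy(start));
    CHECK(cudaEventDestroy(stop));
    CHECK(cudaFree(gpu_arr));
    CHECK(cudaFreeArray(cuda_arr));
    return total_ms / iterations;
}

int main() {
    const unsigned int stride = 1000, size_y = 1000;
    const int iterations = 32000;
    const float ms = average_copy_ms(stride, size_y, iterations);
    std::printf("average copy time: %.1f microseconds (%zu bytes per copy)\n",
                ms * 1000.0f, copy_bytes(stride, size_y));
    return 0;
}
```

Running it once with a small iteration count (say 1000) and once with the full 32000 would show whether the per-copy time measured on the device also changes, or whether only the host-side timing does.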

tmurray · May 8, 2009, 8:54pm

What OS are you using?

catcool27 · May 11, 2009, 1:29pm

Linux 2.6.23.1-42.fc8 #1 SMP Tue Oct 30 13:18:33 EDT 2007 x86_64 GNU/Linux
