cudaThreadSynchronize() does not make the CPU wait

sacrif · March 3, 2010, 11:19pm

Environment: Windows7, VS2008, QT4.5, CUDA 2.3

Hi,

I have a C++ program in which a CUDA kernel is launched inside a for loop (at least, the launch is initiated there). The result of the launch is a rendered image, which should be saved within the loop (in my C++ program) after rendering. So one iteration of the for loop sets a parameter that determines how the image is rendered, renders the image, and saves the resulting image (as soon as the kernel has finished the render task). The rendering itself is done by an external program that renders using CUDA.

I found that the CUDA function cudaThreadSynchronize() should help in this case by making the CPU wait until my previously launched CUDA kernel finishes.

So I wrote the following code in my .cpp file/class-method:

void MyWidgetClass::setTf(unsigned int index)
{
  for(unsigned int i = 0; i < myVector_.size(); i++)
  {
    QGradient gradient = myVector_[i]->getGradient();
    tfEditor_->setGradient(gradient); // emits a QSignal that initiates the CUDA kernel launch; the rendered image is shown in the viewer_ widget

    cudaThreadSynchronize(); // the CPU should wait here, in each iteration of the for loop, for the external program's calculation on the GPU

    QImage image = viewer_->grabFrameBuffer(); // get the rendered image and put it in a QImage
    image.save();
    imagevector_.push_back(image);
  }
}

Unfortunately, with this code I still get the wrong images when I save them (images that were rendered earlier). Is there anything special I should consider when using this function? Or should it work like that? cudaThreadSynchronize() returns cudaSuccess, but it still does not seem to make the CPU wait.

Could it be a problem that the CUDA kernel is not launched directly within the block, but is only initiated by a QSignal emitted by tfEditor_->setGradient(gradient)? Perhaps cudaThreadSynchronize() does not find a CUDA kernel to wait for and simply proceeds? And are there other ways to work around this?

Regards

tmurray · March 3, 2010, 11:22pm

It sounds like you’re launching the kernel from a separate thread; this will not work. You can only synchronize from the same thread as the one currently holding the CUDA context.
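
One way to act on this is to let the thread that launches the kernel also do the synchronizing, and have it report completion to the caller. The following is only a minimal sketch in plain C++, not the original program: renderFrame() and renderAll() are hypothetical names, the integer result stands in for the rendered image, and std::async stands in for whatever thread the external renderer actually uses. In real code the kernel launch and the cudaThreadSynchronize() call would both sit inside renderFrame().

```cpp
#include <future>
#include <vector>

// Hypothetical stand-in for the external renderer. In real code this
// function would launch the CUDA kernel and then call
// cudaThreadSynchronize() itself, on the same thread that owns the
// CUDA context. Here it just computes a dummy "frame" value.
int renderFrame(int parameter) {
    return parameter * 10;
}

// Renders one frame per parameter on a separate thread and waits for
// each result before moving on, so a frame is never collected early.
std::vector<int> renderAll(const std::vector<int>& parameters) {
    std::vector<int> frames;
    for (int parameter : parameters) {
        // std::launch::async forces renderFrame to run on another thread.
        std::future<int> pending =
            std::async(std::launch::async, renderFrame, parameter);

        // get() blocks until the rendering thread has finished, which
        // replaces the cross-thread synchronize call in the caller.
        frames.push_back(pending.get());
    }
    return frames;
}
```

The point of the design is that the caller never asks CUDA whether work is finished; it waits on a completion handle that the rendering thread fulfils after its own synchronize call. In a Qt program the same role could be played by a signal emitted when rendering is done, with the image grabbed and saved in the connected slot.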
