Hi, when should cudaThreadSynchronize() be called and what is this function doing?
In the code below, there are three cudaThreadSynchronize() calls. Which of them are redundant? Which of them are necessary? Does not the kernel functions call cudaThreadSynchronize() automatically?
Hi, when should cudaThreadSynchronize() be called and what is this function doing?
In the code below, there are three cudaThreadSynchronize() calls. Which of them are redundant? Which of them are necessary? Does not the kernel functions call cudaThreadSynchronize() automatically?