Hi, all.
I have a question about the timing of cudaThreadSynchronize and cudaMemcpy.
Some sample programs don't call cudaThreadSynchronize before cudaMemcpy.
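For reference, here is a minimal sketch of the pattern I mean. The kernel name fillIndex and the buffer names are made up for illustration and are not taken from any particular SDK sample.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread writes its own global index.
__global__ void fillIndex(int *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        out[i] = i;
    }
}

int main()
{
    const int n = 256;
    int host[n];
    int *dev = NULL;

    cudaMalloc((void **)&dev, n * sizeof(int));

    // Kernel launch.
    fillIndex<<<(n + 127) / 128, 128>>>(dev, n);

    // Note: no cudaThreadSynchronize() call here.
    // Is the kernel guaranteed to have finished when this copy returns?
    cudaMemcpy(host, dev, n * sizeof(int), cudaMemcpyDeviceToHost);

    printf("host[%d] = %d\n", n - 1, host[n - 1]);

    cudaFree(dev);
    return 0;
}
```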
In this case, does cudaMemcpy wait for the GPU kernel to finish, or does it work asynchronously?
Can I find the specifics about this in the programming guide?