cudaGetLastError() for asynchronous calls

kometa_triatlon · August 17, 2015, 1:00pm

I have a typical for loop that asynchronously copies data to device and calls (also asynchronously) kernels that process those chunks of data.

My question is: how to control that kernels executed successfully?

For the synchronous case we call:

cudaDeviceSynchronize(); cudaGetLastError();

But, is the cudaGetLastError() a proper choice for the async case? Currently, I do the following:

for( size_t si = 0; si < streams_num; si++ ) {
cudaMemcpyAsync(..., streams[si]);
kernel<<<....,streams[si]>>>();
checkCudaErrors(cudaStreamSynchronize(streams[si]));
checkCudaErrors(cudaGetLastError());
}

checkCudaErrors is an auxiliary macros that handles the return code.

Robert_Crovella · August 17, 2015, 2:01pm

google “proper cuda error checking”

take the first hit

it explains how to handle the API call case, and the kernel call case

Topic		Replies	Views
Return error codes from previous, asynchronous launches CUDA Programming and Performance	1	998	April 5, 2010
Kernel launch error checking CUDA Programming and Performance	0	1274	April 14, 2013
cudaDeviceSynchronize not returning error of type "invalid configuration argument" CUDA Programming and Performance	2	1712	March 12, 2014
0.9 asynchronous kernel question CUDA Programming and Performance	7	8536	June 14, 2007
cudaGetLastError returns a strange error CUDA Programming and Performance	3	2770	May 23, 2018
Asyncronus call CUDA Programming and Performance	1	2293	September 24, 2009
cudaMemcpyAsync code problem CUDA Programming and Performance	3	4601	September 16, 2008
cudaGetLastError. Which kernel execution raised it? CUDA Programming and Performance	10	3727	March 8, 2019
How to check if kernel was launched? Is possible that kernel failed to launch but it was not recorde CUDA Programming and Performance	3	3320	March 8, 2010
Asynchronous execution of kernels CUDA Programming and Performance	1	3048	July 10, 2008

cudaGetLastError() for asynchronous calls

Related topics