cuda-memcheck versus cuda-racecheck

Hello,

Both my project’s debug build and release build work fine: both complete execution, and the release build yields the same results as the debug build

I can run racecheck (cuda-memcheck --tool racecheck) on the project’s release build, and it finishes without error in good time

But cuda-memcheck itself (the default memcheck tool) gets nowhere; after what felt like an infinite amount of time (1 hour) I gave up waiting. The release build normally finishes in under 1 minute

I tried memcheck on both the release build and the debug build

Any ideas why memcheck is seemingly ‘mis-behaving’…?

cuda-memcheck can substantially increase execution time for some kernels. It instruments memory accesses under the hood in order to validate each one, and that checking can slow things down considerably. It’s not misbehaving; you just have to wait.
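For context, the two invocations being compared look roughly like this (`./app` is a placeholder for the project’s release binary, not a name from the original posts):

```
# Default tool (memcheck): validates global/local/shared memory accesses,
# so a large slowdown relative to the plain run is expected.
cuda-memcheck ./app

# racecheck: analyzes only shared-memory hazards, and is often much faster
# than the full memcheck pass on the same binary.
cuda-memcheck --tool racecheck ./app
```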

If you think there is a bug, file a bug. It would be best if you provide a short, complete, compilable code that reproduces the problem.

Like I mentioned, I waited an hour

1 minute of normal execution time versus 60+ minutes with memcheck is at least a 60x slowdown
Are you telling me that memcheck can take that long…?

The factor varies depending on the underlying code. Like I mentioned, if you think it’s a bug, file a bug.

I ran the code I posted here with cuda-memcheck:

https://devtalk.nvidia.com/default/topic/765696/efficient-in-place-transpose-of-multiple-square-float-matrices/#4276119

and the CUDA code ran 30-45x slower.

Is it possible to run memcheck within the debugger (cuda-gdb), and would kernel launches still be asynchronous?

Debugging is a rather slow process anyway, so memcheck’s overhead should ‘blend in’; and perhaps this way I can keep a closer eye on memcheck and its progress

This may be of interest:

http://docs.nvidia.com/cuda/cuda-gdb/index.html#set-cuda-memcheck
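For the curious, that page describes cuda-gdb’s built-in memcheck integration. A minimal session might look like the sketch below (`./app` is again a placeholder binary name):

```
cuda-gdb ./app
(cuda-gdb) set cuda memcheck on    # enable memcheck-style access checking in the debugger
(cuda-gdb) run                     # invalid accesses now halt execution as CUDA exceptions
```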

Thank you txbob; I’ll have a look