I am working on an algorithm for research and will publish it if the results are good. I developed the initial code on my laptop (it has a 950M, compute capability 5.0); however, the laptop has little device memory, so the CPU version of the code is faster than the GPU version there. I then tried running the code on a K40c and a K40m (compute capability 3.5). In all three cases I used the same dataset, yet the computed results differ. Is this common? Is my algorithm behaving differently depending on which GPU device I am using? I don’t have a whole lot of experience programming CUDA, but I assume this should not be the case.
Please suggest how I can go about identifying and fixing the problem.
If you use exactly the same CUDA version on both devices, use identical compilation switches, and your code contains neither bugs (e.g. race conditions, out-of-bounds accesses) nor non-deterministic operations (e.g. atomic floating-point operations), then you should get matching results for the same input data set passed to the GPU.
Use cuda-memcheck to check for race conditions and out-of-bounds accesses. Note that this tool will find many, but not all, instances of such problems. Your host computation (if any) may have similar issues; use valgrind or a similar tool to find some of them. Your host code may also pass different input data to the GPU due to different compilers, compiler versions, or libraries. When in doubt, dump the entire GPU input in raw binary form to double-check.
Avoid JIT compilation, as it makes enforcing the “same CUDA version” provision trickier: the JIT compiler is part of the CUDA driver and may be updated on a different schedule than the CUDA toolkit (and therefore the offline compiler) itself.
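One way to keep the JIT out of the picture is to embed machine code for every target architecture at compile time. An illustrative nvcc invocation for the two GPU families mentioned above (file names are placeholders):

```shell
# Embed SASS for sm_35 (Tesla K40) and sm_50 (GeForce 950M) so the
# driver never has to JIT-compile PTX at application load time.
nvcc -gencode arch=compute_35,code=sm_35 \
     -gencode arch=compute_50,code=sm_50 \
     -o myalgo myalgo.cu
```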