Initial delay of 4 seonds on CUDA program before executing on a multi-GPU machine

Gaurish · January 30, 2012, 3:38pm

I have a GPU desktop machine, having 4 Tesla C2050 cards. When executing a CUDA program I notice, that there is a delay of about 4 seconds before my code actually starts running. This delay happens only at the beginning of the program and not during individual kernel launches.

This does not happen on my other machine which has a single GTX 570.

Why could this be happening? I am using the Red-Hat Santiago 6.1 Linux OS on the Tesla machine.

short · January 30, 2012, 4:19pm

We have seen similar issues but only on linux32 + 285.xx and later drivers. We have filed a bug with NVIDIA and they are currently at work for future driver versions (incident #929288).

–If possible, downgrade to the 27x driver series.

Topic		Replies	Views
5 times speed up when using 4 GPUs? CUDA Programming and Performance	1	681	January 17, 2013
Slow CUDA programs' startup CUDA Programming and Performance	10	7246	January 23, 2012
Latency when I launch a program on Tesla S2050 CUDA Programming and Performance	0	2904	January 9, 2012
Trying to reduce delays between kernel launches CUDA Programming and Performance	0	6638	January 4, 2011
Tesla C1060 and cudaSetDevice CUDA Programming and Performance	2	6369	July 2, 2009
Different exe times on same type cards CUDA Programming and Performance	0	2051	September 2, 2011
why any CUDA program takes more than 1s? driver initialization time? CUDA Programming and Performance	7	3395	March 25, 2009
CUDA Application Startup Speed on Different Cards CUDA Programming and Performance	2	690	September 2, 2014
5 seconds running time limitation CUDA Programming and Performance	2	2123	August 28, 2008
[ runtime initialization very slow as gpu count increases, linux ] CUDA Programming and Performance	1	7379	April 7, 2011

Initial delay of 4 seonds on CUDA program before executing on a multi-GPU machine

Related topics