Strange delay on CUDA initialization

BizNoK · April 26, 2011, 6:28am

Hi, thank you for reading this topic.
I want to get some help on my current issue about strange delay on CUDA initialization.

I’m using three GTX480 cards with no-SLI mode, and running my CUDA application on Linux platform.
In addition, I have another system consisted of three GTX285 cards with no-SLI mode, and running same platform.

When I running my program on the first platform(3 GTX480) at initialization step,
one card will initialized almost immediately(about 0.2ms), while another two cards will take 3000ms±0.5ms.
This is not appear only the first time to execute after re-booting, but it occurs again from second running.
This is also occurs not only in the initialization step(cudaSetDevice function), but also in the data transfer step(first cudaSend function)

But this phenomenon doesn’t occurs in the second platform(3 GTX285) and it makes me crazy…

Consequently, this strange delay yields 6 secs(sometimes 9 secs) delay to my first system.
I tried nvidiasmi tool to hold up the device initialization in the Linux system as I searched, but it doesn’t work at all.
So I think I have another problem on my system, but I can’t find what it is.

Please help me to fix this!

Simon_Green · May 3, 2011, 10:46am

This is a known problem (Fermi-based devices do some additional work at startup for UVA etc), and will be improved in future driver releases.

BizNoK · May 16, 2011, 6:48am

Thank you, I got a valuable answer from you.

I hope NVidia’s developers will fix this problem as soon as possible…

miccim · November 25, 2011, 2:21pm

Is that problem solved by actual driver? I have this problem too, which is a real problem as this makes 1/5 of my total program running time. Persistent Mode is enabled.

spwanasin · November 25, 2011, 7:23pm

Hey,

Well I had the same problem as well… It seemed by adding “cudaSetDevice(0);” to the very beginning of my program worked to initialize the GPU. Even though the time is greatly reduced, now it takes about 60ms which is still not fast.

I have a GTX560 TI

Btw, How exactly do you enable Persistent Mode? I’m using VS2010

miccim · November 30, 2011, 10:39am

nvidia-smi -pm 1

cudaSetDevice(0) did not solve my problem, i have a delay of about 11sec. The system has 2x C2050 cards.

spwanasin · November 30, 2011, 6:34pm

You would have to use

cudaSetDevice(0);

cudaSetDevice(1);

to intialize both cards. But yeah, 11sec is crazy long. I hope you figure it out soon.

But for now you can just measure your results by neglecting the 11seconds since its not something that should not be happening.

Topic		Replies	Views
Slow CUDA programs' startup CUDA Programming and Performance	10	7246	January 23, 2012
Runtime initialization slow (1 sec) on 400-500 series cards, very slow (5 sec) with CUDA 3.2 CUDA Programming and Performance	5	5592	April 22, 2011
First CUDA call takes 13 seconds CUDA Programming and Performance	6	4294	July 2, 2015
Device initialization takes 60 Seconds CUDA Programming and Performance	7	481	July 24, 2023
Initialization time on GTX 460 CUDA Programming and Performance	17	8576	November 9, 2011
Persistence Daemon and Slow Initialization CUDA Programming and Performance	1	1099	December 18, 2018
Slow Initialization CUDA Programming and Performance	7	2697	July 30, 2009
First CUDA function call very slow (more than a minute) on GTX 680 only CUDA Programming and Performance	4	7032	February 27, 2014
really slow cudaGetDeviceCount() several seconds to complete a cudaGetDeviceCount() call CUDA Programming and Performance	3	1198	May 18, 2011
CUDA initialization takes long time that varies up to 30 seconds on Amazon p3.16xlarge Windows machi... CUDA Programming and Performance	5	1455	December 8, 2019

Strange delay on CUDA initialization

Related topics