slow CUDA application start-up times on headless compute box

sagrailo · December 25, 2011, 6:55pm

I’m experimenting with multi-GPU programming, at the moment using Amazon EC2 Cluster GPU instances. I’ve noticed that my CUDA application needs significant time to start up. Start-up consists of querying device info to verify that all devices have CC >= 2.0, then allocating device memory (on the order of 100 MB per GPU) and copying the input data there. This typically takes 2 to 3 seconds, whereas the same operations take negligible time on my desktop development machine, and it doesn’t matter whether I use one or both GPUs on a given node. On the other hand, my kernels run as expected, achieving an almost 2x speedup in the two-GPU configuration versus the single-GPU configuration. However, since this is a sort of demo application, these lengthy start-up times really cripple the overall speedup numbers. This post, http://www.aigfx.com/2010/12/four-cuda-tips/, suggests that running an X server may help, but I’m wondering whether there are any other solutions, and why exactly this happens at all.
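To narrow down which part of the start-up is slow, something like the following could time each phase separately. This is only a minimal sketch of the sequence described above, not my actual code: `meets_min_cc`, `ms_since` and `report_startup_costs` are made-up names, and the explicit `cudaFree(nullptr)` call is just the commonly used way to force context creation on a device, so that its cost is not folded into the first `cudaMalloc`.

```cuda
#include <chrono>
#include <cstddef>
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical helper: true if compute capability major.minor is at least
// min_major.min_minor (defaults to the CC >= 2.0 check from the post).
static bool meets_min_cc(int major, int minor, int min_major = 2, int min_minor = 0) {
    return major > min_major || (major == min_major && minor >= min_minor);
}

// Milliseconds elapsed on the host wall clock since t0.
static double ms_since(std::chrono::steady_clock::time_point t0) {
    return std::chrono::duration<double, std::milli>(
               std::chrono::steady_clock::now() - t0).count();
}

// Hypothetical driver: prints the time spent in each start-up phase for
// every device. Returns 0 on success, 1 on any CUDA error or unsupported GPU.
int report_startup_costs(std::size_t bytes_per_gpu) {
    auto t0 = std::chrono::steady_clock::now();
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess) return 1;
    std::printf("device count (%d): %.1f ms\n", count, ms_since(t0));

    std::vector<char> host(bytes_per_gpu, 0);  // stand-in for the input data
    for (int dev = 0; dev < count; ++dev) {
        // Phase 1: query device info and check the compute capability.
        cudaDeviceProp prop;
        t0 = std::chrono::steady_clock::now();
        if (cudaGetDeviceProperties(&prop, dev) != cudaSuccess) return 1;
        std::printf("GPU %d query: %.1f ms\n", dev, ms_since(t0));
        if (!meets_min_cc(prop.major, prop.minor)) return 1;

        // Phase 2: force context creation on this device.
        t0 = std::chrono::steady_clock::now();
        if (cudaSetDevice(dev) != cudaSuccess) return 1;
        if (cudaFree(nullptr) != cudaSuccess) return 1;
        std::printf("GPU %d context: %.1f ms\n", dev, ms_since(t0));

        // Phase 3: allocate device memory.
        void* d_buf = nullptr;
        t0 = std::chrono::steady_clock::now();
        if (cudaMalloc(&d_buf, bytes_per_gpu) != cudaSuccess) return 1;
        std::printf("GPU %d malloc: %.1f ms\n", dev, ms_since(t0));

        // Phase 4: copy the input data, then wait until the device is idle.
        t0 = std::chrono::steady_clock::now();
        cudaError_t err = cudaMemcpy(d_buf, host.data(), bytes_per_gpu,
                                     cudaMemcpyHostToDevice);
        if (err == cudaSuccess) err = cudaDeviceSynchronize();
        std::printf("GPU %d copy: %.1f ms\n", dev, ms_since(t0));

        cudaFree(d_buf);
        if (err != cudaSuccess) return 1;
    }
    return 0;
}
```

Calling `report_startup_costs(100u << 20)` from `main` (built with `nvcc`) would roughly mirror the 100 MB per GPU case, and comparing its output between the EC2 node and the desktop should show which phase accounts for the extra seconds.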

Thanks.
