speed nvcc compiler

wlangdon · January 3, 2014, 11:00am

The question of how fast the CUDA compiler is has come up several times.
It appears, with small kernels, (see
http://www.cs.ucl.ac.uk/staff/W.Langdon/cuda5/nvcc_timing.gif )
the compiler gets faster when it is asked to compile more kernels.
With a fairly broad peak at about 500 lines per second for above 10000
lines of code but falling away after 20000. With more than 260,000 ptxas
failed but I suspect this is related to running out of memory
(nvcc 5.0, Linux 4GB dual 2.66GHz core).
I am sure details will vary. This CUDA code was created by concatinating many
times the same 87 line kernel. But it does suggest if you want to optimise your code
it makes sense to compile multiple versions of it together rather than separately.
Bill

njuffa · January 3, 2014, 11:42am

I suspect that compilation times are highly dependent on the optimization phases triggered by particular pieces of code. The time complexity of various compiler phases appears to be super-linear with respect to lines of code. This is not surprising as many of the underlying problems are in NP and only the use of heuristics leads to manageable compilation times in the first place.

Lengthy compilation times (> 10 minutes) and massive memory use by the compiler often go hand-in-hand. This usually happens with voluminous source code, and is a good indication that a particular compiler phase needs optimization work. I would encourage CUDA programmers to file bugs for such occurrences with real-life code bases.

A few months back I cooked up some code that took 25 minutes to compile on a 3.4 GHz machine, while the compiler chewed through 3 GB of memory. The resulting machine code worked correctly, but the lengthy compile time really threw a wrench into my engineering process. I filed a bug, and after the issue was fixed, the same code compiled in 16 seconds.

Topic		Replies	Views
Slow Compilation with multiple calls of same function CUDA Programming and Performance	1	764	September 30, 2011
How to reduce compile time for big kernel function? CUDA Programming and Performance	3	5433	November 23, 2009
Long compilation time with CUDA 5.0 CUDA Setup and Installation	4	2366	October 16, 2013
why adding 1 line =exploding time to compile CUDA Programming and Performance	13	8447	June 8, 2009
High compilation time CUDA Programming and Performance	4	1538	September 26, 2008
CUDA v2.0 beta is slower than CUDA v1.1 Is it just temporarily ? CUDA Programming and Performance	3	2664	July 20, 2008
Slow compile and cudaMalloc CUDA Programming and Performance	8	3691	February 2, 2011
Program compilied with CUDA 5.5 is slower than with 5.0 (about 10% degradation) CUDA Programming and Performance	4	904	May 22, 2014
Reducing Application Build Times Using CUDA C++ Compilation Aids Technical Blog	1	632	October 31, 2021
Cuda compilation time issue nvcc 10x slower than cl to parse boost/shared_ptr.hpp CUDA Programming and Performance	0	1285	December 5, 2011

speed nvcc compiler

Related topics