Documentation for PTX compilation with --Ofast-compile?

pseudoname · October 20, 2024, 10:26pm

PTX Compiler API v12.6 has added a new option, --Ofast-compile:
https://docs.nvidia.com/cuda/archive/12.6.0/ptx-compiler-api/index.html#compilation-options

I don't see it mentioned in the CUDA Toolkit 12.6 release notes. Is there anywhere I can find commentary on the design intent and any known effects or use cases?

How many settings (e.g. 0, 1, 2, 3) does it offer, and which features are turned on or off at each setting?

Could you provide any information on what percentage speedup one might expect?

pseudoname · October 21, 2024, 2:42pm

The allowable values seem to be “0” and “max”.

When compiling my PTX, however, it seems to make compilation run slower:

# without
time for i in {1..300}; do ptxas -arch=sm_60 my_code.ptx; done
real    0m6.026s
user    0m3.725s
sys     0m2.343s

# with -Ofast-compile
time for i in {1..300}; do ptxas -arch=sm_60 my_code.ptx -Ofc=max; done
real    0m6.450s
user    0m3.796s
sys     0m2.694s
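
For scale, the two totals above can be turned into per-invocation averages. This is only a rough sketch of the arithmetic, using the `real` figures copied from the runs above. The variable names are made up for illustration, and a single pair of runs says nothing about run-to-run noise.

```shell
# Wall-clock ("real") totals in seconds, copied from the two 300-iteration runs above.
runs=300
real_without=6.026
real_with=6.450

# Average wall-clock time per ptxas invocation, in milliseconds.
ms_without=$(awk -v t="$real_without" -v n="$runs" 'BEGIN { printf "%.2f", t / n * 1000 }')
ms_with=$(awk -v t="$real_with" -v n="$runs" 'BEGIN { printf "%.2f", t / n * 1000 }')

# Relative change of the -Ofc=max run versus the baseline, in percent.
pct=$(awk -v a="$real_without" -v b="$real_with" 'BEGIN { printf "%.1f", (b / a - 1) * 100 }')

echo "without:       ${ms_without} ms per run"
echo "with -Ofc=max: ${ms_with} ms per run (${pct}% slower)"
```

That works out to roughly 20.09 ms versus 21.50 ms per ptxas call, so about 7% slower with -Ofc=max on this input.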

That said, I’ve also found that -O0 compiles this same PTX slower than -O1, which makes little sense to me.

Any tips on making compilation run faster would be appreciated.

Curefab · October 22, 2024, 10:47am

It perhaps improves linking speed:

(24x speedup)
