nvcc optimization flags

laurenluckiez · January 17, 2019, 5:39pm

Dear all,
I would like to know all the optimization flags of the nvcc compiler. Is there a clear list or document that describes all of nvcc's optimization options and their flags?

Thank you in advance!

Robert_Crovella · January 17, 2019, 7:37pm

They are documented in the nvcc manual:

https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/index.html

You can also get command line help with:

nvcc --help

laurenluckiez · January 18, 2019, 1:53pm

Hi,
thank you very much for your reply!

I read the document you pointed me to and found that the flags that automatically optimize CUDA C code are the -O options, which, as in gcc, optimize the host code (correct me if this is wrong).

I would like to ask:
a) Can I control individual optimization flags with nvcc (-faggressive-loop-optimizations, -falign-functions, -falign-jumps, -falign-labels, -falign-loops, …) as in gcc?
b) Are there optimization flags that can optimize the GPU kernels (the device code)?

Thank you in advance!

Robert_Crovella · January 18, 2019, 2:59pm

The only thing officially supported and documented is what is listed in the manual link I pointed out.

The -O flag:

- gets passed to the host compiler for its use
- may also impact what is used on the ptxas compilation command line. ptxas is the primary tool that generates optimized device code.

a) No, not that I am aware of.
b) The nvcc flags do affect the optimization of GPU kernels (device code) to the extent that they change what gets passed to ptxas. For example, the -G option will disable optimizations in ptxas.

You can use the --verbose flag on nvcc to see experimentally the effect that adding various -O optimization levels and/or -G to the nvcc command line has on the ptxas command line, and so learn how they change ptxas operation.
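
A minimal sketch of that experiment, for illustration only. The file name saxpy_dryrun.cu and the kernel are hypothetical, and the sketch uses --dryrun rather than --verbose: --dryrun lists the sub-commands nvcc would run without executing them, while --verbose prints them as it actually compiles. The exact ptxas arguments you see depend on your CUDA version.

```shell
#!/bin/sh
# Hypothetical test kernel; any small .cu file would do.
cat > saxpy_dryrun.cu <<'EOF'
__global__ void saxpy(int n, float a, const float *x, float *y)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}
EOF

if command -v nvcc >/dev/null 2>&1; then
    for flags in "" "-O3" "-G"; do
        echo "== nvcc flags: ${flags:-(none)} =="
        # --dryrun prints the compilation sub-commands (on stderr) without
        # running them; keep only the ptxas invocation for comparison.
        # $flags is left unquoted on purpose so an empty value adds no argument.
        nvcc --dryrun $flags -c saxpy_dryrun.cu 2>&1 | grep ptxas || true
    done
else
    echo "nvcc not found; skipping"
fi
```

Comparing the three ptxas lines shows which nvcc-level flags, if any, change what ptxas receives on your installation.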

I wouldn’t be able to answer detailed questions such as what the various -O1, -O2 and -O3 optimization levels affect in ptxas, as that is not documented anywhere that I know of and is probably subject to change from one CUDA version to the next.

njuffa · January 18, 2019, 5:11pm

As far as I recall, -On affects only host code. To set the PTXAS optimization level, one would need to use -Xptxas -On; the default is -Xptxas -O3.
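
As a rough sketch of setting the two levels separately (the file name saxpy_opt.cu and the kernel are hypothetical): the plain -O2 is the nvcc-level flag discussed above, while -Xptxas forwards a comma-separated option list to ptxas. Adding -v to that list makes ptxas report register and memory usage, which gives one way to see whether the ptxas level changed anything for a given kernel.

```shell
#!/bin/sh
# Hypothetical test kernel; any small .cu file would do.
cat > saxpy_opt.cu <<'EOF'
__global__ void saxpy(int n, float a, const float *x, float *y)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}
EOF

if command -v nvcc >/dev/null 2>&1; then
    # Same nvcc-level -O2 in both builds; only the ptxas level differs.
    echo "== ptxas -O1 =="
    nvcc -O2 -Xptxas -O1,-v -c saxpy_opt.cu -o saxpy_opt_O1.o
    echo "== ptxas -O3 =="
    nvcc -O2 -Xptxas -O3,-v -c saxpy_opt.cu -o saxpy_opt_O3.o
else
    echo "nvcc not found; skipping"
fi
```

For a kernel this small the two reports may well be identical; the difference is more likely to show up on larger kernels.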

The front portion of the CUDA compiler (where architecture-independent optimizations happen) is based on LLVM, and I think (not sure) the open-source LLVM distribution comes with a PTX code generator, so if you want to experiment with specific optimization strategies in the context of CUDA, that may be a way to experiment with the details of various optimizations.

Note that in the CUDA toolchain, PTX code is compiled down to machine language by PTXAS, which despite its name is an optimizing compiler. This means that PTX serves a dual role as a virtual architecture and a compiler intermediate format.
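
The two stages can be run by hand to look at the intermediate PTX, roughly as sketched below. The file names and the kernel are hypothetical, and sm_75 is only an assumed target; substitute an architecture that your GPU and CUDA version support.

```shell
#!/bin/sh
# Hypothetical test kernel; any small .cu file would do.
cat > saxpy_ptx.cu <<'EOF'
__global__ void saxpy(int n, float a, const float *x, float *y)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}
EOF

ARCH=sm_75   # assumption: adjust to your GPU and toolkit

if command -v nvcc >/dev/null 2>&1 && command -v ptxas >/dev/null 2>&1; then
    # Stage 1: stop after the front end and keep the human-readable PTX.
    nvcc -arch="$ARCH" -ptx saxpy_ptx.cu -o saxpy_ptx.ptx
    # Stage 2: compile the PTX to machine code with ptxas directly,
    # choosing its optimization level and asking for a resource report.
    ptxas -arch="$ARCH" -O3 -v saxpy_ptx.ptx -o saxpy_ptx.cubin
else
    echo "nvcc/ptxas not found; skipping"
fi
```

The .ptx file is plain text, so it can be inspected or even edited before the second stage.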

pis2017001 · April 29, 2019, 6:03am

Hi all,

I am exploring the NVIDIA TX1 for optimization-related work (just started). Can you please suggest some ways to go about it? What I want to do is this:
We optimize normal C/C++ code with LLVM via compiler flags like O1, O2 … Ofast, or by writing our own optimization/analysis pass. Can we do the same with CUDA code? If yes, then how?

Thanks,
Pravin Srivastav

Robert_Crovella · April 29, 2019, 10:13am

TX1 uses nvcc also. All you have to do is read this thread. For TX1 specific questions you may wish to post those on the TX1 forum.
