Compiler -Xptxas flag has no effect

Sasha_Buzko · December 16, 2011, 10:57pm

Hi all,
I’m trying to determine register usage by adding the -Xptxas -v flag to the compiler options. However, I get no additional information in the compiler output. The flag is simply ignored.
Are there version variations to the usage of this flag? Or is there some other prerequisite for it to work correctly that’s not mentioned in the documentation?
I use CUDA 4.0 on a x86_64 Linux with the CUDA plugin in Eclipse.

Thanks for any suggestions.

Sasha

tera · December 16, 2011, 11:29pm

What is the full command line? You might only be generating PTX code and thus never invoke ptxas, so the option would simply get ignored.

Sasha_Buzko · December 16, 2011, 11:38pm

Sounds like a possible reason… Here’s the full command line:
/usr/local/cuda/bin/nvcc -I/usr/local/cuda/include -I/work/v/boost_1_41_0 -I"/home/sasha/workspace/V2/cuda" -I"/home/sasha/workspace/V2/lib" -I/usr/include -I/usr/lib/gcc/x86_64-redhat-linux/4.1.2/include -I/usr/include/c++/4.1.2/backward -I/usr/local/include -I/usr/include/c++/4.1.2/x86_64-redhat-linux -I/usr/include/c++/4.1.2 -O3 -g -c -Xcompiler -fmessage-length=0 -arch=compute_20 -Xptxas -v -o “cuda/cuda_main.o” “…/cuda/cuda_main.cu”

Thanks

Sasha

tera · December 17, 2011, 12:25am

Indeed. [font=“Courier New”]-arch=compute_20[/font] generates PTX only. Add [font=“Courier New”]-code=compute_20,sm_20[/font] to it to also run ptxas and thus see the register use.

Sasha_Buzko · December 17, 2011, 1:06am

Thanks for the advice, Tera.
Addition of the -code flag worked right away. A quick question, though: the compiler printed out information only for several functions. The kernel is fairly complex, and I have well over three dozen functions. Is there a way to force it to produce info for all of them? Or perhaps indicate which ones I’m interested in?

Thanks again

Sasha

tera · December 19, 2011, 11:51am

The functions that produce no register usage diagnostics probably get inlined. Add [font=“Courier New”]-Xopencc -noinline[/font] to the nvcc arguments to see some approximation to their register usage (and have their register usage removed from the calling function or kernel).

Allow function inlining for production compilation however, it will probably generate faster kernels.

Sasha_Buzko · December 19, 2011, 5:37pm

The -noinline flag works great - all functions are listed individually.

Thanks for your help

Sasha

Topic		Replies	Views
nvcc -Xptxas doesn't seem to work ? CUDA Programming and Performance	5	5287	July 14, 2011
Nvrtc compiler summary CUDA Programming and Performance	5	399	January 20, 2024
Adding --ptxas-options=-v flag into Cmake CUDA Programming and Performance	1	1072	November 15, 2018
--ptxas-options=-v is not working CUDA Programming and Performance	1	1651	April 30, 2011
setting ptxas options Another simple question CUDA Programming and Performance	1	2939	February 20, 2008
ptxas register use CUDA Programming and Performance	5	1809	March 4, 2014
Compiler option --ptxas-options=-v gives wrong register count? CUDA Programming and Performance	3	3174	July 15, 2010
Looking for a list of values --optimize and --ptxas-options can take NVCC compiler options CUDA Programming and Performance	3	9959	January 31, 2009
--ptxas-options=-v Equivalent for CUDA Fortran? Legacy PGI Compilers	10	12293	September 2, 2010
CUDA low-level programming - strange ptxas behavior CUDA Programming and Performance	4	1494	February 17, 2014

Compiler -Xptxas flag has no effect

Related topics