nvcc -O0 not working (CUDA 3.2)

RezaRob3 · April 11, 2011, 4:57am

CUDA 3.2.

This compiles without errors but clearly doesn’t work. I still get optimized code!
/usr/local/cuda/bin/nvcc -gencode=arch=compute_20,code="sm_20,compute_20" -m32 -O0 --ptxas-options -O0 --compiler-options -fno-strict-aliasing -I. -I/usr/local/cuda/include -I…/…/common/inc -I…/…/…/shared//inc -DUNIX -o vectorAdd.cuo -c vectorAdd.cu

Thanks a lot for responding.

Reza.

RezaRob3 · April 11, 2011, 6:25am

Okay, it sort-of works. I guess it’s eliminating completely unused variables/counters and such.
It apparently works for me now.

Thanks,
Reza.

RezaRob3 · April 11, 2011, 7:02am

Actually, I’m very sorry, my question still stands:

-O0 isn’t working. Why?

Reza.

tera · April 11, 2011, 8:35am

I believe Nvidia made the compiler execute many optimization passes unconditionally since e.g. compute capability 1.x devices have to use inlining to make functions work at all.

wlangdon · April 14, 2011, 9:02am

I think -O applies to host code and (I think) it does not apply to GPU kernel code.

Bill

RezaRob3 · April 14, 2011, 9:12am

Yes, I suspect you’re right. I think I strace’d my nvcc and it’s just passing that to gcc. Presumably that’s all it’s doing(?)

Thanks.

tera · April 14, 2011, 10:25am

Oops, yes. You need to pass --opencc-options=-O0 to nvcc. Somehow I missed you were doing that for ptxas and for the host compiler, but not for nvopencc.
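
To make that concrete, here is a rough sketch of how the three -O0 settings discussed in this thread could be combined for a CUDA 3.2-era nvcc. It only assembles and prints the command line rather than running it. The include paths and -D defines from the original command are left out, and the -gencode value is simplified to plain sm_20, so treat those parts as assumptions rather than a drop-in replacement.

```shell
# Sketch only: one -O0 per compilation stage, as discussed in this thread.
HOST_FLAGS="-O0"                     # optimization level for the host compiler (gcc)
OPENCC_FLAGS="--opencc-options=-O0"  # forwarded to nvopencc (device code to PTX)
PTXAS_FLAGS="--ptxas-options -O0"    # forwarded to ptxas (PTX to machine code)

# File names are the ones from the original post; -gencode is simplified here.
NVCC_CMD="nvcc -gencode=arch=compute_20,code=sm_20 $HOST_FLAGS $OPENCC_FLAGS $PTXAS_FLAGS -o vectorAdd.cuo -c vectorAdd.cu"

# Print the command instead of running it, so it can be inspected first.
echo "$NVCC_CMD"
```

Keeping the flags in separate variables makes it easier to switch one stage back to optimized while leaving the others at -O0.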

RezaRob3 · April 14, 2011, 11:09am

Oh thanks! nvcc.doc probably should make that more clear.

I appreciate it.

King_Crimson · December 16, 2011, 5:54pm

What does the number 0 following -O mean? Does it disable optimization? I didn’t find any information about this in the NVCC manual.

RezaRob3 · December 16, 2011, 6:30pm

Heh… I would definitely believe so because that’s the tradition in gcc, but a quick glance at the nvcc manual didn’t turn up anything for me either.

You might try it and then look at the assembly code like this to see if it does what you want:

cuobjdump -sass test > machine-code.txt

(where test is an executable CUDA program.)

Do check out the other cuobjdump options by running ‘cuobjdump --help’.
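
One way to use this, sketched below, is to build the same program twice with different flags and compare the two dumps. The names test_O0 and test_O3 are hypothetical placeholders for those two builds, and the script skips itself if cuobjdump or the binaries are missing.

```shell
# Dump the machine code (SASS) embedded in a CUDA executable to a text file.
dump_sass() {
    cuobjdump -sass "$1" > "$2"
}

# test_O0 and test_O3 are hypothetical names for two builds of the same program.
if command -v cuobjdump >/dev/null 2>&1 && [ -f test_O0 ] && [ -f test_O3 ]; then
    dump_sass test_O0 sass_O0.txt
    dump_sass test_O3 sass_O3.txt
    # Identical dumps suggest the flag never reached the device compiler.
    if diff -q sass_O0.txt sass_O3.txt >/dev/null; then
        echo "SASS identical"
    else
        echo "SASS differs"
    fi
else
    echo "skipped: cuobjdump or test binaries not found"
fi
```

If the dumps differ, running diff on the two text files directly shows where the generated code changed.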
