Do global memory caching compiler options work on windows?

Lev · November 6, 2011, 3:17pm

Do global memory caching compiler options work on windows? I tried ca cg and see now difference in speed. Also ptx code is the same. I use runtime api. Has anybody get any difference with this option? Does it change ptx code with prefixes?

seibert · November 7, 2011, 1:53pm

I think this was mentioned in another forum thread. When you use those options, the cache usage modification is applied by ptxas. You won’t see the modified instructions when you look at the PTX output of nvcc because ptxas has not run yet. Compile a .cubin and use cuobjdump to see what instructions were actually generated after ptxas ran.

Lev · November 7, 2011, 4:36pm

Thanks! I was confused because of I see no difference in speed. Btw, if I use 1.2 target, but put this option, wonder, will this option be applyed if the code will run on Fermi card?

tera · November 7, 2011, 5:27pm

No. In order to apply this option to the code, it has to be run through ptxas. And if you run it through ptxas with compute capability 1.2 as target, it will not run on Fermi at all.

Lev · November 7, 2011, 5:33pm

I put 1.2 in compiler options and it runs on fermi. It generates code for 1.2 but it runs of fermi.
Hm, can I use this option with runtime api?

tera · November 7, 2011, 5:55pm

It runs because it uses the PTX representation of the code, which is not influenced by the ptxas options. ptxas-compiled code for compute capability 1.x will not run on Fermi.

Lev · November 7, 2011, 5:57pm

It will, it will be recoded etc. My program contains a few versions of ptx code, for different arhitectures. But I prefer to run 1.2 on fermi.

Lev · November 8, 2011, 2:23pm

Here are my options
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin\nvcc.exe" --keep -Xptxas -dlcm=ca -gencode=arch=compute_11,code="sm_11,compute_11" --machine 32 -ccbin “C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\bin” -use_fast_math -Xcompiler “/EHsc /W3 /nologo /Ox /Zi /MT -I"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v3.2\include” -maxrregcount=32 --ptxas-options=-v --compile

would i put cg, will it make affect?

tera · November 8, 2011, 5:09pm

Sorry, can’t help you with that one. Maybe someone else can explain that [font=“Courier New”]code=sm_11,compute_11[/font] means that both the PTX code for compute capability 1.1 (which has not run through ptxas and thus is not influenced by [font=“Courier New”]-Xptxas -dlcm=ca[/font]) and the cubin binary (which has run through ptxas and thus uses the cache operator specified by -dlcm=…, but does not run on Fermi) will be included in the binary file, but neither will help getting the right cache operator on Fermi.

By the way, isn’t [font=“Courier New”]-Xptxas -dlcm=ca[/font] the default anyway?

Lev · November 8, 2011, 5:23pm

Yes, it is default, i try to switch to another. This options are from visual studio. I put cuda options in a panel. Probably one more nvidia issue. In visual studio, you select which gpu to target. It creates a few different variants of ptx for different archtecture.

Topic		Replies	Views
Cuda Portability and SharedMem vs Cache CUDA Programming and Performance	9	11621	October 18, 2010
shared memory latency CUDA Programming and Performance	7	5885	May 18, 2011
PGI Acc on Fermi: Does the compiler disable caching? Legacy PGI Compilers	5	4851	March 22, 2011
Disabling cache on Fermi architectures Try to disable L1 and L2 CUDA Programming and Performance	11	9245	August 30, 2013
L1 Cache, L2 Cache and Shared memory in Fermi CUDA Programming and Performance	5	23478	March 21, 2011
__constant__ on Fermi being read through global mem CUDA Programming and Performance	4	2634	March 21, 2011
Memory Corruption on a Fermi-Class GPU Error only on Fermis, program works on non-Fermis. CUDA Programming and Performance	18	7108	July 22, 2011
Going to learn PTX and write a GPU compiler CUDA Programming and Performance	20	26792	January 19, 2009
How does cuda global memory's L1 caching work CUDA Programming and Performance	5	431	July 12, 2024
Turn off L1 caching on Fermi GPUs via the driver API? CUDA Programming and Performance	2	660	September 28, 2011

Do global memory caching compiler options work on windows?

Related topics