Setting cache control, when compiling into PTX with NVCC Xptxas -dlcm doesn't work

mallee · September 30, 2011, 8:37pm

Hello,

I want to compile my .cu kernel into PTX code with Visual Studio & Cuda SDK 4.0.

It generates the following commandline for compiling:

My problem is that -Xptxas -dlcm doesn’t work, because there is no difference between PTX output. But it is good (compiled properly by driver), if I change all of “ld.global” instructions to e.g. “ld.cg.global” manually.

Is there any solution for this problem?

Thank you!

njuffa · September 30, 2011, 10:50pm

As the name of the switch implies, -Xptxas -dlcm is a component-level switch for PTXAS, which is the compiler backend that translates PTX into machine code. I am not aware of a top-level (nvcc) compiler switch that lets one control the cache mode for load instructions emitted into PTX. I would suggest looking into PTX inline assembly.

mallee · September 30, 2011, 11:55pm

Finally, I’ve made a Powershell script to replace the instructions.

Lev · November 6, 2011, 9:53pm

Any news on this issue? Look like I have the same problem. Should this options generate different ptx output?

tera · November 7, 2011, 1:13am

No, because (as Norbert explained) [font=“Courier New”]-Xptxas[/font] marks an option to the PTX “assembler”, which isn’t invoked at all. The only useful thing Nvidia could do is to add a new option to the device code compiler ([font=“Courier New”]-Xopencc …[/font]) to issue [font=“Courier New”]ld.cg.global[/font] instead of [font=“Courier New”]ld.global[/font].

Topic		Replies	Views
Help me understand "-Xptxas -dlcm=cg" (take 2) CUDA Programming and Performance	1	7031	November 24, 2010
Compiling with non-caching loads CUDA Programming and Performance	0	976	November 23, 2011
Xptxas default cache modifier on global/generic load and store not working CUDA NVCC Compiler	10	993	July 30, 2024
Do global memory caching compiler options work on windows? CUDA Programming and Performance	9	7319	November 8, 2011
-Xptxas -dlcm=cg not available on windows ? CUDA Programming and Performance	0	803	July 14, 2011
Turn off L1 caching on Fermi GPUs via the driver API? CUDA Programming and Performance	2	684	September 28, 2011
nvcc -Xptxas doesn't seem to work ? CUDA Programming and Performance	5	5300	July 14, 2011
Disabling L1 cache in visual studio CUDA Programming and Performance	8	889	February 11, 2021
Compiler -Xptxas flag has no effect CUDA Programming and Performance	6	4720	December 19, 2011
Going to learn PTX and write a GPU compiler CUDA Programming and Performance	20	26916	January 19, 2009

Setting cache control, when compiling into PTX with NVCC Xptxas -dlcm doesn't work

Related topics