PTX JIT caching

plegresley · March 21, 2010, 9:34pm

In the Fermi compatibility guide it shows how to use the CUDA_FORCE_PTX_JIT environment variable to force JIT of the PTX code. It says that the cubin is cached by the driver and that the cache is even persistent across reboots. However, when I try this with the SDK examples it doesn’t seem to be caching at all:

[codebox][plegresl@bigbird release]$ export CUDA_FORCE_PTX_JIT=0

[plegresl@bigbird release]$ time ./simpleCUBLAS -noprompt

simpleCUBLAS test running…

PASSED

real 0m0.260s

user 0m0.170s

sys 0m0.087s

[plegresl@bigbird release]$ export CUDA_FORCE_PTX_JIT=1

[plegresl@bigbird release]$ time ./simpleCUBLAS -noprompt

simpleCUBLAS test running…

PASSED

real 1m13.848s

user 1m13.005s

sys 0m0.833s

[plegresl@bigbird release]$ time ./simpleCUBLAS -noprompt

simpleCUBLAS test running…

PASSED

real 1m13.830s

user 1m12.981s

sys 0m0.837s

[/codebox]

Is this the expected behavior? It seems like if it was working properly the third invocation would be as fast as the first.

plegresley · March 29, 2010, 6:54pm

Any answers?

tmurray · March 29, 2010, 7:20pm

uh, CUDA_FORCE_PTX_JIT is still set to 1 in the third invocation unless I am crazy.

(alternately: FORCE_PTX_JIT actually forces a compile, it does not use the JIT cache.)

plegresley · March 29, 2010, 7:40pm

I understand now. The documentation isn’t really correct:

When starting a CUDA application for the first time with the above environment flag, the CUDA driver will JIT compile the PTX for each CUDA kernel that is used into native CUBIN code. The generated CUBIN for the target GPU architecture is cached by the CUDA driver. This cache persists across system shutdown/restart events.

It specifically says “first time”, which implied to me that on subsequent calls the cache would be used. Thanks, Tim.

Topic		Replies	Views
Disabling driver code cache and minimize disk activity? CUDA Programming and Performance	5	2487	April 20, 2012
JIT .cu CUDA Programming and Performance	17	8180	October 13, 2010
CUDA Pro Tip: Understand Fat Binaries and JIT Caching Technical Blog	1	488	February 22, 2016
How set the system environment flag CUDA_FORCE_PTX_JIT=1 CUDA Programming and Performance	1	5597	February 24, 2010
Driver API: PTX or CUBIN modules? CUDA Programming and Performance	3	2478	July 9, 2009
Speed up initialization of CUDA About how to set the Device code translation cache CUDA Programming and Performance	7	15021	April 26, 2013
JIT compilation PTX to machine code may fail for certain GPUs ? CUDA Programming and Performance	4	6103	January 21, 2015
PTX files Teaching & Curriculum Support	1	1420	September 3, 2013
Turn off L1 caching on Fermi GPUs via the driver API? CUDA Programming and Performance	2	699	September 28, 2011
PTX JIT Dependencies CUDA Programming and Performance	0	592	July 2, 2012

PTX JIT caching

Related topics