Disabling the driver code cache and minimizing disk activity?

kaoken · April 18, 2012, 6:22pm

The driver caches the binary code generated after JIT compiling PTX. I want the driver to not cache the generated code, and preferably not do any disk activity at all. Is it possible?

njuffa · April 18, 2012, 7:01pm

To avoid JIT compilation of PTX, make sure your build incorporates SASS (machine code) for all desired target platforms into the fat binary it produces. When the driver loads a fat binary, it will first search the binary for SASS matching the currently bound GPU’s architecture. If it cannot find such code, it will look for suitable PTX for JIT compilation to SASS. If that fails, it returns an error.
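For anyone who finds the lookup order easier to follow as code, here is a rough sketch of it in plain C++. It is only a toy model, not the driver's actual logic: the `Image`, `ImageKind`, `LoadResult` types and the `load_fat_binary` function are hypothetical names, and the matching rules (exact architecture match for SASS, PTX usable if its virtual architecture is not newer than the device) are simplifying assumptions. The real driver's compatibility rules are more detailed.

```cpp
#include <cstddef>

// Toy model of the lookup order described above. All names here are
// hypothetical; this is NOT how the CUDA driver is implemented.
enum class ImageKind { Sass, Ptx };

struct Image {
    ImageKind kind;
    int arch;  // e.g. 20 stands for sm_20 (SASS) or compute_20 (PTX)
};

enum class LoadResult { UsedSass, JitCompiledPtx, Error };

// Assumption: SASS must match the device architecture exactly, and PTX is
// usable when its virtual architecture is not newer than the device.
constexpr LoadResult load_fat_binary(const Image* images, std::size_t count,
                                     int device_arch) {
    // Step 1: look for machine code built for this GPU architecture.
    for (std::size_t i = 0; i < count; ++i) {
        if (images[i].kind == ImageKind::Sass && images[i].arch == device_arch) {
            return LoadResult::UsedSass;  // no JIT, so nothing to cache
        }
    }
    // Step 2: fall back to PTX that can be JIT-compiled for this GPU.
    for (std::size_t i = 0; i < count; ++i) {
        if (images[i].kind == ImageKind::Ptx && images[i].arch <= device_arch) {
            return LoadResult::JitCompiledPtx;  // this path involves the code cache
        }
    }
    // Step 3: nothing suitable was embedded in the fat binary.
    return LoadResult::Error;
}
```

In this model, only the second branch would ever touch the JIT cache, which is why embedding SASS for every target architecture is the first thing to try.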

The CUDA C Programming Guide also mentions an environment variable CUDA_CACHE_DISABLE (section 3.1.1.2, “Just-in-Time Compilation”).
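If setting the variable in the launching shell is not convenient, one option may be to set it from inside the process. The sketch below assumes the driver reads CUDA_CACHE_DISABLE when it initializes, so the helper would have to run before the first CUDA call; I have not verified that assumption against every driver version. The function names `disable_cuda_code_cache` and `cuda_code_cache_disabled` are made up for illustration and are not part of CUDA.

```cpp
#include <cstdlib>
#include <cstring>
#include <stdlib.h>  // setenv (POSIX) / _putenv_s (Windows CRT)

// Hypothetical helper, not part of CUDA: sets CUDA_CACHE_DISABLE=1 for the
// current process. Assumption: the driver reads the variable during
// initialization, so call this before any CUDA API call.
bool disable_cuda_code_cache() {
#ifdef _WIN32
    return _putenv_s("CUDA_CACHE_DISABLE", "1") == 0;
#else
    return setenv("CUDA_CACHE_DISABLE", "1", /*overwrite=*/1) == 0;
#endif
}

// Hypothetical helper: reports whether the variable is currently set to "1"
// in this process's environment. It does not query the driver itself.
bool cuda_code_cache_disabled() {
    const char* value = std::getenv("CUDA_CACHE_DISABLE");
    return value != nullptr && std::strcmp(value, "1") == 0;
}
```

A typical use would be to call `disable_cuda_code_cache()` at the very top of `main()`. Exporting the variable in the environment that launches the application remains the simpler route.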

DrAnderson42 · April 18, 2012, 7:18pm

FYI, under some circumstances building in the SASS does not prevent all disk activity. I got it to work for small applications, but for larger applications the driver still tries to lock the cache index file for some reason.
I know because this triggers a bug in our NFS appliance that puts the file lock into an infinite loop. Thankfully, CUDA_CACHE_DISABLE=1 prevents it from even attempting to lock the index.

njuffa · April 18, 2012, 7:32pm

Interesting. I was not aware of this behavior, and do not know if there is a solid technical reason behind it or whether it may be unintentional. If this is troublesome, I would suggest filing a bug / enhancement request against the CUDA driver. At least the environment variable appears to provide a workaround.

DrAnderson42 · April 19, 2012, 1:23pm

Yeah, I did that. In my particular case, the bug was resolved as “Not an NV bug”. As far as I know, we’ve still got an open bug with the vendor of the NFS appliance, whose bug system appears to be a simple black hole. File locks are causing problems for many other applications too, so I’m OK with NVIDIA’s response. I would have preferred an update so that CUDA simply reported an error instead of waiting indefinitely for the lock.

kaoken · April 20, 2012, 1:03pm

Thanks everyone. I am now using CUDA_CACHE_DISABLE=1 and that appears to work.
