NVIDIA Developer Forums

Bypassing cache in Fermi

Accelerated Computing CUDA CUDA Programming and Performance

AlexanderMalishev August 28, 2010, 6:27am 16

Couple of complier intrinsic would solve this problem at the CUDA C level:

template T __load(T *address , LOAD_OPTIONS options);
template void __store(T *address , T value, STORE_OPTIONS options);

Topic		Replies	Views	Activity
Declare area of the on-card memory as non-cacheable? on card memory and it's use. CUDA Programming and Performance	13	8619	November 12, 2010
Switch off L1 cache CUDA Programming and Performance	2	3409	March 24, 2015
Disabling cache on Fermi architectures Try to disable L1 and L2 CUDA Programming and Performance	11	9259	August 30, 2013
Fermi: Cache configuration default at compile time From shared to L1 CUDA Programming and Performance	4	19525	April 16, 2010
disable L1 cache on Fermi GPU running OpenCL CUDA Programming and Performance	9	4116	September 4, 2011
L1 Cache, L2 Cache and Shared memory in Fermi CUDA Programming and Performance	5	23531	March 21, 2011
Disable cache per variable CUDA Programming and Performance	4	1278	October 21, 2015
Fermi L1 Cache coherent? CUDA Programming and Performance	5	14913	May 20, 2010
How to optimize for cache + shared memory on Fermi? CUDA Programming and Performance	8	3038	April 25, 2010
Is there a way to force something to always be in L1 cache? CUDA Programming and Performance cuda	5	835	November 1, 2023