GMEM loads: caching vs. non-caching

sWienke · March 6, 2013, 10:18am

Hi,
on Fermi GPUs, global memory accesses use caching loads by default (i.e. a granularity of 128 bytes). With CUDA, you can switch to non-caching loads by compiling with nvcc and “-Xptxas -dlcm=cg”.
With PGI’s OpenACC, I assume caching loads are also the default. Right? Is there any way to use non-caching loads with OpenACC (compiler flag, environment variable,…)?
Sandra
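
To make the granularity point above concrete, here is a rough sketch of how many bytes one warp's load moves under each mode. It is only an illustration: the 128-byte figure comes from the post, the 32-byte segment size for non-caching loads is an assumption based on how Fermi is usually documented, and the helper names (warp_addresses, bytes_transferred) are made up for this example.

```python
# Rough model of one warp's global load at two transfer granularities.
# Assumption: caching loads move 128-byte lines, non-caching loads move
# 32-byte segments. Helper names are hypothetical, not part of any CUDA API.

CACHING_BYTES = 128      # granularity stated in the post for caching loads
NON_CACHING_BYTES = 32   # assumed segment size for non-caching loads


def warp_addresses(base, stride_elems, elem_bytes=4, warp_size=32):
    """Byte address read by each lane for a strided access pattern."""
    return [base + lane * stride_elems * elem_bytes for lane in range(warp_size)]


def bytes_transferred(addresses, access_bytes, granularity):
    """Total bytes moved: every touched line/segment is fetched in full."""
    touched = set()
    for addr in addresses:
        first = addr // granularity
        last = (addr + access_bytes - 1) // granularity
        touched.update(range(first, last + 1))
    return len(touched) * granularity


if __name__ == "__main__":
    for stride in (1, 2, 32):
        addrs = warp_addresses(0, stride)
        ca = bytes_transferred(addrs, 4, CACHING_BYTES)
        cg = bytes_transferred(addrs, 4, NON_CACHING_BYTES)
        print(f"stride {stride:2d}: caching {ca} B, non-caching {cg} B")
```

In this model both modes move the same amount of data for contiguous accesses, while for widely scattered accesses (stride 32 here) the smaller granularity moves a quarter of the bytes. Whether that translates into a real speedup depends on how much reuse the L1 cache would otherwise provide.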

MatColgrove · March 8, 2013, 10:40pm

Hi Sandra,

We do have an experimental flag (-Mx,180,8) that will disable the L1 cache. You are welcome to give it a try. The caveat is that, since it has not been exposed at the user level, it is subject to change.

Mat

sWienke · March 10, 2013, 11:02am

Thanks Mat! I will give it a try and report my results.

istvanreguly · February 10, 2014, 9:44pm

Hi,

Apologies for resurrecting this thread. Since on the K40 we can once again use caching loads with dlcm=ca, I was wondering how I could enable this in the CUDA Fortran compiler. Could you help me with that, please?

Thank you,
Istvan

MatColgrove · February 10, 2014, 11:45pm

Hi Istvan,

We added this as the flag “-ta=tesla:noL1”.

Mat
