New cudaDeviceSetCacheConfig and cudaFuncSetCacheConfig mode

Scott314 · April 22, 2013, 4:11pm

GK110 supports a new cache config mode where the L1 and shared memory are split 32:32

references:
[url]Kepler Tuning Guide :: CUDA Toolkit Documentation

and slide 29 of:

[url]http://developer.download.nvidia.com/GTC/PDF/GTC2012/PresentationPDF/S0514-GTC2012-GPU-Performance-Analysis.pdf[/url]

However the cuda reference manual only list the older 16:48 and 48:16 split using cudaDeviceSetCacheConfig or cudaFuncSetCacheConfig, page 23 and 52 respectively in the Toolkit Reference Manual.

How do I set the 32:32 split?

allanmac · April 22, 2013, 6:13pm

For the Runtime API:

/**
 * CUDA function cache configurations
 */
enum __device_builtin__ cudaFuncCache
{
    cudaFuncCachePreferNone   = 0,    /**< Default function cache configuration, no preference */
    cudaFuncCachePreferShared = 1,    /**< Prefer larger shared memory and smaller L1 cache  */
    cudaFuncCachePreferL1     = 2,    /**< Prefer larger L1 cache and smaller shared memory */
    cudaFuncCachePreferEqual  = 3     /**< Prefer equal size L1 cache and shared memory */
};

… and for the Driver API:

/**
 * Function cache configurations
 */
typedef enum CUfunc_cache_enum {
    CU_FUNC_CACHE_PREFER_NONE    = 0x00, /**< no preference for shared memory or L1 (default) */
    CU_FUNC_CACHE_PREFER_SHARED  = 0x01, /**< prefer larger shared memory and smaller L1 cache */
    CU_FUNC_CACHE_PREFER_L1      = 0x02, /**< prefer larger L1 cache and smaller shared memory */
    CU_FUNC_CACHE_PREFER_EQUAL   = 0x03  /**< prefer equal sized L1 cache and shared memory */
} CUfunc_cache;

Scott314 · April 22, 2013, 6:27pm

Thanks!

Topic		Replies	Views
How to use cudaFuncSetCacheConfig() correctly ? One of the most anticipating features does not seem CUDA Programming and Performance	8	5587	June 23, 2010
changing L1 cache configuration using â€œcudaFuncSetCacheConfig" not working CUDA Programming and Performance	6	4527	February 3, 2012
Reconfiguring the cache / shared memory on a Fermi understanding the cudaFuncSetCacheConfig command CUDA Programming and Performance	19	34776	June 7, 2010
issue using cudaFuncSetCacheConfig setting cudaFuncSetCacheConfig(MyKernel, cudaFuncCachePreferShare CUDA Programming and Performance	1	932	November 16, 2010
cudaFuncSetCacheConfig - call overhead CUDA Programming and Performance	1	701	November 5, 2010
cudaFuncSetCacheConfig( Kernel1, cudaFuncCachePreferL1) No effect on shared memory CUDA Programming and Performance	1	1913	January 30, 2012
Set cache config from OpenCL CUDA Programming and Performance	0	1413	September 10, 2011
Cuda Portability and SharedMem vs Cache CUDA Programming and Performance	9	11673	October 18, 2010
Change L1 cache size in Fermi Legacy PGI Compilers	11	32351	May 25, 2011
What's the take on cudaFuncSetCacheConfig() these days? CUDA Programming and Performance	1	390	August 28, 2022

New cudaDeviceSetCacheConfig and cudaFuncSetCacheConfig mode

Related topics