Constants vs Texture Memory

sward · February 19, 2007, 3:06pm

I have been developing a protein folding simulation using CUDA, and have some questions about using the constants memory (and cache) vs the texture memory (and cache).

Specifically, I have a large amount of reference data which is required during the calculation process, which I have been able to get down to around 60 KB via very aggressive compression techniques. However, the access pattern in this data is random for the input data (due to the compressed nature), primarily consisting of reading single float values from nonadjacent memory locations. Is this pattern of access more likely to gain a benefit from the constant cache, or the 1d texture cache, or a balance across both (assuming the 8kb caches are seperate for each memory region?)?

Also, how much of the 64 KB constant cache is actually accessible? I find that when I use > 50 KB, that I will get inconsistent failures at launching the CUDA kernel, which can be reproduced just by a noop kernel which includes a large __constant array.

-Sean

Mark_Harris · February 19, 2007, 3:43pm

It sounds like the constant cache is probably your best bet since you have less than 60KB of data. Texture is a good possibility also. 2D Texture will help if you have (or you can create) good 2D locality in the addressing.

As for the issue with the constant arrays. Can you provide more information? Are you using cudaMemcpyToSymbol to download data into the constant array?

Thanks,
Mark

sward · February 19, 2007, 6:15pm

At present, it is very difficult to create 2d locality, as the data tables are highly compressed representations of a much larger matrix, which necessarily loses locality with the compaction. However, the relative cost of indexing into the original (8 MB +) data, given the latencies of global memory access seemed to be worth the tradeoff of unpacking a dataset that could fit within a faster memory region…

I was loading the static constant data by defining a static constant table via an include. For example:

constant const int d_nLRLen = 4;

constant const int d_nLRIndex[21][21] = {

{ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0 }, (rest removed)

as defined in a .cu file that was #included in the main kernel .cu file. There are no samples in the SDK which utilize the __constant keyword, so if I should be using MemcpyToSymbol instead, I’ll take a look at that. Changing the size of the matrix included in this fashion will enable the application to run correctly, with the only variable being the size of the constant matrix. I provided a full sample file via NVIDIA bug ID 288638, since I don’t seem to have permission to attach a .cu file to a post.

-Sean

sward · February 20, 2007, 3:39pm

One other followup clarification, are the per MPP group 8kb cache’s for constants and textures seperate memory pools (in which case an application could gain a benefit from using both memory regions)?

-Sean

Mark_Harris · February 21, 2007, 9:51am

Yes, they are separate.

Mark

Topic		Replies	Views
Constant or Texture Memory Which is better for my application? CUDA Programming and Performance	3	2467	November 16, 2007
Constant Arrays CUDA Programming and Performance	13	30941	November 24, 2007
How to choose the good memory CUDA Programming and Performance	2	4334	December 7, 2007
Speed of Constant memory over Textures CUDA Programming and Performance	2	7042	December 24, 2009
Use of constant caches for large data? CUDA Programming and Performance	10	27146	February 23, 2007
Really slow constant memory Random access to constant memory CUDA Programming and Performance	13	4719	December 4, 2009
the worse performance using texture memory any ideas? CUDA Programming and Performance	4	1492	July 5, 2011
Constant memory usage and comparison against textures CUDA Programming and Performance	9	4238	December 24, 2008
question about texture and constant caches on the gtx 200 CUDA Programming and Performance	1	4354	April 2, 2010
Which cache for irregular access to array of constants? CUDA Programming and Performance	5	639	September 6, 2018

Constants vs Texture Memory

Related topics