Using Texture Memory for Matrix Data?

lain_iw · March 25, 2024, 6:36pm

The matrix multiplication sample in CUDA C++ Programming Guide uses 1D global memory to maintain the matrix data:

// Host code
// Load A and B to device memory
Matrix d_A;
d_A.width = A.width; d_A.height = A.height;
size_t size = A.width * A.height * sizeof(float);
cudaMalloc(&d_A.elements, size);
cudaMemcpy(d_A.elements, A.elements, size,
            cudaMemcpyHostToDevice);
// ...

However, texture/surface memory is more efficient for 2D data according to the doc:

The texture cache is optimized for 2D spatial locality, so threads of the same warp that read texture or surface addresses that are close together in 2D will achieve best performance. Also, it is designed for streaming fetches with a constant latency

So is it better to store 2D data like matrix in tex2D objects? Is there any example?

Robert_Crovella · March 25, 2024, 6:41pm

There are CUDA sample codes that demonstrate the usage of 2D textures, such as this one

Whether or not texture provides any benefit is not something that can be answered simply yes or no. It will likely depend on problem sizes, and may also depend on GPU type.

If you want the fastest matrix-multiply performance, the usual recommendation is to use CUBLAS. Don’t write the code yourself.

You can find other somewhat similar questions, with a bit of searching.

Topic		Replies	Views
Matrix multiplication using texture CUDA Programming and Performance	6	4818	April 17, 2008
Constant or Texture Memory Which is better for my application? CUDA Programming and Performance	3	2384	November 16, 2007
Use texture memory or global memory in this case? CUDA Programming and Performance	3	1980	August 13, 2016
Texture memory when to use ? CUDA Programming and Performance	6	20142	October 7, 2009
CUDA 2D memory Vs 1D Memory Speed comparision: Texture Mem Vs Gloval vs Shared CUDA Programming and Performance	0	8799	July 23, 2009
utilize texture memory How to use the texture more effectively? CUDA Programming and Performance	0	2813	July 4, 2008
Memory performance in image processing example CUDA Programming and Performance	9	1600	March 24, 2011
For what case should I use texture memory? CUDA Programming and Performance	8	2653	May 26, 2010
CUDA texture memory performance CUDA Programming and Performance	4	33514	January 13, 2009
Texture Memory? How do you use it? CUDA Programming and Performance	1	5836	December 27, 2009

Using Texture Memory for Matrix Data?

Related topics