Matrix Multiplication with Shared Memory


silentlearner, September 28, 2009, 3:14pm

I have written a matrix multiplication kernel that uses shared memory, based on the example in the CUDA programming guide. I would like to extend it to matrices of arbitrary size. How can I do this in the shared memory version, given that it multiplies the matrices block by block?

Thanks in advance.
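For reference, one common way to handle arbitrary sizes in a tiled kernel is to round the grid size up and guard every global memory access, so that out-of-range threads load zeros into the shared tiles and skip the final store. The sketch below is not the programming guide's code. The names `matMulTiled`, `matMulHost`, and `TILE`, the row-major layout, and the tile width of 16 are all assumptions made for illustration, and error checking of the CUDA calls is left out for brevity.

```cuda
#include <cuda_runtime.h>

#define TILE 16  // assumed tile width (hypothetical name, not from the guide)

// Computes C (M x N) = A (M x K) * B (K x N); all matrices row-major (assumed).
__global__ void matMulTiled(const float* A, const float* B, float* C,
                            int M, int K, int N)
{
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;  // row of C for this thread
    int col = blockIdx.x * TILE + threadIdx.x;  // column of C for this thread
    float acc = 0.0f;

    // Round up so the last, partially filled tile along K is still visited.
    int numTiles = (K + TILE - 1) / TILE;
    for (int t = 0; t < numTiles; ++t) {
        int aCol = t * TILE + threadIdx.x;
        int bRow = t * TILE + threadIdx.y;

        // Guarded loads: elements outside the matrices are replaced by zero,
        // which contributes nothing to the dot product.
        As[threadIdx.y][threadIdx.x] =
            (row < M && aCol < K) ? A[row * K + aCol] : 0.0f;
        Bs[threadIdx.y][threadIdx.x] =
            (bRow < K && col < N) ? B[bRow * N + col] : 0.0f;

        // Every thread in the block reaches both barriers, including
        // threads that fall outside C, so the barriers are safe.
        __syncthreads();
        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();
    }

    // Guarded store: only threads that map to a real element of C write.
    if (row < M && col < N)
        C[row * N + col] = acc;
}

// Hypothetical host wrapper: copies inputs to the device, launches the
// kernel on a rounded-up grid, and copies the result back.
void matMulHost(const float* hA, const float* hB, float* hC,
                int M, int K, int N)
{
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    cudaMalloc((void**)&dA, sizeof(float) * M * K);
    cudaMalloc((void**)&dB, sizeof(float) * K * N);
    cudaMalloc((void**)&dC, sizeof(float) * M * N);
    cudaMemcpy(dA, hA, sizeof(float) * M * K, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB, sizeof(float) * K * N, cudaMemcpyHostToDevice);

    dim3 block(TILE, TILE);
    dim3 grid((N + TILE - 1) / TILE, (M + TILE - 1) / TILE);  // ceiling division
    matMulTiled<<<grid, block>>>(dA, dB, dC, M, K, N);

    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(hC, dC, sizeof(float) * M * N, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
}
```

The important detail is that the bounds checks wrap only the loads and the store, never the `__syncthreads()` calls. Returning early from out-of-range threads before the loop would leave some threads in a block skipping the barrier. An alternative with the same effect is to pad the matrices with zeros up to a multiple of the tile width on the host and run the unmodified kernel.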
