Simple Matrix-Vector Multiplication

dinaharchery · April 5, 2010, 4:49pm

Hello all,

I am sure this is a very simple problem but my brain does not want to work properly today. Given a square matrix and vector I would like to perform matrix-vector operation using CUDA. I found some examples in CUDA SDK for Matrix multiplication where the dimensions of matrices are multiples of defined block size, I would like a more generalized operation if possible. Are there any examples/tutorials for the multiplication of [M-by-N] matrix by a [N-by-P] vector?

Thank you.

LSChien · April 6, 2010, 3:06pm

If you want to know matrix multiplication on general dimension, you can check this thread
[url=“The Official NVIDIA Forums | NVIDIA”]The Official NVIDIA Forums | NVIDIA

I write a report on SGEMM and discuss how to extend SGEMM to arbitrary dimension in section 8

dinaharchery · April 6, 2010, 7:22pm

Wow, great work.

Thank you

dinaharchery · April 6, 2010, 7:55pm

Another quick question. Given that I am just trying to really learn CUDA with regards to matrix-vector operation(s) is there a simple CUDA code for matrix-vector multiplication (does not have to be all that efficient)? I like the code you built but it seems like a bit much for what I need - I just want to learn the basics of a matrix-vector multiplication on GPU.

Thanks again.

LSChien · April 8, 2010, 4:09pm

for matrix-vector multiplication, you can look at reduction example in SDK.

for matrix-matrix multiplication, matrixMul in SDK uses shared memory but only works for specific dimension.

You can try to extend matrixMul in SDK to arbitrary dimension.

if you want to know how to use registers instead of shared memory, then

I strongly recommand volkov’s paper and his code, you can download them in the thread

http://forums.nvidia.com/index.php?showtopic=89084

Jimmy_Pettersson · April 9, 2010, 7:47pm

Here we shared thoughts, ideas, and some code on GEMV: [url=“The Official NVIDIA Forums | NVIDIA”]http://forums.nvidia.com/index.php?showtop...62330&st=20[/url]

dinaharchery · April 16, 2010, 4:50pm

Thanks very much for all the great information.

I have another question - I hope it is not too far off topic. Can Gaussian Elimination be implemented on CUDA? I know that it contains a lot of back/forward substitutions which don’t seem to leave a lot of room for parallelism via CUDA but it would be interesting to see.

dinaharchery · April 16, 2010, 4:50pm

Thanks very much for all the great information.

I have another question - I hope it is not too far off topic. Can Gaussian Elimination be implemented on CUDA? I know that it contains a lot of back/forward substitutions which don’t seem to leave a lot of room for parallelism via CUDA but it would be interesting to see.

Topic		Replies	Views
Is to possible to speed up multiple matrix per vector multiplication using CUDA? CUDA Programming and Performance	2	1444	April 12, 2010
Vector[1xN] * Matrix[NxM] How would you set it up ? CUDA Programming and Performance	3	4400	October 13, 2008
Matrix multiplication CUDA Programming and Performance	3	3839	March 6, 2008
matrix multiplication multiplication of two matrix CUDA Programming and Performance	4	1709	August 6, 2010
Matrix Multiplication with Shared Memory CUDA Programming and Performance	0	1367	September 28, 2009
Simple Matrix - Vector Multiplication CUDA Programming and Performance	3	1259	December 7, 2011
CUBLAS matrix-vector multiplication CUDA Programming and Performance	14	10241	January 20, 2010
Generalized SGMM CUDA Programming and Performance	5	1667	June 14, 2010
vector matrix multiplication, share my code:) CUDA Programming and Performance	1	5865	October 11, 2011
Newbie:Trying Matrix Vector Multiplication CUDA Programming and Performance	3	4248	November 10, 2008

Simple Matrix-Vector Multiplication

Related topics