How to efficiently repeat a vector to form a matrix in CUDA?

ilovelyy · August 22, 2014, 6:23pm

I want to repeat a vector to form a matrix in CUDA while avoiding too many memory copies. Both the vector and the matrix are allocated on the GPU.

For example:

I have a vector: a = [1 2 3 4]

expand it to a matrix: b = [1 2 3 4; 1 2 3 4; … 1 2 3 4]

What I have tried is assigning each element of b individually, but this involves a lot of GPU-to-GPU memory copies.

I know this is easy in MATLAB (using repmat), but how can I do it efficiently in CUDA? I didn’t find any routine for it in cuBLAS.

little_jimmy · August 23, 2014, 7:28am

And do you wish to start from the host or from the device?

From the host, I would think that a cudaMemcpy for each matrix row would be an elementary solution.
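A minimal sketch of that host-side approach, assuming a row-major float matrix; the helper names `repeat_rows_memcpy` and `demo_repeat_rows_memcpy` are made up for illustration, and error checking in the demo is left out for brevity:

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical helper: copy the device vector d_a (n floats) into each of the
// `rows` rows of the row-major device matrix d_b, one cudaMemcpy per row.
cudaError_t repeat_rows_memcpy(float *d_b, const float *d_a, int rows, int n)
{
    for (int r = 0; r < rows; ++r) {
        cudaError_t err = cudaMemcpy(d_b + (size_t)r * n, d_a,
                                     n * sizeof(float),
                                     cudaMemcpyDeviceToDevice);
        if (err != cudaSuccess) return err;
    }
    return cudaSuccess;
}

// Hypothetical demo: upload `a`, repeat it `rows` times, download the matrix.
std::vector<float> demo_repeat_rows_memcpy(const std::vector<float> &a, int rows)
{
    int n = (int)a.size();
    float *d_a = nullptr, *d_b = nullptr;
    cudaMalloc(&d_a, n * sizeof(float));
    cudaMalloc(&d_b, (size_t)rows * n * sizeof(float));
    cudaMemcpy(d_a, a.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    repeat_rows_memcpy(d_b, d_a, rows, n);

    std::vector<float> b((size_t)rows * n);
    cudaMemcpy(b.data(), d_b, b.size() * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_b);
    return b;
}
```

This issues one copy per row, so it is probably only reasonable when the number of rows is small.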

From the device, you can use a number of thread blocks to distribute the vector to the different matrix rows; use a single block to read the vector into shared memory and then distribute it to each matrix row; or even use dynamic parallelism to have a single thread initiate a number of device-side cudaMemcpyAsync calls, similar to the host solution mentioned above.
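The first of those device-side options might look roughly like this. It is only a sketch, assuming a row-major float matrix and a 2D launch with one thread per output element; `repeat_rows_kernel` and `demo_repeat_rows_kernel` are hypothetical names, and error checking is omitted:

```cuda
#include <cuda_runtime.h>
#include <vector>

// Each thread writes one element of the row-major matrix b (rows x n).
// All threads that share a column index read the same a[col].
__global__ void repeat_rows_kernel(float *b, const float *a, int rows, int n)
{
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    if (row < rows && col < n)
        b[(size_t)row * n + col] = a[col];
}

// Hypothetical demo: upload `a`, launch the kernel, download the matrix.
std::vector<float> demo_repeat_rows_kernel(const std::vector<float> &a, int rows)
{
    int n = (int)a.size();
    float *d_a = nullptr, *d_b = nullptr;
    cudaMalloc(&d_a, n * sizeof(float));
    cudaMalloc(&d_b, (size_t)rows * n * sizeof(float));
    cudaMemcpy(d_a, a.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    // Consecutive threads in x write consecutive addresses (coalesced stores).
    // Note: grid.y is limited to 65535 blocks, so a very tall matrix would
    // need a loop over rows inside the kernel instead.
    dim3 block(32, 8);
    dim3 grid((n + block.x - 1) / block.x, (rows + block.y - 1) / block.y);
    repeat_rows_kernel<<<grid, block>>>(d_b, d_a, rows, n);

    // This blocking copy waits for the kernel to finish first.
    std::vector<float> b((size_t)rows * n);
    cudaMemcpy(b.data(), d_b, b.size() * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_b);
    return b;
}
```

Everything happens in a single launch, with no per-row copy calls.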

tera · August 23, 2014, 8:29am

If you want to go beyond something that just copies the data to duplicate it, the most efficient way is to not copy at all. Change the code that reads the matrix so that it reads the vector wherever a vector is enough.
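To illustrate, suppose (purely as an assumption, since the thread does not say what consumes the matrix) that the repeated matrix b is only needed in order to add it element-wise to another row-major matrix c. The consumer could then index the vector by column and never build b at all. `add_row_vector` and `demo_add_row_vector` are hypothetical names:

```cuda
#include <cuda_runtime.h>
#include <vector>

// Adds the vector a (n floats) to every row of the row-major matrix c.
// Equivalent to c += repmat(a, rows, 1), but the repeated matrix is never
// materialized: a[col] stands in for b[row * n + col].
__global__ void add_row_vector(float *c, const float *a, int rows, int n)
{
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    if (row < rows && col < n)
        c[(size_t)row * n + col] += a[col];
}

// Hypothetical demo: c is rows x n (row-major) on the host; returns c + a per row.
std::vector<float> demo_add_row_vector(std::vector<float> c,
                                       const std::vector<float> &a, int rows)
{
    int n = (int)a.size();
    float *d_a = nullptr, *d_c = nullptr;
    cudaMalloc(&d_a, n * sizeof(float));
    cudaMalloc(&d_c, c.size() * sizeof(float));
    cudaMemcpy(d_a, a.data(), n * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(d_c, c.data(), c.size() * sizeof(float), cudaMemcpyHostToDevice);

    dim3 block(32, 8);
    dim3 grid((n + block.x - 1) / block.x, (rows + block.y - 1) / block.y);
    add_row_vector<<<grid, block>>>(d_c, d_a, rows, n);

    cudaMemcpy(c.data(), d_c, c.size() * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_c);
    return c;
}
```

This saves both the memory for b and the bandwidth spent writing and re-reading it.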

sBc-Random · August 24, 2014, 1:46pm

I can give you a very basic way to turn it into a matrix:
have a look at the ger function in cuBLAS.

Apply it to 1vector * transpose(v) or v * transpose(1vector), where 1vector is a vector of ones (which one depends on the orientation).
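A rough sketch of the ger idea using cublasSger from the cuBLAS v2 API (link with -lcublas). It assumes single precision and a row-major result; `demo_repeat_rows_ger` is a hypothetical name and error checking is omitted. Two things to watch: ger accumulates into A, so the matrix has to be zeroed first, and cuBLAS is column-major, so the row-major rows x n matrix is treated as a column-major n x rows matrix:

```cuda
#include <cublas_v2.h>
#include <cuda_runtime.h>
#include <vector>

// Hypothetical demo: build the row-major rows x n matrix whose rows all equal a.
std::vector<float> demo_repeat_rows_ger(const std::vector<float> &a, int rows)
{
    int n = (int)a.size();
    std::vector<float> ones(rows, 1.0f);
    size_t bytes = (size_t)rows * n * sizeof(float);

    float *d_a = nullptr, *d_ones = nullptr, *d_b = nullptr;
    cudaMalloc(&d_a, n * sizeof(float));
    cudaMalloc(&d_ones, rows * sizeof(float));
    cudaMalloc(&d_b, bytes);
    cudaMemcpy(d_a, a.data(), n * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(d_ones, ones.data(), rows * sizeof(float), cudaMemcpyHostToDevice);

    // ger computes A = alpha * x * y^T + A, so A must start at zero.
    cudaMemset(d_b, 0, bytes);

    cublasHandle_t handle;
    cublasCreate(&handle);
    const float alpha = 1.0f;
    // Column-major view: A is n x rows with lda = n, x = a, y = ones.
    // Every column of A equals a, which is the same memory layout as a
    // row-major rows x n matrix whose rows all equal a.
    cublasSger(handle, n, rows, &alpha, d_a, 1, d_ones, 1, d_b, n);
    cublasDestroy(handle);

    std::vector<float> b((size_t)rows * n);
    cudaMemcpy(b.data(), d_b, bytes, cudaMemcpyDeviceToHost);
    cudaFree(d_a);
    cudaFree(d_ones);
    cudaFree(d_b);
    return b;
}
```

Compared with a plain copy kernel, this needs an extra vector of ones and a zero-fill, and it multiplies every element by 1, so it is convenient rather than optimal.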

ilovelyy · August 24, 2014, 6:30pm

That’s brilliant, man!
