Multiply 2D matrix by const vector

veredz72 · September 26, 2018, 9:56pm

Hello,

I have a 2D matrix. Each element is complex float.
Row is consecutive in memory.

Each row should be multiplied by a const vector sample by sample. (.* in Matlab)
In TX2 there is ~50KB of shared memory per block.

Can I launch the kernel and pass as parameter a complex float vector that will be copied to this shared memory ?
Currently this vector is located in global memory.

Thank you,
Zvika

eyalhir74 · September 27, 2018, 6:36am

Hi,
You can. Not sure it would be a good idea as if you use all/most of the shared memory your occupancy can go down too much.

If all threads use the same vector (items in the vector) copying to constant memory would probably be better. Otherwise try to use __ldg. Anyway test which works best for you.

Eyal

veredz72 · September 27, 2018, 7:53am

Hi Eyal,

Thank you for your reply.
Is it possible to copy data to constant memory before running the kernel ?

Best regards,
Zvika

eyalhir74 · September 27, 2018, 8:07am

Yup, you should copy to the constant memory before running the kernel.
Use the cudaMemcpyToSymbol API.

Topic		Replies	Views
2D float matrix x vector: global vs. shared memory: CUDA Programming and Performance	1	548	October 1, 2018
Advice - Complex Matrix-Vector Multiplication CUDA Programming and Performance	3	5619	May 12, 2009
How to save a big data(4M, larger than constant memory) wihch is frequently used by every thread lik CUDA Programming and Performance	4	770	October 26, 2013
Help me Cuda on Matlab CUDA Programming and Performance	1	1211	August 1, 2010
Copy data into shared memory CUDA Programming and Performance	6	1350	May 28, 2009
Using Texture Memory for Matrix Data? CUDA Programming and Performance	1	179	March 25, 2024
How to efficeint repeat a vector to a matrix in cuda? CUDA Programming and Performance	4	2127	August 24, 2014
CUDA memory management CUDA Programming and Performance	3	2428	February 21, 2012
Matrix into vector CUDA Programming and Performance	3	447	September 30, 2016
2d array (_device_ or _constant_) CUDA Programming and Performance	1	559	July 15, 2013

Multiply 2D matrix by const vector

Related topics