Hi there,
I appreciate you sharing your knowhow. :)
I am trying to make a code for below objective.
1) Copy two matrices to device memory.
> One is {Lx(MxN)} matrix, Another is Lx{(Nx1)}.
2) Calculate L times for loop of matrix calculation.
> matrix calculation: L times of {(MxN) x (Nx1)}
3) Display the result (LxN)
4) User can change L iteraion times.
<b> It affects to not only two matrices, but the number of iterations of for loop.</b>
Is this possible to realize by CUDA?
So far, I have seen some codes just calculating direct which means set values without user input.
Previous question, someone(kbam, thank you!) gave me a comment.
nBody sample code helps me.
I checked it. It seems that it does not change the matrix size or copy matrix to device memory.
I am a newbie of CUDA.
At this moment, I really want to study CUDA for lots of huge matrix computation algorithm or complicated calculation problem with taking long time for calculating.
Thank you in adavance.
Sincerely,
Albert