parallel computations with CUDA

Hi all,
I have a question and I would appreciate it if you could share your opinions: is it possible to run 2 or more grids concurrently, in parallel? E.g. I have 10 matrices which I want to multiply together; is it possible to define 5 separate grids and simultaneously multiply these matrices 2 by 2?

A card can only run one grid at a time. But why can’t you do it in one grid?

Because I want to do parallel computations: suppose multiplying each pair of matrices takes 1 p.u. of time. Doing the multiplications one after another, I will need 9 p.u. of time to reduce 10 matrices to a single product. However, if I could run the kernels in parallel, a pairwise reduction tree would need only 4 p.u.: 5 products in the first round, then 2, then 1, then 1.

Isn't there any way to change the index of threads or blocks at the beginning of a kernel?

You can calculate your matrix-specific index if you pass a parameter with size-info of your matrices to the kernel.

Like index = blockDim.x * blockIdx.x + threadIdx.x;

index = index % size_of_matrix; // CUDA C has no rem(); use the modulo operator %

In this case can I run 2 kernels in parallel and simultaneously?

No, but you can have a single kernel that calculates many multiplications in parallel. You can compute those 10 matrices in a single kernel.
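One way to express this as a single launch is to use the z dimension of the grid to select which product a block works on. This is only a sketch under assumed names (N, batch, row-major square matrices packed back to back), and the naive inner loop is for clarity rather than performance:

```cuda
// Sketch: one launch computes `batch` independent N x N products,
// C[b] = A[b] * B[b]. blockIdx.z picks which product a block handles.
__global__ void batchedMatMul(const float *A, const float *B, float *C,
                              int N, int batch)
{
    int b   = blockIdx.z;                          // which product
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (b >= batch || row >= N || col >= N) return;

    const float *Ab = A + (size_t)b * N * N;       // b-th input pair
    const float *Bb = B + (size_t)b * N * N;
    float sum = 0.0f;
    for (int k = 0; k < N; ++k)                    // naive dot product
        sum += Ab[row * N + k] * Bb[k * N + col];
    C[(size_t)b * N * N + row * N + col] = sum;
}

// Launch: a 3D grid whose z extent is the number of independent products.
// dim3 block(16, 16);
// dim3 grid((N + 15) / 16, (N + 15) / 16, batch);
// batchedMatMul<<<grid, block>>>(dA, dB, dC, N, batch);
```

All 5 (or however many) products then run concurrently within the one grid, which is exactly the parallelism asked about, without needing multiple simultaneous grids.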

Depending on exactly how you’re multiplying them together, you could combine them into a larger matrix and run calculations on that instead (possibly by using cuBLAS). You could then separate out the original matrices from the larger “container” matrix after your calculations are completed.
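As an aside, cuBLAS also exposes a batched GEMM directly, so packing into one big container matrix may not even be necessary. A hedged host-side sketch (handle creation, device allocation, and error checking are assumed to happen elsewhere; the pointer arrays hold one device pointer per matrix):

```cuda
#include <cublas_v2.h>

// Sketch: multiply `batch` pairs of N x N matrices with one cuBLAS call.
// d_Aarray / d_Barray / d_Carray are device arrays of device pointers,
// one pointer per matrix; their setup is assumed here.
void batchedGemm(cublasHandle_t handle, int N, int batch,
                 const float **d_Aarray, const float **d_Barray,
                 float **d_Carray)
{
    const float alpha = 1.0f, beta = 0.0f;
    // One call performs every N x N product in the batch
    // (cuBLAS uses column-major storage).
    cublasSgemmBatched(handle, CUBLAS_OP_N, CUBLAS_OP_N,
                       N, N, N,
                       &alpha,
                       d_Aarray, N,
                       d_Barray, N,
                       &beta,
                       d_Carray, N,
                       batch);
}
```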

Hi, I am back again

Would you please make this clearer? Do you have a specific method in mind?

For example, I have matrices A, B, C and D as inputs to a kernel. Is it possible to compute AB and CD concurrently (in parallel in time) inside this kernel?