Are there requirements for choosing which parts of the code are going to be parallelized?
if (code.can_be_parallelized())
    code.parallelize()
I know, this isn’t the answer you are looking for. But every bit of code has to be evaluated on a case-by-case basis. There are no hard and fast rules about what can and cannot be parallelized. Even seemingly 100% serial operations like:
for (int i = 1; i < n; i++)
    a[i] += a[i-1];
have very efficient parallel implementations (the example I gave is called a “scan”).
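For what it’s worth, here is a minimal sketch of that loop running as a parallel scan on the GPU, using Thrust’s inclusive_scan (this assumes a CUDA toolkit with Thrust available; the data size and values are made up for illustration):

// Minimal sketch: the serial loop above as a parallel prefix sum (scan) on the GPU.
// Assumes a CUDA toolkit with Thrust; build with nvcc.
#include <thrust/device_vector.h>
#include <thrust/scan.h>
#include <cstdio>

int main() {
    const int n = 1 << 20;
    thrust::device_vector<float> a(n, 1.0f);   // example data: all ones

    // Equivalent to: for (int i = 1; i < n; i++) a[i] += a[i-1];
    thrust::inclusive_scan(a.begin(), a.end(), a.begin());

    printf("a[n-1] = %f\n", (float)a[n - 1]);  // expect n = 1048576
    return 0;
}

Under the hood this runs as a multi-pass parallel scan, which is why the apparent loop-carried dependence is not a show-stopper.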
There is a general rule of thumb for answering the question “is it worth parallelizing on the GPU?”: yes, if you can run at least ~5,000 independent threads. Of course, every rule has its exceptions. Say you have an algorithm with steps A, B, C, D. A, B, and D parallelize nicely on the GPU with tens of thousands of threads, but C only runs a few hundred. It may still be worth putting C on the GPU, even though it might be slower than host code, just to avoid copying all the memory from the device to the host and then back again.
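To make the A/B/C/D point concrete, here is a hypothetical sketch (the kernels and sizes are made up, not from any real codebase) where the poorly parallel step C still runs on the device so the data never leaves GPU memory:

// Hypothetical sketch: keep the whole A -> B -> C -> D pipeline resident on the
// device, even though step C only exposes a few hundred threads, to avoid
// device->host->device copies around C. Kernel names and work are placeholders.
#include <cuda_runtime.h>

__global__ void kernelA(float* d, int n) { int i = blockIdx.x * blockDim.x + threadIdx.x; if (i < n) d[i] += 1.0f; }
__global__ void kernelB(float* d, int n) { int i = blockIdx.x * blockDim.x + threadIdx.x; if (i < n) d[i] *= 2.0f; }
__global__ void kernelC(float* d, int n) { int i = threadIdx.x; if (i < n && i < 256) d[i] -= 0.5f; }  // only a few hundred threads of work
__global__ void kernelD(float* d, int n) { int i = blockIdx.x * blockDim.x + threadIdx.x; if (i < n) d[i] *= d[i]; }

void run_pipeline(float* d_data, int n) {
    const int block = 256;
    const int grid  = (n + block - 1) / block;

    kernelA<<<grid, block>>>(d_data, n);
    kernelB<<<grid, block>>>(d_data, n);

    // The alternative is cudaMemcpy to the host, running C on the CPU, and
    // copying back. Even if kernelC alone is slower than host code, skipping
    // those two transfers can make the whole pipeline faster.
    kernelC<<<1, block>>>(d_data, n);

    kernelD<<<grid, block>>>(d_data, n);
}

int main() {
    const int n = 1 << 20;
    float* d_data = nullptr;
    cudaMalloc(&d_data, n * sizeof(float));
    cudaMemset(d_data, 0, n * sizeof(float));
    run_pipeline(d_data, n);
    cudaDeviceSynchronize();
    cudaFree(d_data);
    return 0;
}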
Yes, the slow ones :D
But seriously, first you need to find the bottlenecks of your current software (assuming you have something running), and then start thinking about how to parallelize it.
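As a starting point (before reaching for a full profiler such as Nsight Systems), here is a minimal sketch of simply timing the candidate hot spots with std::chrono; stepA and stepB are placeholders standing in for your own routines, not real APIs:

// Minimal sketch: time candidate hot spots before deciding what to parallelize.
// stepA/stepB are placeholders for your own code; the work here is dummy arithmetic.
#include <chrono>
#include <cstdio>

double stepA() { double s = 0; for (int i = 0; i < 10000000; i++) s += i * 0.5; return s; }
double stepB() { double s = 0; for (int i = 0; i < 50000000; i++) s += 1e-7; return s; }

template <typename F>
double time_ms(F&& f) {
    auto t0 = std::chrono::steady_clock::now();
    f();
    auto t1 = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}

int main() {
    double sink = 0;   // keeps the compiler from discarding the dummy work
    printf("stepA: %.2f ms\n", time_ms([&] { sink += stepA(); }));
    printf("stepB: %.2f ms\n", time_ms([&] { sink += stepB(); }));
    printf("(checksum: %g)\n", sink);
    return 0;
}

Whichever step dominates the wall-clock time is the one worth thinking about parallelizing first.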