For sequential programming (with no data parallelism possible) I know it is better to use a CPU than a GPU. But I need the result from this sequential part passed to a kernel, and this has to be done iteratively many times. So I want to perform this sequential part on the GPU as well (in a single thread), just to minimize the memory transfers. My question is: is the performance of a single GPU thread running sequential code on par with a CPU?
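To make the pattern concrete, here is a minimal sketch of what I mean, assuming a single-block launch and hypothetical sequential/parallel steps (the function and variable names are made up):

    // One persistent kernel: thread 0 does the sequential step, then the whole
    // block consumes its result, so no host<->device copy is needed per iteration.
    __global__ void iterate(float *data, int n, int iterations)
    {
        __shared__ float seqResult;                 // result of the sequential part

        for (int it = 0; it < iterations; ++it) {
            if (threadIdx.x == 0) {
                // sequential part: only thread 0 executes it
                float acc = 0.0f;
                for (int i = 0; i < n; ++i)
                    acc += data[i];
                seqResult = acc;
            }
            __syncthreads();                        // make seqResult visible to all threads

            // parallel part: every thread uses the sequential result
            for (int i = threadIdx.x; i < n; i += blockDim.x)
                data[i] = data[i] * 0.5f + seqResult / n;
            __syncthreads();                        // finish before the next iteration
        }
    }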
If your code is purely sequential (i.e. it executes on just one scalar processor), the performance will fall far behind a modern processor core (e.g. an Intel Core microarchitecture core in a Core 2 Duo), by a factor of roughly 5x to 20x.
Worse, you will need to change the algorithm to use shared memory instead of global memory as much as possible, because memory accesses that are cached on a modern CPU are not cached in CUDA global memory and will hurt performance tremendously! A simple loop working in memory may be 100x slower on CUDA global memory than on an actual CPU core with cached memory!
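A rough illustration of that point (the names and sizes here are just for the example): a sequential loop that repeatedly walks the same small working set is far cheaper if the data is staged into shared memory once instead of being re-read from uncached global memory on every pass.

    #define TILE 256

    __global__ void seq_loop_shared(const float *g_in, float *g_out, int passes)
    {
        __shared__ float tile[TILE];

        // cooperative copy: all threads help stage the working set once
        for (int i = threadIdx.x; i < TILE; i += blockDim.x)
            tile[i] = g_in[i];
        __syncthreads();

        if (threadIdx.x == 0) {
            float acc = 0.0f;
            for (int p = 0; p < passes; ++p)        // sequential passes now hit
                for (int i = 0; i < TILE; ++i)      // shared memory, not global memory
                    acc += tile[i];
            g_out[0] = acc;
        }
    }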
On the other side, you may use different techniques to improve the performance of sequential code, e.g. macro-threading to implement a programmable prefetcher / write-back cache system (see my cudachess blog) that WON'T IMPACT the execution time of your computations :-)
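As a hedged sketch of that macro-threading idea (not the exact scheme from the cudachess blog, and assuming g_in holds at least chunks * CHUNK elements): the other threads of the block act as a software prefetcher, copying the next chunk of global memory into a shared-memory double buffer while thread 0 runs the sequential computation on the current chunk.

    #define CHUNK 256

    __global__ void seq_with_prefetch(const float *g_in, float *g_out, int chunks)
    {
        __shared__ float buf[2][CHUNK];
        float acc = 0.0f;

        // preload the first chunk cooperatively
        for (int i = threadIdx.x; i < CHUNK; i += blockDim.x)
            buf[0][i] = g_in[i];
        __syncthreads();

        for (int c = 0; c < chunks; ++c) {
            int cur = c & 1, nxt = cur ^ 1;

            // "prefetcher" threads fetch chunk c+1 while thread 0 computes on chunk c
            if (threadIdx.x != 0 && c + 1 < chunks)
                for (int i = threadIdx.x - 1; i < CHUNK; i += blockDim.x - 1)
                    buf[nxt][i] = g_in[(c + 1) * CHUNK + i];

            if (threadIdx.x == 0)
                for (int i = 0; i < CHUNK; ++i)     // sequential work on the staged chunk
                    acc += buf[cur][i];

            __syncthreads();                        // swap buffers only when both sides are done
        }

        if (threadIdx.x == 0)
            g_out[0] = acc;
    }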
And better still, you may use micro-threading (segmenting pseudo-sequential code into threads of the same warp, a group of 32 threads) to accelerate things, in a way similar to the parallel execution inside a modern CPU. (Check my blog, it will come :-) )
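A small sketch of what such micro-threading can look like (one possible interpretation, assuming a single-warp launch such as <<<1, 32>>> and CUDA 9+ for __shfl_down_sync): a loop that looks sequential (a running sum) is split across the 32 lanes of one warp, each lane handling a strided slice, and the partial results are combined with warp shuffles, much like a superscalar CPU overlaps independent work.

    __global__ void warp_sum(const float *g_in, float *g_out, int n)
    {
        unsigned lane = threadIdx.x & 31;
        float acc = 0.0f;

        // each lane accumulates a strided slice of the "sequential" loop
        for (int i = lane; i < n; i += 32)
            acc += g_in[i];

        // combine the 32 partial sums inside the warp
        for (int offset = 16; offset > 0; offset >>= 1)
            acc += __shfl_down_sync(0xffffffffu, acc, offset);

        if (lane == 0)
            g_out[0] = acc;
    }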