How to Calculate Speedup with CUDA


I’m working with an NVIDIA GeForce GT 750M (Kepler, 384 CUDA cores, so 192 CUDA cores per SMX).
To calculate the speedup and the efficiency, we need to compare, for example:

1. sequential_time(CPU) / parallel_time(GPU) with N cores.
2. sequential_time(CPU) / parallel_time(GPU) with N+1 cores.
3. sequential_time(CPU) / parallel_time(GPU) with N+2 cores, and so on.

and generate a chart of speedup versus the number of cores.

So, how can I calculate the speedup of my code with a varying number of cores in CUDA? Is this possible?

Thanks a lot!!

It’s not a trivial matter to scale CUDA code execution across a subset of the SMs or cores provided by a GPU, as it is with OpenMP threads on a multicore CPU.

It could be done, but the required code modifications would be rather extreme, and the results would not be indicative of what you should expect on another GPU that actually has that many cores/SMs.


What would be the correct way to calculate speedup and efficiency in CUDA?

Speedup compared to what? A CPU implementation? Divide the end-to-end run time of the application on the reference platform by the end-to-end run time on the GPU. That’s the speedup people care about in practice.

If you want to get an idea about scaling, use different GPU models with different numbers of SMs and/or multiple GPUs. Ideally the GPUs would all be from the same architecture, as the microarchitecture varies considerably across generations.

An excellent example can be seen in a recent paper on the HPCG benchmark by E. Phillips and M. Fatica. They scaled from a small embedded system all the way to supercomputers using various Kepler-family GPUs: