Hardware comparison

ElGuapo_Oficial · January 23, 2014, 4:58am

Hello Guys!

Let’s say I ran the exact same algorithm on two different boards: Tesla C2070 and GeForce GTX 680.

Theoretically, on which board will the algorithm run faster? Which of the following parameters is the most significant for answering that?

Compute Capability: [C2070 = 2.0][GTX 680 = 3.0]
Processor cores: [C2070 = 448][GTX 680 = 1536]
Processor core clock: [C2070 = 1.15 GHz][GTX 680 = 1.0 GHz]
Memory clock: [C2070 = 1.50 GHz][GTX 680 = 6.0 GHz]
Memory size: [C2070 = 6 GB][GTX 680 = 2 GB]
Memory bandwidth: [C2070 = 144 GB/sec][GTX 680 = 192.26 GB/sec]

Thanks in advance!

Cheers!

droettger · January 23, 2014, 8:16am

That’s not possible to predict without knowing what the bottleneck of the algorithm is.
Clock bound? Memory bandwidth bound? Memory size bound? Parallel enough to use more cores? Specific functionality required (float/double)?

Maybe have a look through this: http://docs.nvidia.com/cuda/cuda-c-programming-guide/
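
To make the "what is the bottleneck?" point concrete, here is a rough roofline-style sketch using only the numbers from the first post. It assumes each core can retire one fused multiply-add (2 flops) per clock, and the `flops_per_byte` values are made up for illustration. The results are upper bounds, not predictions of real performance.

```python
# Specs copied from the first post of this thread.
BOARDS = {
    "Tesla C2070": {"cores": 448, "core_clock_ghz": 1.15, "bandwidth_gb_s": 144.0},
    "GeForce GTX 680": {"cores": 1536, "core_clock_ghz": 1.0, "bandwidth_gb_s": 192.26},
}


def peak_sp_gflops(cores, core_clock_ghz):
    # Assumption: one fused multiply-add (counted as 2 flops) per core per clock.
    return cores * core_clock_ghz * 2.0


def attainable_gflops(board, flops_per_byte):
    # Roofline bound: limited either by the compute peak or by
    # memory bandwidth times the kernel's arithmetic intensity.
    spec = BOARDS[board]
    peak = peak_sp_gflops(spec["cores"], spec["core_clock_ghz"])
    return min(peak, spec["bandwidth_gb_s"] * flops_per_byte)


# Hypothetical arithmetic intensities: low, medium, high.
for intensity in (0.25, 4.0, 64.0):
    for name in BOARDS:
        bound = attainable_gflops(name, intensity)
        print(f"{name}: {intensity} flop/byte -> at most {bound:.1f} GFLOP/s")
```

Under these assumptions, a kernel that does very little arithmetic per byte is capped by memory bandwidth on both boards, so the gap between them follows the bandwidth ratio. A kernel with lots of arithmetic per byte is capped by cores times clock instead, and the gap follows that ratio. Which case applies depends entirely on the algorithm.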

njuffa · January 23, 2014, 6:17pm

Your application may also be limited by specific instructions, like integer shifts, integer multiplies, atomics, or double-precision arithmetic. PCIe throughput is another potential bottleneck. You might want to start exploring your code with the help of the profiler.
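
As a back-of-the-envelope check for the PCIe case, the sketch below compares copy time against kernel time. Every number in it is a placeholder: the workload size, `PCIE_GB_S`, and `SUSTAINED_GFLOPS` are assumptions for illustration and should be replaced with values measured with the profiler on your own system.

```python
def transfer_seconds(num_bytes, pcie_gb_s):
    # Time to move num_bytes across the bus at a sustained rate given in GB/s.
    return num_bytes / (pcie_gb_s * 1e9)


def kernel_seconds(flop_count, sustained_gflops):
    # Time to execute flop_count operations at a sustained rate given in GFLOP/s.
    return flop_count / (sustained_gflops * 1e9)


# Hypothetical workload: 64 Mi floats copied to the card and back, 10 flops per element.
n = 64 * 1024 * 1024
bytes_moved = 2 * n * 4  # 4 bytes per float, one copy in and one copy out
flop_count = 10 * n

PCIE_GB_S = 6.0           # assumed sustained host<->device rate; measure your own
SUSTAINED_GFLOPS = 100.0  # assumed kernel throughput; measure your own

copy_t = transfer_seconds(bytes_moved, PCIE_GB_S)
kernel_t = kernel_seconds(flop_count, SUSTAINED_GFLOPS)

print(f"copy {copy_t * 1e3:.1f} ms, kernel {kernel_t * 1e3:.1f} ms")
print("PCIe bound" if copy_t > kernel_t else "kernel bound")
```

With these made-up numbers the copies take longer than the kernel, in which case neither core count nor memory clock would decide which board finishes first.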

ElGuapo_Oficial · January 23, 2014, 6:57pm

Hey Detlef! Thanks for taking the time!

I think my real question is: “What makes a GPU superior in terms of speed nowadays, generally speaking?” (Sorry, I’m new to this topic.)

For CPUs, generally speaking, it is the clock speed.

E.g., if I run an algorithm on CPU A [1.6 GHz] and on CPU B [3.2 GHz], I would expect a theoretical speedup of 2x for B over A.

Let’s say my algorithm is small enough to run on each board without exceeding any of the hardware capacities, and I’m using float operations only (I think the GTX 680 doesn’t handle double precision as well as the C2070 does).

So which factor should I look at in terms of speed?
[1] CUDA cores
[2] Core Speed
[3] GFLOPs
[4] Memory Speed

One more question! How do you compare GPU performance vs. CPU performance? Is there a unit that links the two somehow? GFLOPs maybe? I know it is not as simple as having just one parameter to measure speedup, as it was when parallelism was done with CPUs only.

Thanks in Advance!
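
One common way to put a CPU and a GPU on the same scale is to time the same job on both and divide the known operation count by the wall-clock time. A minimal sketch, assuming a hypothetical n x n matrix multiply (about 2 * n**3 flops) and invented timings:

```python
def achieved_gflops(flop_count, seconds):
    # Throughput actually delivered: operations performed divided by elapsed time.
    return flop_count / seconds / 1e9


def speedup(baseline_seconds, candidate_seconds):
    # How many times faster the candidate finished compared with the baseline.
    return baseline_seconds / candidate_seconds


# Hypothetical job: n x n matrix multiply, roughly 2 * n**3 floating-point operations.
n = 1024
flop_count = 2 * n**3

# Hypothetical wall-clock measurements in seconds; replace with your own timings.
timings = {"CPU": 1.20, "GPU": 0.05}

for device, seconds in timings.items():
    print(f"{device}: {achieved_gflops(flop_count, seconds):.1f} GFLOP/s")

print(f"GPU speedup over CPU: {speedup(timings['CPU'], timings['GPU']):.1f}x")
```

The achieved GFLOP/s figure only means something for the specific algorithm that was timed, and for a GPU the timing should include host-to-device copies if they are part of the job.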
