I want to calculate the theoretically possible GFLOPS of different NVIDIA cards based on their architectures.
G92 - GeForce 9800 GTX:
128 (cores) * 2 (MADD) * 1.688 (GHz) = 432 GFLOPS
I also read that G80/G92/GT200 can dual-issue a MAD plus an extra MUL per cycle; this would result in
128 (cores) * 3 (MAD + MUL) * 1.688 (GHz) = 648 GFLOPS
Which of the two values is correct?
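To make the arithmetic explicit, this is how I am computing it; just a sketch, and the flops-per-clock factor (2 vs. 3) is exactly the part I am unsure about:

```cpp
#include <cstdio>

// Theoretical peak = shader cores * flops issued per core per clock * shader clock (GHz)
static double peak_gflops(int cores, int flops_per_clock, double shader_clock_ghz) {
    return cores * flops_per_clock * shader_clock_ghz;
}

int main() {
    // GeForce 9800 GTX (G92): 128 SPs at a 1.688 GHz shader clock
    printf("MAD only : %.0f GFLOPS\n", peak_gflops(128, 2, 1.688)); // ~432
    printf("MAD + MUL: %.0f GFLOPS\n", peak_gflops(128, 3, 1.688)); // ~648
    return 0;
}
```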
What is the difference between G80 and G92?
I know that G92 is 65 nm and G80 is 90 nm, but are there any conceptual differences (like between G80 and GT200)?
GT200 - GTX285:
30 (each SM has a dedicated DP unit) * 2 (MADD) * 1.476 (GHz) ≈ 88.6 GFLOPS (DP)
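Spelled out the same way (the one-DP-unit-per-SM factor is my assumption from what I have read about GT200):

```cpp
#include <cstdio>

int main() {
    int sms = 30;               // GTX 285 has 30 SMs
    int dp_units_per_sm = 1;    // assumed: one dedicated DP unit per SM
    int flops_per_clock = 2;    // a MADD counts as 2 flops
    double clock_ghz = 1.476;   // shader clock
    printf("GT200 DP peak: %.1f GFLOPS\n",
           sms * dp_units_per_sm * flops_per_clock * clock_ghz);  // ~88.6
    return 0;
}
```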
Fermi - GTX480:
480 / 2 (two CUDA cores combine for one DP result) * 2 (FMA) * 1.401 (GHz) ≈ 672 GFLOPS (DP)
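Rather than hard-coding per-card numbers, I was thinking of reading the SM count and clock at runtime with cudaGetDeviceProperties and only assuming the architecture-dependent DP flops per SM per clock; the factors in this sketch are my own reading of the specs, not something the API reports:

```cpp
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, 0) != cudaSuccess) {
        fprintf(stderr, "no CUDA device found\n");
        return 1;
    }
    double clock_ghz = prop.clockRate / 1.0e6;  // clockRate is reported in kHz

    // Assumed DP flops per SM per clock (my reading of the architecture specs;
    // compute capability below 1.3 has no DP units at all):
    //   GT200: 1 DP unit  * 2 (MADD) = 2
    //   GF100: 16 DP results per clock * 2 (FMA) = 32
    double dp_flops_per_sm_per_clock = (prop.major >= 2) ? 32.0 : 2.0;

    printf("%s: %d SMs @ %.3f GHz -> %.1f GFLOPS (DP, theoretical)\n",
           prop.name, prop.multiProcessorCount, clock_ghz,
           prop.multiProcessorCount * dp_flops_per_sm_per_clock * clock_ghz);
    return 0;
}
```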
Are these calculations right, or am I misunderstanding something?
What percentage of the theoretical GFLOPS can actually be achieved (I know this depends on the algorithm, but I assume there is a rough upper bound on what is realistically reachable)?
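For measuring the achieved rate, this is the kind of microbenchmark I had in mind (my own sketch: it counts a MAD/FMA as 2 flops and uses independent register chains so the compiler cannot remove the work):

```cpp
#include <cstdio>
#include <cuda_runtime.h>

#define ITERS 10000  // iterations of the unrolled multiply-add loop per thread

__global__ void fma_burn(float *out, float a, float b)
{
    // Eight independent accumulators so the multiply-adds can pipeline
    float x0 = a, x1 = a + 1.0f, x2 = a + 2.0f, x3 = a + 3.0f;
    float x4 = a + 4.0f, x5 = a + 5.0f, x6 = a + 6.0f, x7 = a + 7.0f;
    for (int i = 0; i < ITERS; ++i) {
        x0 = x0 * b + a;  x1 = x1 * b + a;
        x2 = x2 * b + a;  x3 = x3 * b + a;
        x4 = x4 * b + a;  x5 = x5 * b + a;
        x6 = x6 * b + a;  x7 = x7 * b + a;
    }
    // Write the result so the loop is not optimized away
    out[blockIdx.x * blockDim.x + threadIdx.x] = x0 + x1 + x2 + x3 + x4 + x5 + x6 + x7;
}

int main()
{
    const int blocks = 120, threads = 256;
    float *d_out;
    cudaMalloc((void **)&d_out, blocks * threads * sizeof(float));

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    fma_burn<<<blocks, threads>>>(d_out, 1.0001f, 0.9999f);  // warm-up launch

    cudaEventRecord(start);
    fma_burn<<<blocks, threads>>>(d_out, 1.0001f, 0.9999f);
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);

    // 2 flops per MAD/FMA, 8 of them per loop iteration, per thread
    double flops = 2.0 * 8.0 * ITERS * blocks * threads;
    printf("achieved: %.1f GFLOPS (SP)\n", flops / (ms * 1.0e6));

    cudaFree(d_out);
    return 0;
}
```

Compiled with nvcc and run back to back, this should at least show how close a pure multiply-add stream gets to the theoretical numbers above.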