I’m newbie in GPU computing and I’m a bit confused about the theoretical peak computation for nVifia GPUs. There is some listing (certainly nVidia does not give much information about, at least for non high-end products) where one can read about the theoretical peak performance, always in single precision, as for example: http://en.wikipedia.org/wiki/Comparison_of_Nvidia_graphics_processing_units
First of all, I’m confused because sometimes the processing power is stated as “Processing power (FMA)”. I understand a Fused Multiply Add (FMA) as the classical AXPY, so actually 1 FMA = 2 FLOPS. Then, in order to compare with classical CPU computations, should I multiply the “Processing Power (FMA)” times 2?
Another question, indeed the most important for me, is the one related to the double precision theoretical peak computation, because I want to do some benchmark using CULA and MAGMA and I would like to calculate the actual performance in comparison with the theoretical peak. I know that there is not the same numbers of DP units as SP units in a GPU, so the DP performance is a factor (<= 1) of the SP performance. Some information about the number of DP units in a GPU can be found in the web, in forums like this, and similar. I have never seen this kind of information in a technical nVidia brochure. The information sometimes is given as a real count of DP units or as a factor related to the number od SP units. For example, the ratio DP/SP for the GTX 460/560 is 1/12 or 1/8 for GTX 470, GTX 480, GTX 570 and GTX 580. So the question is simple, exists any comprehensive list about the DP/SP ratios for nVidia GPUs? Where is it?
I have a GeForce GTX 550 Ti. Should I understand as 1/12 the ratio DP/SP units for this card? I’m alse intended to purchase a mobile workstation equipped with a Quadro K2100M. I’ve looked for information about the DP theoretical peak and the DP/SP ratio but I have not found anything. Does anyone knows the specs about DP for this GPU?