Single-Precision Floating-Point Basic Arithmetic Throughput

biebo · October 6, 2009, 10:47am

hi all i was reading the programing guide chapter 5.

there is

Single-Precision Floating-Point Basic Arithmetic
Throughput of single-precision floating-point add, multiply, and multiply-add is 8
operations per clock cycle.

Why means it is “8”, why a constant.

can any body elaborate.

Thanks

–Biebo
NUST Pakistan

E.D_Riedijk · October 6, 2009, 12:45pm

In the current generation, each multiprocessor has 8 ALU’s. So for every clock, 8 operations are one in parallel. addition, multiplication or madd, can all be done in 1 clock cycle on a ALU, so 8 of these per clock cycle.

biebo · October 7, 2009, 4:39am

thnx :)