Why 32 SP/SM in Fermi instead of 16SP/SM?

pacard · May 11, 2010, 3:04am

The Fermi white paper states that there are 32 SPs in one SM in Fermi. To my understanding, that means each SM is a 32-way SIMD unit.

But each SM is also equipped with 2 issue units, which is called dual-issue. The two issue units can decode two different instructions and issue them to half of the SM. This seems like two independent 16-way SMs. So why bother putting them together to form a 32-way SIMD?

Also, there are 16 load/store units per SM. So when two load/store instructions are dual-issued, does it mean that the two instructions will have to be executed in two cycles?

plmae · May 11, 2010, 9:50am

i would guess that it allowed to double sp count not requiring to double shared mem and register file.

Jimmy_Pettersson · May 11, 2010, 1:16pm

Good point.

Keldor314 · May 16, 2010, 3:11pm

The real reason for the two warps per SM is that the ALUs can be linked together for double precision. Since instruction issue is only once every two clock cycles, it takes 16 SPs to fill a 32 wide warp every two clocks.

Topic		Replies	Views
Inquisitive about SP cores in SMs CUDA Programming and Performance	3	1413	October 1, 2009
questions about sp and sm CUDA Programming and Performance	5	4103	June 19, 2019
Fermi Warp Sheduling CUDA Programming and Performance	1	3048	September 30, 2011
Question regarding Pascal architecture CUDA Programming and Performance	13	3026	March 16, 2017
Understanding fermi warp scheduler CUDA Programming and Performance	0	2391	December 2, 2011
size of SIMD unit CUDA Programming and Performance	2	3971	December 22, 2009
Fermi doesn't keep all execution units busy? CUDA Programming and Performance	2	4764	February 24, 2010
SP and Warp CUDA Programming and Performance	3	3432	May 2, 2010
Why does a warp consist of 32 threads? Why is a thread not say 16 or 64 threads? Whats the hardware CUDA Programming and Performance	14	20919	September 15, 2009
About the number of CUDA cores in SMSP, less or gerater than warp threads number(32) CUDA Programming and Performance	8	886	June 17, 2024

Why 32 SP/SM in Fermi instead of 16SP/SM?

Related topics