The “NVIDIA Fermi Compute Architecture Whitepaper” says: “A CUDA core executes a floating point or integer instruction per clock for a thread”. We know that the processor clock rate is twice the core clock rate. Which of the two does “clock” refer to in the whitepaper?
Another sentence in the whitepaper says: “Fermi’s dual warp scheduler selects two warps, and issues one instruction from each warp to a group of sixteen cores, sixteen load/store units, or four SFUs”. Note that if “clock” means the core clock, a 32-thread warp running on a group of sixteen cores needs two core clocks per instruction, so each warp scheduler would issue one warp instruction only every two core clocks.
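To make the arithmetic behind my question concrete, here is a rough sketch of the two readings. It assumes 32 threads per warp, the group of sixteen cores from the quote, and the 2:1 processor-to-core clock ratio stated above. The function and variable names are my own, for illustration only, and do not come from the whitepaper.

```python
# Sketch of the issue-rate arithmetic under the two readings of "clock".
WARP_SIZE = 32               # threads per warp
CORES_PER_GROUP = 16         # cores receiving one warp instruction (from the quote)
PROCESSOR_TO_CORE_RATIO = 2  # assumption from the question: processor clock = 2 x core clock


def clocks_per_warp_instruction(warp_size=WARP_SIZE, cores=CORES_PER_GROUP):
    """Clocks needed for one warp instruction if each core handles one thread per clock."""
    return -(-warp_size // cores)  # ceiling division


clocks = clocks_per_warp_instruction()

# Reading A: "clock" is the core clock.
interval_if_core_clock = clocks

# Reading B: "clock" is the processor clock, converted to core clocks.
interval_if_processor_clock = clocks / PROCESSOR_TO_CORE_RATIO

print("Reading A: one warp instruction every", interval_if_core_clock, "core clocks")
print("Reading B: one warp instruction every", interval_if_processor_clock, "core clocks")
```

Under reading A each scheduler issues every two core clocks, while under reading B it issues every core clock, which is why I would like to know which one the whitepaper means.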
Thanks for your interest!