HOW does CUDA map to the HW General question

giurearenato · May 21, 2008, 5:26pm

Can anybody explain in detail how the CUDA Programming Model can be mapped on the HW on G80.

I mean how Grids, Blocks, Warps , Threads are processed and by wich Hardware Components.

I know that a block is mapped to a multiprocessor but i don’t understand how a warp can run physically in parallel if there are 32 Threads in a Warp and lets and just 8 Streaming Processors in one Multiprocessor.
In my opinion there can only run 8 Threads of a warp physicaly parallel at a time but I think I’m wrong.
So please help me someone External Image
Thanks!

E.D_Riedijk · May 21, 2008, 5:37pm

the instruction decoder is working at 1/4 the clockrate as the streamprocessors. So the stream processors do 4 times the same instruction (on different data). 4x8 = 32.

Topic		Replies	Views
Programming Model/Hardware Implementation mapping CUDA Programming and Performance	4	4978	February 4, 2008
Processing of a warp CUDA Programming and Performance	3	1928	September 16, 2008
Number of threads physically executing in parallel per core? Whats the physical level of parallelism CUDA Programming and Performance	5	12314	November 8, 2010
Parallel thread processing in a warp CUDA Programming and Performance	5	3701	July 17, 2009
Execution of warps CUDA Programming and Performance	1	1552	January 7, 2009
CUDA execution mapping onto GPUs CUDA Programming and Performance	0	2818	March 2, 2009
Organization of threads CUDA Programming and Performance	1	644	December 21, 2011
Relationship between sp, warp, grid tell the difference. CUDA Programming and Performance	2	3167	May 9, 2008
How they work betweem SM and block SM, SP, Block, Thread and so on. CUDA Programming and Performance	1	4322	January 8, 2008
Threads per warp vs number of cores CUDA Programming and Performance	2	2602	February 3, 2009

HOW does CUDA map to the HW General question

Related topics