about occupancy

sgu · December 15, 2009, 2:53am

Hello,
CUDA_PROFILE shows the occupancy of a kernel is equal to one, however, there are incoherent global store and load. does that mean that even I can avoid the incoherent access to the global memory, I cannot get performance benefit? does occupancy equal to one mean the processors are busy all the time and no time are wasted for waiting memory access? Thank you.

LSChien · December 15, 2009, 12:46pm

no, occupancy = 1 means that you have 1024 threads (compute capability 1.3) to hide memory latency,

I depict a picture in the thread

http://forums.nvidia.com/index.php?showtop…rt=#entry604969

maybe it is helpful.

sgu · December 16, 2009, 2:53am

According to the programming guide, Occupancy is the ratio of the number of active warps per multiprocessor to the maximum number of possible active warps. But what warps are active? If some threads within the warp are waiting for the data, is the warp considered to be active or inactive? Thanks.

LSChien · December 16, 2009, 9:32am

“number of active warps per multiprocessor” means how many warps could be seen by warp scheduler of SM.

it is a static value, determined after compilation.

suppose you have 100 thread blocks on TeslaC1060 and each block has 512 threads.

We may assume occupancy is 50 %, say only one thread block is active in one SM, or say

only 16 warps are active in one SM.

initially, only 30 blocks among 100 blocks are scheduled into 30 SMs, warp scheduler choose one warp among 16 warps

to execute, and then pick up next one according to round-robin. If some warp waits for I/O, then it is put into waiting queue,

warp scheduler does not choose warps in waiting queue to execute.

However at this time, warps in waiting queue are still called “active warps”

Topic		Replies	Views
Kernel Occupancy Could someone explain this? CUDA Programming and Performance	1	11882	March 19, 2010
question about calculating occupancy CUDA Programming and Performance	2	6523	April 7, 2010
Exact meaning of "occupancy" Slightly confused CUDA Programming and Performance	2	2271	April 20, 2009
Occupancy CUDA Programming and Performance	3	3886	May 22, 2008
Few performance questions occupancy,active threads,cta_launch CUDA Programming and Performance	4	4754	January 30, 2009
Occupancy calculator CUDA Programming and Performance	2	923	January 31, 2011
CUDA Visual Profiler Vista CUDA Programming and Performance	2	4131	September 11, 2009
Warp switching does anybody understands the mechanism CUDA Programming and Performance	16	8505	March 28, 2008
Occupancy wierdness.... Is the calculator wrong? CUDA Programming and Performance	5	5901	July 25, 2007
Amount of Shared Memory CUDA Programming and Performance	10	4205	June 3, 2010

about occupancy

Related topics