My question is: when setting the grid size, is it better to set it to 110 instead of 25, for example?
Quick Answer: It depends
Better Answer: Trial and error is the best solution.
computations are performed in warps (32 threads each), and optimal SM load usually requires a block size of 128 or 256 threads. if you have less work than that, look for ways to combine multiple work items into a single thread block
next, a single SM can run up to 2048 threads simultaneously, so with 128 threads per block, each SM can run up to 2048/128 = 16 blocks
next, note that each block runs on a single SM, so you need at least as many blocks as you have SMs to fill the entire GPU, and you may need 16x more than that to fill all GPU resources if each block is 128 threads wide
finally, some SMs will finish their work earlier than others, so to keep GPU utilization near 100%, you need at least 10-20x more work than is required to fill the GPU at any particular moment. Alternatively, you can use multiple streams to push work to the GPU concurrently and avoid the "tail effect"
so, overall, you need at least ~100k threads to fill the GPU. With only 10 threads, your CPU will definitely be faster
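The arithmetic above can be sketched on the host by querying the device properties (the 128-thread block size and the 10x oversubscription factor are just the rules of thumb from this thread, not fixed constants):

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);

    const int blockSize = 128;  // threads per block, as discussed above
    // Blocks needed just to occupy every SM at its max resident thread count:
    int blocksToFill = prop.multiProcessorCount *
                       (prop.maxThreadsPerMultiProcessor / blockSize);
    // Rule-of-thumb oversubscription to hide the "tail effect":
    int suggested = blocksToFill * 10;

    printf("SMs: %d, blocks to fill GPU: %d, suggested grid: ~%d blocks (~%d threads)\n",
           prop.multiProcessorCount, blocksToFill, suggested,
           suggested * blockSize);
    return 0;
}
```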
Has anybody tried using cudaOccupancyMaxActiveBlocksPerMultiprocessor to configure the grid?
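For reference, a sketch of how that API can be used; `myKernel` is a placeholder kernel, but the call pattern is the standard CUDA occupancy API, which accounts for the kernel's actual register and shared-memory usage:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

__global__ void myKernel(float *data, int n) {  // placeholder kernel
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

int main() {
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);

    const int blockSize = 128;
    int blocksPerSM = 0;
    // How many resident blocks of myKernel fit on one SM at this block size:
    cudaOccupancyMaxActiveBlocksPerMultiprocessor(&blocksPerSM, myKernel,
                                                  blockSize,
                                                  0 /* dynamic smem bytes */);
    int gridSize = blocksPerSM * prop.multiProcessorCount;
    printf("blocks/SM: %d -> a grid of %d blocks fills the GPU\n",
           blocksPerSM, gridSize);
    return 0;
}
```

There is also cudaOccupancyMaxPotentialBlockSize, which additionally suggests a block size instead of taking one as input.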
No matter what, hopefully the code was written using grid-stride loops :P
The only problem I’ve ever run into when using those, though, is typing `return` instead of `continue`. Don’t do that lol XD
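For anyone unfamiliar, a grid-stride loop looks like this (a sketch with a hypothetical kernel; the comment marks the `return`-vs-`continue` pitfall mentioned above):

```cuda
__global__ void scale(float *data, int n, float s) {
    // Grid-stride loop: correct for any grid size, even one smaller than n
    for (int i = blockIdx.x * blockDim.x + threadIdx.x;
         i < n;
         i += blockDim.x * gridDim.x) {
        if (data[i] == 0.0f)
            continue;  // skip this element only; a `return` here would make the
                       // thread abandon ALL of its remaining loop iterations
        data[i] *= s;
    }
}
```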