Grid Block launch configuration

hongzhou · November 18, 2011, 10:17am

Hi,

I would like to ask if there are anyone who knows if the configuration of the grid and block would affect the performance of a CUDA program?

My program works based on each thread to each pixels. To simplify things, I make my block size to be 1D. (i.e. dim3 block(1,N)). However, when trying to optimize, I tried to make it 2D (i.e. dim3 block(M,M) where M is multiple of 16) and it turns out it runs faster.

It really puzzles me, how the configuration improves the performance. Does it have to do with the architecture? Anyone? External Image Thanks!

pasoleatis · November 18, 2011, 10:44am

Maybe it is just your code. Could you share the kernel as well?

pQB · November 18, 2011, 2:34pm

Without the code (at least how do you access data and what do you do with it) we can only give you some hints.

[*]Block size can affect the occupancy (number of simultaneous blocks per multiprocessor).[*]You read data from a texture, which data is cached and prefetched in a 2D neighbors fashion)

Those are the main aspects that come to my head.

Regards,

Pablo.

Topic		Replies	Views
Grid size & performance CUDA Programming and Performance	1	819	September 27, 2016
Orientation of Threads in a Block. CUDA Programming and Performance	4	1278	September 30, 2009
Grids and Threads question CUDA Programming and Performance	2	4421	August 7, 2007
How to device the size of block and grid for Kernel? CUDA Programming and Performance	2	280	September 18, 2023
Significance of Linear Grid vs. 2D Grid CUDA Programming and Performance	1	1729	July 3, 2009
Impact of Grid and Block Dimension on performance CUDA Programming and Performance	1	741	November 1, 2015
speed difference on how to set up blocks sizes blocks (3,3,3) vs. (1,1,3) CUDA Programming and Performance	0	561	September 23, 2010
Strange performance relationship to grid dimension? CUDA Programming and Performance	1	964	November 26, 2009
would the block dimension influences performance? 1d block vs 2d block? CUDA Programming and Performance	7	6284	October 7, 2008
Grid dimension's decision How to take decision for organization of a grid . CUDA Programming and Performance	6	5450	March 10, 2009

Grid Block launch configuration

Related topics