power of 2 question

s002wjh · November 24, 2015, 8:22pm

for mult, filter and etc etc, is it better to design with something power of 2? rather than odd number such as 100, 20 etc. for performance/efficiency reason.

s002wjh · November 24, 2015, 9:25pm

also is there 1d conv filter in cuda lib?

tera · November 24, 2015, 9:26pm

If you intend to use FFT: yes.
Otherwise: potentially. Depends a lot on what you intend to do…

CudaaduC · November 24, 2015, 10:18pm

That is a vague question.

Do you mean would it make a difference if an image was 1024 x 1024 vs 1000 x 1000 , assuming you had the choice to determine those dimensions?

In that case if you were writing your own filter kernel then it may make the code easier to write since 32 (the size of a warp) divides evenly into 1024.
It also may make it slightly faster since there is no remainder, but that difference would be very small.

Probably the best way to look at it would be to try to have the workload/array size be divisible by at least 32 or a large power of two. If you are using commercial/open source libraries then it is probably not worth worrying about.

s002wjh · November 25, 2015, 3:12pm

I look through some example code, it seem for block/thread size or address alignment etc often use power of 2 value.

also is there any example or guide on parallelize multiple nest loop in GPU?

episteme · November 25, 2015, 11:16pm

blockDim(threads/block) → 32n(multiples of warp-size), prefer to 256 or 512 (limited to 1024)
blockDim.x : blockDim.y → 32:8 better than 16:16

Topic		Replies	Views
question about performance diversity CUDA Programming and Performance	1	2978	September 10, 2007
Kernels that modify a 1D FFT Problem because power-of-two issue CUDA Programming and Performance	3	2023	September 25, 2008
Thread size in a block should be multiple of warp size? CUDA Programming and Performance	4	6088	January 17, 2013
CUFFT not a power of two element CUDA Programming and Performance	6	8433	February 27, 2010
using vectors in GPU kernel CUDA Programming and Performance	3	2521	March 24, 2007
need Help with Filter example CUDA Programming and Performance	1	970	June 1, 2009
Choosing gridSize and blockSize for better performance on TX2 CUDA Programming and Performance	2	657	December 29, 2019
1d convolution performance CUDA Programming and Performance	13	8113	November 14, 2018
3D CUFFT strange effect on volume dimensions 3D CUFFT strange effect on volume dimens CUDA Programming and Performance	1	2604	April 25, 2008
choosing the best grid/block dimensions CUDA Programming and Performance	3	1121	January 30, 2016

power of 2 question

very SMALL difference

Related topics