domain operator and optimal grid/block sizes

FangQ · October 14, 2008, 2:54am

hi

I am new to cuda. I had some limited experience with brook, an open-source library for GPGPU computations and I managed to get a 35x boost using brook on a nvidia 8800GT card. Now I want to play with cuda a little bit and see if it can do better.

There is a domain() operator in brook, which allow one to select a subset of a stream (a texture), and apply a kernel function only to these selected elements, for example:

a_gpu_kernel(ins.domain(int2(2,2),int2(4,4)),outs.domain(int2(2,2),int2(4,4)));

will run the kernel only for a the elements between indices (2,2) to (4,4).
I am wondering if there is a similar operator in cuda.

Also, my kernel function is a very simple finite difference operator, it involves a few algebraic operations for each pixel, and all pixels are independent to each other. I am wondering if I want to translate to cuda, what’s the grid size and block size that I shall supply? should I use my array size (2D texture) as grid size and set block size to dim3(1,1,1)?

thank you

Topic		Replies	Views
How to device the size of block and grid for Kernel? CUDA Programming and Performance	2	284	September 18, 2023
block size CUDA Programming and Performance	6	5864	July 21, 2013
Block size and grid size CUDA Programming and Performance	5	8391	April 27, 2009
Noob needs advice CUDA Programming and Performance	1	573	February 7, 2015
Grids and Threads question CUDA Programming and Performance	2	4426	August 7, 2007
Grid is only 2d? CUDA Programming and Performance	5	5485	March 14, 2007
choosing the best grid/block dimensions CUDA Programming and Performance	3	1111	January 30, 2016
LARGE 2D arrays CUDA Programming and Performance	10	8582	August 11, 2011
How to determine the Block Size CUDA Programming and Performance	1	5919	September 4, 2009
What are the limits on block size? CUDA Programming and Performance	1	3855	July 22, 2011

domain operator and optimal grid/block sizes

Related topics