Hi, everyone.

I’m studying Evolutionary Computation using CUDA.

My hardware is a 9800 GT (14 MPs, 512 MB) and an Intel E6300.

I have implemented Evolutionary Computation on CUDA.

It was successful: it ran faster than on the CPU.

But while studying, I ran into some questions.

First,

about dimGrid and dimBlock.

I have 14 MPs, which means 112 SPs (8 SPs per MP).

So, where possible, I set dimGrid to a multiple of 14 (the number of MPs),

because I understood that one block maps to one MP and that its 8 SPs run 8 threads.

For example, with 100 data items:

dimGrid = 14, dimBlock = 8

Later, I learned something new:

an efficient dimBlock is a multiple of 32.

I found out that 32 is the warp size.

Hmm, so then dimGrid = 4, dimBlock = 32?

If that is not an efficient choice of grid and block dimensions,

please help me.

For example, with 100, 200, or 500 data items:

dimGrid = ? , dimBlock = ?
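
For what it is worth, my current guess is that dimGrid should be the number of data items divided by dimBlock, rounded up, so that every item gets a thread. Here is a minimal host-side sketch of that arithmetic in plain C++. The helper name blocksFor is one I made up, and the block size of 32 is only my assumption from the warp size, not a verified optimum.

```cpp
#include <cassert>

// Hypothetical helper (the name is mine, not part of CUDA):
// smallest number of blocks such that
// blocks * threadsPerBlock >= numData (ceiling division).
int blocksFor(int numData, int threadsPerBlock) {
    assert(numData >= 0 && threadsPerBlock > 0);
    return (numData + threadsPerBlock - 1) / threadsPerBlock;
}
```

With dimBlock = 32 this gives dimGrid = 4, 7, and 16 for 100, 200, and 500 data items. The last block then has more threads than remaining items, so I assume the kernel needs a bounds check on the index before touching the data.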

Second,

when running only one cycle:

if the GPU cores process 112 data items,

does threadIdx run from 0 to 111?

Or does blockIdx * blockDim + threadIdx run from 0 to 111?
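
My understanding so far, which may be wrong, is that threadIdx only counts within one block (0 to blockDim - 1), and that blockIdx * blockDim + threadIdx is the index that covers all the data. The sketch below emulates a 1D launch on the CPU in plain C++ to list those global indices. globalIndices is a hypothetical helper, not a CUDA API, and the loop variables only mimic the CUDA built-ins.

```cpp
#include <cassert>
#include <vector>

// CPU emulation of a 1D launch (hypothetical helper, not CUDA itself):
// collects every global index that passes the bounds check.
std::vector<int> globalIndices(int gridDim, int blockDim, int numData) {
    assert(gridDim > 0 && blockDim > 0);
    std::vector<int> ids;
    for (int blockIdx = 0; blockIdx < gridDim; ++blockIdx) {
        // threadIdx restarts at 0 in every block
        for (int threadIdx = 0; threadIdx < blockDim; ++threadIdx) {
            int i = blockIdx * blockDim + threadIdx;  // global index
            if (i < numData) {  // surplus threads in the last block do nothing
                ids.push_back(i);
            }
        }
    }
    return ids;
}
```

For example, globalIndices(4, 32, 112) yields the indices 0 through 111, while threadIdx on its own never goes above 31 in that configuration.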

Please help me.

Thank you for reading this post.