I’m studying Evolutionary Computation using CUDA
I have 9800GT(14MP, 512MB), INTEL E6300
I have implemented Evolutionary Computation on CUDA
So, It was successful. That was faster than CPU
But, Studying, I have any Questions.
About dimGrid and dimBlock…
I have 14MP -> 112 SP;
so, I coded ( dimGrid -> N times 14MP ) possibly.
because i knew that MP -> BLOCK , 8SP -> 8 threads
for example, num of Data = 100
dimGrid = 14, dimBlock = 8
After… I know new information…
Efficient dimBlock is N times 32;
I catched that 32 is warpsize.
Hmm… Then… dimGrid = 4, dimBlock = 32??
if) that is not efficient dimension of GIRD & BLOCK
example… Num of data 100, 200, 500;;
dimGrid = ? , dimBlock = ?
when only 1 cycles;;
if GPU cores Implement 112 datas…
threadIdx = 0 ~ 112 ?
or, blockIdx * blockDim + threadIdx = 0~112 ??
thank you, read this post;