Management of threads

Elliole · March 28, 2010, 8:35am

Hello,

I’ve a pretty simple question, but I don’t find answer anywhere : if I have an array with a size bigger than the number of allowed threads on my GPU, which method is the best :
a ) splitting the array in two arrays and calling twice the kernel
b ) a thread manages two elements of the array

Thanks for your help.

mouser58907 · March 28, 2010, 8:50am

It really depends on the application. You should consider your memory access, and see which way would give you the most coalesced reads.

Elliole · March 28, 2010, 2:29pm

Ok, I will do that, thanks for the answer.

MisterAnderson42 · March 28, 2010, 6:47pm

I think you need to worry about GPU memory limits long before the allowed thread limits:

(65 535 * 65 535 * 512 * 4) / (1 024^3) = 8 191.75

Assuming each thread works on 1 4-byte element, that allows the maximum possible number of threads to process 8 Terabytes of data: way more than will fit on current GPUs.

Topic		Replies	Views
Quick Thread Question Regarding Calling a kernel CUDA Programming and Performance	13	3635	June 26, 2008
Determining the kernel dimension CUDA Programming and Performance	3	4465	May 26, 2009
Fast reading of some array CUDA Programming and Performance	3	1543	December 17, 2009
thread / block allocation in function of data size CUDA Programming and Performance	5	4284	November 9, 2009
Optimization problem how many blocks/ threads... CUDA Programming and Performance	1	1908	July 9, 2010
Limit of memory per thread? can't find a solution in the programmers guide CUDA Programming and Performance	1	1128	June 6, 2010
GPU vs CPU - how large can threads be? CUDA Programming and Performance	8	2375	May 12, 2010
Urgent help with threads please! CUDA Programming and Performance	21	10817	March 6, 2008
Can I Control Thread ID? CUDA Programming and Performance	3	4383	June 9, 2008
Block/threads and stuff... CUDA Programming and Performance	5	4924	September 12, 2008

Management of threads

Related topics