Understanding threads: I can't understand how they work.

In the matrix multiplication example given in the programming guide (and later included in the SDK projects for execution), threads were generated, but I don't know whether more threads are being generated by the loop that assigns each element of the memory shared between host and device to a thread. Still, the thread index value doesn't seem to change, and neither does the block value. Correct me if I am wrong somewhere.

As far as I understand, threads run in parallel on the GPU; threads make up a thread block, and thread blocks of the same dimensions make up a grid.

Apart from that, I want to ask: can a GPU like this run only data-parallel programs (e.g. dot products), or is there some other way to introduce parallelism into code that would normally run on the CPU but is now running on the GPU?

Can I write some programs with a sequential flow, just to check the correctness of the GPU results?

thanks.

Khurram Hameed

Threads are only 'generated' when calling the kernel from the host, like so:

my_kernel<<<numblocks, numthreads>>>(params);

The code within __global__ void my_kernel(params) is run by all threads, and each thread can find out who it is by using blockIdx and threadIdx. During the runtime of a kernel, no new threads are generated at all.

Maybe you should start with the reduction example instead; it is much easier to understand (I speak from experience).