I have gone through “CUDA by Example”, but I am still confused about some concepts.
I understand that I can add the elements of two matrices in parallel.
But can I do matrix addition and an entirely different operation, say image contrast enhancement, at the same time?
I have a feeling that if I have 2 cores, I can run two entirely different applications in parallel. Is that right?
I also have the impression that “a core contains several grids, a grid contains several blocks, and a block in turn contains several threads”. Is that right?
You don’t need to think about cores when you are doing CUDA programming. The number of threads actually executed in parallel equals the warp size. As of Compute Capability 2.0+, the warp size is 32. To take advantage of this, you should make your threads-per-block count a multiple of 32.
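As a minimal sketch of that advice, here is a hypothetical vector-add kernel (the names `vecAdd`, `d_a`, `d_b`, `d_c` are illustrative, not from the book) launched with a block size that is a multiple of the warp size:

```cuda
// Hypothetical element-wise add kernel; one thread per element.
__global__ void vecAdd(const float *a, const float *b, float *c, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)                 // guard: n need not be a multiple of the block size
        c[i] = a[i] + b[i];
}

// Host side: pick a block size that is a multiple of 32.
int n = 1 << 20;
int threadsPerBlock = 256;     // 256 = 8 warps per block
int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;  // round up
vecAdd<<<blocks, threadsPerBlock>>>(d_a, d_b, d_c, n);
```

With 256 threads per block, every block decomposes into whole warps, so no warp is launched partially filled.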
The number of threads per grid determines how many threads share the same kernel code. If you want your threads to do different things, you should separate them into different kernels.
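To connect this to the original question: the two operations would be two separate kernels, and you can launch them into different streams. On hardware that supports concurrent kernel execution (Compute 2.0+), they *may* overlap. This is only a sketch; `matAdd`, `enhanceContrast`, and all the device pointers and launch dimensions are assumed to exist:

```cuda
// Two unrelated kernels launched into different non-default streams.
// Whether they actually overlap depends on the device and on resource usage.
cudaStream_t s1, s2;
cudaStreamCreate(&s1);
cudaStreamCreate(&s2);

matAdd<<<gridA, blockA, 0, s1>>>(d_A, d_B, d_C, n);          // assumed kernel
enhanceContrast<<<gridI, blockI, 0, s2>>>(d_img, w, h);      // assumed kernel

cudaStreamSynchronize(s1);
cudaStreamSynchronize(s2);
cudaStreamDestroy(s1);
cudaStreamDestroy(s2);
```

Kernels launched into the same stream, by contrast, are serialized in launch order.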
This is not true. Each multiprocessor (SM) can execute many warps concurrently. On Fermi, there are two warp schedulers, each issuing two warps every two clocks (which is not the same as one warp per clock). On Kepler, there are 4 warp schedulers, each of which can issue two instructions from the same warp per clock.
The number of threads executing concurrently (for most definitions of “concurrently”) on a CUDA device is generally much larger than the number of CUDA cores.
In general, you want far more threads than CUDA cores to maximize throughput. On a GTX 580, for example, you generally want a grid with at least a couple of thousand threads, if not more.
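One common way to oversubscribe the cores while keeping the launch configuration fixed is a grid-stride loop. A sketch, with hypothetical names (`scale`, `d_x`):

```cuda
// Grid-stride loop: a fixed-size grid covers any n, and the scheduler
// still has thousands of threads available to hide memory latency.
__global__ void scale(float *x, float s, int n)
{
    for (int i = blockIdx.x * blockDim.x + threadIdx.x;
         i < n;
         i += blockDim.x * gridDim.x)   // each thread strides across the array
        x[i] *= s;
}

// e.g. 64 blocks x 256 threads = 16384 threads in flight
scale<<<64, 256>>>(d_x, 2.0f, n);
```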
My other advice is not to draw analogies between CUDA programming and multithreaded programming on the CPU. A CUDA core is nothing like a CPU core, and a CUDA thread is not the same as a CPU thread. Read the first few chapters of the CUDA C Programming Guide; they are a very good introduction to the basic concepts.