Quadro 2000M spec's Number of cores

baitNswitch · June 6, 2012, 11:50pm

I’m working with a Quadro 2000M.
The device properties tell me there are 4 multiprocessors, with a warp size of 32.
The specs say this chip has 192 cores.

4 * 32 = 128
Where do the other 64 cores come from?

mfatica · June 7, 2012, 12:22am

It is a compute capability 2.1 device, so each SM has 48 CUDA cores: 4 SMs x 48 cores = 192.
Warp size has nothing to do with the number of cores.
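
To make the arithmetic concrete, here is a minimal sketch of the lookup, written as plain C++ so it runs without a GPU. The helper names `cores_per_sm` and `total_cores` are hypothetical and not part of the CUDA runtime, and the table only covers a few compute capabilities. In a real program the inputs would come from the `multiProcessorCount`, `major` and `minor` fields that `cudaGetDeviceProperties` fills in. Note that `warpSize` does not enter the calculation at all.

```cpp
// Hypothetical helper (not a CUDA runtime call): CUDA cores per SM for the
// few compute capabilities listed here. Returns -1 for anything else.
int cores_per_sm(int major, int minor) {
    if (major == 1) return 8;                 // compute capability 1.x
    if (major == 2 && minor == 0) return 32;  // Fermi, compute capability 2.0
    if (major == 2 && minor == 1) return 48;  // Fermi, compute capability 2.1
    if (major == 3) return 192;               // Kepler, compute capability 3.x
    return -1;                                // not covered by this sketch
}

// Total CUDA cores = number of SMs * cores per SM.
// sm_count corresponds to cudaDeviceProp::multiProcessorCount.
int total_cores(int sm_count, int major, int minor) {
    int per_sm = cores_per_sm(major, minor);
    return per_sm < 0 ? -1 : sm_count * per_sm;
}
```

For the Quadro 2000M values in this thread, `total_cores(4, 2, 1)` gives 4 x 48 = 192.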

baitNswitch · June 7, 2012, 1:43pm

Here, each SM has 48 HW-cores.
The SM can only run one kernel at a time. That would mean that every HW-core in the SM is running the same kernel. (Not some cores running kernelA, while others run kernelB.) True?

The warp scheduler issues work to the SM in units of warps. (Here a warp is 32 threads.) True?

How can I keep all 48 HW-cores busy if work is issued in units of 32?
What am I missing?

seibert · June 7, 2012, 2:18pm

Fermi devices issue warp instructions to groups of 16 CUDA cores at a time, not all 32. On compute capability 2.1, there are two warp schedulers and one of them can issue two independent instructions from the same warp. So in general you will keep 32 cores busy if you have 2 warps ready to run all the time, and sometimes you will have 48 busy cores if there are warps with independent instructions. And remember that modern compute hardware is pipelined, so there are something like 10 warp instructions in the process of being executed by each group of 16 CUDA cores at any given time. This is why the CUDA programming guide recommends that you have a lot of warps available to maximize utilization.
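
A toy model of that description, in plain C++, may help with the 32-versus-48 question. It is only a sketch of the counting argument, not a hardware simulator. It assumes the 48 cores of an SM form 3 groups of 16 (48 / 16), that each of the two schedulers issues one warp instruction per step, and that dual issue adds at most one more instruction. The name `busy_cores` and the constants are made up for illustration.

```cpp
#include <algorithm>

// Assumed layout for a compute capability 2.1 SM: 3 groups of 16 cores.
constexpr int kGroupWidth = 16;
constexpr int kGroupsPerSm = 3;

// Counts how many cores receive work in one issue step under the toy model.
// ready_warps: warps that have an instruction ready to issue.
// has_independent_instruction: some ready warp has a second, independent
// instruction that can be dual-issued.
int busy_cores(int ready_warps, bool has_independent_instruction) {
    // Two schedulers, so at most two warps issue in the same step.
    int issued = std::min(std::max(ready_warps, 0), 2);
    // Dual issue adds one extra instruction from a warp that is issuing.
    if (issued > 0 && has_independent_instruction) ++issued;
    // Each issued instruction occupies one group of 16 cores.
    return std::min(issued, kGroupsPerSm) * kGroupWidth;
}
```

Under these assumptions, two ready warps without independent instructions keep 32 cores busy, and all 48 are busy only when a dual issue is also possible.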
