What is "cores per SM"?

TimothyMasters · July 6, 2013, 9:04pm

The device properties include something called “cores per multiprocessor”. This is typically 8, 32, 48, or 192. But I cannot find a definition of ‘core’ in any of the documentation or in the two books on CUDA programming that I have. I’m just curious what this is. Thanks!

Greg · July 6, 2013, 11:00pm

A CUDA core is an arithmetic pipeline capable of performing one single-precision floating-point operation per cycle. CUDA core count and frequency can be used to compare the theoretical single-precision performance of two different NVIDIA GPUs.
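As a rough illustration of that comparison, here is a minimal sketch that multiplies SM count, cores per SM, and clock frequency, taking "one operation per core per cycle" at face value. The function names and both device configurations are hypothetical and chosen only for the arithmetic; they do not describe any real GPU.

```cpp
#include <cstdio>

// Theoretical single-precision throughput in giga-operations per second,
// assuming each CUDA core retires one operation per clock cycle.
double peak_sp_gops(int sm_count, int cores_per_sm, double clock_ghz) {
    return static_cast<double>(sm_count) * cores_per_sm * clock_ghz;
}

// Prints a side-by-side comparison of two hypothetical devices.
void compare_hypothetical_gpus() {
    // Hypothetical device A: 8 SMs x 192 cores at 1.0 GHz.
    double a = peak_sp_gops(8, 192, 1.0);
    // Hypothetical device B: 16 SMs x 32 cores at 1.5 GHz.
    double b = peak_sp_gops(16, 32, 1.5);
    std::printf("A: %.0f Gop/s, B: %.0f Gop/s, ratio A/B: %.2f\n", a, b, a / b);
}
```

Published peak FLOPS figures usually count a fused multiply-add as two operations, so they tend to be twice what this sketch reports. Either way, the result is only a theoretical ceiling, not a prediction of kernel performance.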

As a CUDA programmer you should completely avoid the notion of CUDA cores, as they are not relevant to the design, implementation, or performance of a kernel.

An NVIDIA GPU contains one or more Streaming Multiprocessors (SMs). Each SM has 1-4 warp schedulers. Each warp scheduler has a register file and multiple execution units. The execution units may be exclusive to the warp scheduler or shared between schedulers. Execution units include CUDA cores (FP/INT), special function units, texture units, and load/store units. The Fermi and Kepler white papers provide additional information.

TimothyMasters · July 7, 2013, 11:04am

Greg - Thank you. That was very helpful. I had guessed that it was something like that, but it’s nice to have confirmation. I’ll hunt around here for the white papers.

Tim

ny1 · June 17, 2021, 9:41pm

Thanks for stating this concept very clearly. I think these wisdom bits should be included in a FAQ :) Thanks a lot.

devin.he1 · August 28, 2024, 8:09am

How do I check the number of SMs on a GPU?

njuffa · August 28, 2024, 8:28am

Call cudaGetDeviceProperties() for the device you want to query, then look at the multiProcessorCount component of the cudaDeviceProp variable filled in by the function.
