Warp Number != Core Number

jun.ling · February 21, 2020, 3:16am

Hi there:

./deviceQuery tells me that TX2 accommodates a maximum number of 2048 threads per multiprocessor, and the warp size is 32. This means that a multiprocessor can handle 2048 / 32 = 64 warps.
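
The arithmetic above can be checked with a short sketch. The constant names are only illustrative, and the values are the two figures quoted from ./deviceQuery rather than anything queried from the device here:

```python
# Figures quoted from ./deviceQuery in this post (hard-coded, not queried).
MAX_THREADS_PER_MULTIPROCESSOR = 2048
WARP_SIZE = 32

# A warp is a group of WARP_SIZE threads, so the number of warps that can be
# resident on one multiprocessor is the thread limit divided by the warp size.
max_resident_warps = MAX_THREADS_PER_MULTIPROCESSOR // WARP_SIZE

print(f"max resident warps per multiprocessor: {max_resident_warps}")  # 64
```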

If I am not mistaken, a warp can only be scheduled on a single core, so 64 warps would map to 64 cores. But TX2 has 128 CUDA cores per multiprocessor. If that were the case, only half of the CUDA cores would ever be needed, which I am sure is the wrong conclusion. Am I missing something here?

AastaLLL · February 24, 2020, 6:56am

Hi,

A CUDA core is a compute unit rather than a “core” in the classical CPU sense.
The unit that maps to a GPU core is a thread rather than a warp.

Usually, we map between N and 2N threads to N cores, depending on the use case.
This indicates that around 256 threads can make all the TX2 cores active.
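
As a rough sketch of the [N, 2N] rule of thumb above: the helper name is hypothetical, and reading N as the 128 CUDA cores per multiprocessor quoted in the question is an assumption. The warp size of 32 is also taken from the question.

```python
# Figures quoted in the question (hard-coded, not queried from the device).
CUDA_CORES_PER_MULTIPROCESSOR = 128
WARP_SIZE = 32


def thread_range_for_cores(cores):
    """Return the (low, high) thread counts given by the [N, 2N] rule of thumb."""
    return cores, 2 * cores


low, high = thread_range_for_cores(CUDA_CORES_PER_MULTIPROCESSOR)

# Threads are grouped into warps of WARP_SIZE, so express the range in warps too.
warps_low = low // WARP_SIZE
warps_high = high // WARP_SIZE

print(f"threads: {low} to {high}")           # threads: 128 to 256
print(f"warps:   {warps_low} to {warps_high}")  # warps:   4 to 8
```

Under these assumptions, a handful of warps is already enough to give every core a thread, because each warp carries 32 threads rather than occupying one core.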

Thanks.
