Hello NVIDIA team,
I am working with a Jetson AGX Thor developer kit and would like to clarify an apparent ambiguity in publicly stated GPU specifications, specifically regarding CUDA core and Tensor Core counts.
According to the Thor SoC TRM, the Blackwell GPU organization is described as follows:
-
Up to 3 GPCs, each with 4 TPCs (12 TPCs total)
-
Each TPC contains 2 SMs (up to 24 SMs total)
-
Each SM has 128 CUDA cores (3,072 CUDA cores in the maximum configuration)
-
Each SM is partitioned into four processing blocks, with each block containing a 5th-generation Tensor Core (up to 96 Tensor Cores total)
Using the CUDA Driver API on the device, I queried the runtime GPU properties and obtained the following:
-
GPU name: NVIDIA Thor
-
SM count: 20
-
L2 cache size: 32 MiB
-
Max shared memory per SM: 228 KiB
Based on the runtime-reported SM count:
- CUDA cores = 20 SM Ă— 128 CUDA cores = 2,560 CUDA cores
(this matches several published specifications, but differs from the maximum configuration described in the TRM)
Tensor Core counting, however, is where ambiguity arises:
-
At the SM (logical / software-visible) level: 20 Tensor Cores (1 per SM)
-
At the micro-architectural level: 20 SM Ă— 4 Tensor Core units per SM = 80 physical Tensor Core units
-
Some external sources reference “96 Tensor Cores,” which appears to correspond to the maximum 24-SM configuration (24 × 4)
Could you please clarify the following:
-
What is the recommended and correct way to report Tensor Core counts for Jetson AGX Thor Dev Board (SM-level logical resources vs internal micro-architectural execution units)?
-
When NVIDIA documentation or marketing material refers to “96 Tensor Cores,” does this explicitly refer to the maximum SKU and count micro-architectural Tensor Core units?
-
For technical publications and performance analysis, is it accurate to describe Tensor Core resources as “one logical Tensor Core per SM, implemented internally as four Tensor Core execution units”?
This clarification would be very helpful for ensuring accurate and consistent reporting in academic and technical work.
Thank you for your time and clarification.