Why does Jetson Xavier NX have 21Tops AI performance?

439290087 · February 22, 2022, 3:55am

As far as I know, Volta’s tensor cores do not support int8, int4, or int1. So it can only be counted by fp16 precision. And I doubt that 48 tensor cores have this level of performance.

AastaLLL · February 22, 2022, 6:32am

Hi,

XavierNX can support INT8 operations and it also has 2DLA cores for inference.
The 21 TOPS is overall performance for GPU+ 2xDLAs:

TOPS 21 = 12.3 (GPU) + 2*4.5 (each DLA)

Thanks.

439290087 · February 22, 2022, 6:58am

I’m quite sure that 384 volta CUDA kernels can’t reach 12.3 TOPS’ speed. What’s the generation of NX’s tensor core? It seems like Turing tensor cores through its INT8 performance.

BTW, can DLAs and GPU work concurrently? I mean, a layer can be implicitly divided into the execution of DLAs and GPU cores? Or I have to manually split the layer’s problem size, and dispatch them to DLAs and GPU cores.

AastaLLL · February 24, 2022, 4:16am

Hi,

Please note that the performance is tested under maximum performance.
Which is set with the following command:

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

We have a sample to demonstrate the Jetson benchmark.
You can find an example below for running GPU and DLAs together:

Thanks.

439290087 · February 24, 2022, 7:00am

It’s better to list models’ FLOPs or MACs.

system · March 23, 2022, 5:24am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
General Question about jetson Xavier NX Jetson Xavier NX dla	15	1576	October 18, 2021
What does 384-core mean? Jetson Xavier NX hw	3	3617	October 18, 2021
How the 32 TOPs of Jetson AGX Xavier is calculated? Jetson AGX Xavier	3	3711	October 18, 2021
Jetson Xavier NX : Running DLA and GPU cores at the same time and Check Nsight System Jetson Xavier NX tensorrt , kernel	2	107	June 6, 2024
Coarse comparision of Xavier with desktop GTX10XX series performance Jetson AGX Xavier	5	5808	December 2, 2020
Big difference between using DLA core and not using DLA core Jetson Xavier NX tensorrt , dla	4	3013	October 18, 2021
Xavier Tensor Core int8 Peformance cannot reach 22TOPS with cublasGemmEx API? Jetson AGX Xavier	8	905	October 18, 2021
Working with TensorRT 5.1 on Jetpack 4.2 Jetson AGX Xavier	1	787	July 2, 2019
With DLA is even slower than without DLA Jetson AGX Xavier tensorrt	7	408	February 14, 2024
Does Jetson Xavier NX support tensorrt INT8？ Jetson Xavier NX tensorrt	3	1576	December 29, 2021

Why does Jetson Xavier NX have 21Tops AI performance?

Related topics