INT4 on Jetson-AGX-Orin or Jetson-Orin-Nano?

Hello,

Do the Nvidia Jetson Orin series support INT4 operations?

The product documentation states that the Jetson AGX Orin supports FP32 on the Tensor Cores and that Orin contains 3rd-generation Tensor Cores.

I believe the third-generation Ampere Tensor Cores support INT4 operations (the arithmetic logic units are the same as in the A100 GPUs).

Can you confirm whether the Jetson Orin supports INT4 operations, and whether that support extends to all cores or just the Tensor Cores?

Hi,

In terms of hardware capability, Orin's Tensor Cores support INT8 (IMMA) and FP16 (HMMA) matrix operations.
INT4 usually refers to software-level quantization rather than a native hardware data type.
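To illustrate what software-level INT4 quantization means, here is a minimal sketch of symmetric per-tensor quantization in plain Python. The function names and the simple max-abs scaling scheme are illustrative assumptions, not TensorRT's actual implementation:

```python
# Illustrative sketch only: map float weights to signed 4-bit codes in [-8, 7]
# using a symmetric per-tensor scale. Real toolchains (e.g. TensorRT) use
# their own, more sophisticated calibration.
def quantize_int4(weights):
    """Return (4-bit codes, scale) for a list of float weights."""
    scale = max(abs(w) for w in weights) / 7.0  # symmetric scale, +/-7 range
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from 4-bit codes."""
    return [v * scale for v in q]

weights = [0.9, -0.45, 0.12, -0.88]
q, scale = quantize_int4(weights)
print(q)                      # integer codes, each fits in 4 bits
print(dequantize(q, scale))   # approximate reconstruction of the weights
```

The quantization error comes from rounding each weight to one of only 16 levels, which is why INT4 is typically applied to weights (tolerant of noise) rather than activations.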

TensorRT supports the INT4 data type for weight compression.
This is available on AGX Orin, but please upgrade TensorRT to 10.x or later:
https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.3.0/local_repo/nv-tensorrt-local-tegra-repo-ubuntu2204-10.3.0-cuda-12.6_1.0-1_arm64.deb

TensorRT-LLM also supports INT4 precision.
But TensorRT-LLM is not available for Jetson yet.
https://nvidia.github.io/TensorRT-LLM/reference/support-matrix.html#software

Thanks.


Thank you.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.