TensorRT conversion in bfloat16

The docs state that the Orin NX supports bfloat16.


However, it is not available when converting an ONNX model to TensorRT.

/usr/src/tensorrt/bin/trtexec --help
  --noTF32                    Disable tf32 precision (default is to enable tf32, in addition to fp32)
  --fp16                      Enable fp16 precision, in addition to fp32 (default = disabled)
  --int8                      Enable int8 precision, in addition to fp32 (default = disabled)
  --best                      Enable all precisions to achieve the best performance (default = disabled)

The Python API reference is the same: only classic FP16 is listed, no bfloat16.
https://docs.nvidia.com/deeplearning/tensorrt/api/python_api/infer/FoundationalTypes/DataType.html
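For reference, a quick way to check which data types the installed TensorRT Python bindings actually expose (standard tensorrt import, nothing else assumed):

import tensorrt as trt

# Print the TensorRT version and the DataType members this build exposes;
# BF16 only shows up in releases that support bfloat16.
print(trt.__version__)
print(list(trt.DataType.__members__))
print(hasattr(trt.BuilderFlag, "BF16"))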

How can TensorRT conversion in bfloat16 be enabled?
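For comparison, here is a minimal sketch of how bfloat16 can be requested through the builder API in TensorRT releases that expose trt.BuilderFlag.BF16 (it is not present in the 8.x releases, so it will not run on the version the trtexec output above appears to come from; "model.onnx" and "model.engine" are placeholder paths):

import tensorrt as trt

# Sketch: build a bfloat16 engine from an ONNX file with the TensorRT Python API.
# Assumes a TensorRT release that exposes trt.BuilderFlag.BF16.
logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

with open("model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("failed to parse the ONNX model")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.BF16)  # request bfloat16 kernels, in addition to fp32

engine_bytes = builder.build_serialized_network(network, config)
with open("model.engine", "wb") as f:
    f.write(engine_bytes)

Newer trtexec builds likewise list a --bf16 option in their --help output; if that flag is present in the installed version, passing it is the command-line equivalent.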

solved:

Glad to know you found the solution, thanks for sharing!
