NvInfer Mixed-Precision ONNX


• Hardware Platform (Jetson / GPU)
GPU
• DeepStream Version
6.1.1

Hello,

I am trying to deploy an AMP-trained ONNX model to DeepStream in FP16 mode. Some of the layers (specifically the conv layers) require FP32 precision, since TensorRT will clamp the very small conv weights when converting to FP16 (weights below FP16's minimum representable magnitude underflow), causing faulty results.
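To illustrate the underflow, here is a quick sanity check with NumPy (not TensorRT itself; the weight values are made up for demonstration):

```python
import numpy as np

fp16 = np.finfo(np.float16)
print("smallest normal FP16 magnitude:", fp16.tiny)  # ~6.1e-05

# Weights below the FP16 subnormal range underflow to zero when cast:
w = np.array([1e-2, 1e-5, 1e-8], dtype=np.float32)
print(w.astype(np.float16))  # the 1e-8 weight becomes 0.0
```
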

It should be possible to avoid this by setting layer-device-precision=/beit/embeddings/patch_embeddings/projection/Conv:fp32:gpu in the configuration file; however, this does not seem to work.
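For reference, the relevant part of my nvinfer config looks like the fragment below (the layer name is copied from the ONNX graph; exact names in your export may differ):

```
[property]
# network-mode=2 selects FP16 engine building in nvinfer
network-mode=2
# Keep the patch-embedding conv in FP32 while the rest runs in FP16
layer-device-precision=/beit/embeddings/patch_embeddings/projection/Conv:fp32:gpu
```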

Do you have any suggestions on how I can keep the conv layers in FP32 and run the model in mixed precision?

ONNX model link: https://file.io/qvvjn27luBkt and config file is attached for reference.

Thanks!
beit-base-patch16-224-pt22k-ft22k.config (648 Bytes)

I cannot reproduce the same build log with your model and configuration file. What is your GPU?

I'm testing on a Dell XPS with an RTX 3050 Ti Laptop GPU, CUDA 11.8, and driver 522.30.

Please make sure the compatibility requirements are met on your machine: Quickstart Guide — DeepStream 6.1.1 Release documentation

I think this is a TensorRT issue rather than a DeepStream issue, so I have raised it on the TensorRT forum instead.

Thanks!