TensorRT FP16 conversion issue

• Hardware Platform (GPU): Tesla T4
• DeepStream Version: 6.2
• TensorRT Version: 8.5.2.2-1+cuda11.8
• NVIDIA GPU Driver Version: 515.65.01
• Issue Type: Question

Hello,

We have a custom ONNX FP32 model that we are trying to convert to TensorRT FP16. We are converting the model with this command:

/usr/src/tensorrt/bin/trtexec --fp16 --minShapes=input:1x3x480x640 --optShapes=input:32x3x480x640 --maxShapes=input:32x3x480x640 --onnx='/model.onnx' --saveEngine='model_b16.plan' --workspace=7000

When we check model_b16.plan with the --loadEngine flag, it shows FP32. It is not being converted to FP16.

Kindly guide us on how we can convert our custom ONNX FP32 model into TensorRT FP16.
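
For reference, this is roughly how the same check can be done with the TensorRT Python engine inspector; a minimal sketch that assumes the TensorRT 8.5 Python bindings are installed and that model_b16.plan is the plan produced by the command above:

import tensorrt as trt

# Deserialize the plan file produced by trtexec (filename assumed from the
# command above) and print its layer information.
logger = trt.Logger(trt.Logger.INFO)
runtime = trt.Runtime(logger)
with open("model_b16.plan", "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

inspector = engine.create_engine_inspector()
# JSON output lists every layer; how much detail it carries depends on the
# profiling verbosity the engine was built with.
print(inspector.get_engine_information(trt.LayerInformationFormat.JSON))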

Thanks,
Dax Jain

Moving to TensorRT forum for better support.

@yingliu and NVIDIA support team,
Kindly give us your thoughts on this raised query.

Hi @daxjain987 ,
Any model can be converted to TRT FP16 by setting the builder flag to use FP16 (see the sketch below).
Are you using a custom ONNX operator? If so, you will need to implement a TRT plugin.
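
A minimal sketch of setting that flag with the Python API is below; the ONNX path, the input tensor name "input", the shapes, and the workspace size are taken from the trtexec command in the first post, and everything else is an assumption to be adapted to the actual model:

import tensorrt as trt

logger = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

# Parse the ONNX model (path taken from the first post).
with open("/model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("ONNX parsing failed")

config = builder.create_builder_config()
config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, 7000 << 20)
# Allow TensorRT to pick FP16 kernels where it considers them profitable.
if builder.platform_has_fast_fp16:
    config.set_flag(trt.BuilderFlag.FP16)

# Dynamic-shape profile matching --minShapes / --optShapes / --maxShapes.
profile = builder.create_optimization_profile()
profile.set_shape("input", (1, 3, 480, 640), (32, 3, 480, 640), (32, 3, 480, 640))
config.add_optimization_profile(profile)

serialized = builder.build_serialized_network(network, config)
if serialized is None:
    raise RuntimeError("Engine build failed")
with open("model_b16.plan", "wb") as f:
    f.write(serialized)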

Thanks

Yes, we did that.
Kindly look at the command I mentioned above; we have followed the same approach.

No, we don't use any custom ONNX operators.
We have used TensorRT from this link.

Hello @AakankshaS ,

We are trying to convert the FP32 model.onnx to an FP16 model.plan, and for that we are using trtexec for the FP16 model conversion.

When we check the precision of the converted FP16 model using the --loadEngine flag, it is showing us FP32 only.
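
For completeness, the data types the engine reports for its input/output tensors can also be printed with the Python API; a minimal sketch assuming TensorRT 8.5 bindings and the model_b16.plan produced by the earlier command:

import tensorrt as trt

logger = trt.Logger(trt.Logger.INFO)
with open("model_b16.plan", "rb") as f:
    engine = trt.Runtime(logger).deserialize_cuda_engine(f.read())

# Print each I/O tensor together with its mode (input/output) and data type.
for i in range(engine.num_io_tensors):
    name = engine.get_tensor_name(i)
    print(name, engine.get_tensor_mode(name), engine.get_tensor_dtype(name))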

Kindly let us know how we can convert our model to FP16 (or mixed FP32+FP16) precision.

Thanks
Dax Jain

Hi @daxjain987 ,
Can you share the ONNX model with us so that we can try to reproduce the issue on our end?

Thanks

Hi @AakankshaS ,

You can download the model from here.

We have tried to convert this model to FP16 with this command:

/usr/src/tensorrt/bin/trtexec --fp16 --minShapes=input_1:0:1x416x416x3 --optShapes=input_1:0:32x416x416x3 --maxShapes=input_1:0:32x416x416x3 --onnx='yolov4.onnx' --saveEngine='model_b32.plan' --workspace=7000

After the conversion, we checked the engine with the command below:

/usr/src/tensorrt/bin/trtexec --loadEngine=model_b32.plan --dumpLayerInfo

We are getting FP32 precision in the logs.
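
For reference: the amount of per-layer detail available from --dumpLayerInfo depends on the profiling verbosity the engine was built with. Below is a minimal sketch of a builder config that records detailed layer information, assuming TensorRT 8.5; the corresponding trtexec build-time flag should be --profilingVerbosity=detailed.

import tensorrt as trt

logger = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(logger)
config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)
# Store detailed layer information in the engine so that inspection tools
# can report more than just the layer names.
config.profiling_verbosity = trt.ProfilingVerbosity.DETAILED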

Kindly help us convert the YOLOv4.onnx model to a TRT FP16 engine.

Thanks,
Dax Jain

Hi @AakankshaS,

Have you checked the issue highlighted above?

Looking forward to your prompt response!

Thanks,
Dax Jain