ONNX Runtime Error: fp16 precision has been set for a layer or layer output, but fp16 is not configured in the builder

nirajkale30 · January 10, 2022, 12:19pm

Hi,
I’m trying to run a Yolov5 model (yolov5s.pt) on jetson nano.
Initially, i tried converting the model pytorch model to onnx with fp32 & ran it on nano with CSI camera & code similar to https://developer.nvidia.com/blog/announcing-onnx-runtime-for-jetson . This worked fine but the FPS was low (4 fps) so i wanted to try out fp16.

I converted to model to onnx-fp16 using builtin yolov5 script (TFLite, ONNX, CoreML, TensorRT Export · Issue #251 · ultralytics/yolov5 · GitHub), the conversion was successful (it shrink the model from 28-> 14 mb) but when i try to run it on Nano, I’m getting below error:

2022-01-10 17:24:14.792809142 [W:onnxruntime:Default, tensorrt_execution_provider.h:53 log] [2022-01-10 11:54:14 WARNING] /home/onnxruntime/onnxruntime-py36/cmake/external/onnx-tensorrt/onnx2trt_utils.cpp:364: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
2022-01-10 17:24:15.842234019 [E:onnxruntime:Default, tensorrt_execution_provider.h:51 log] [2022-01-10 11:54:15 ERROR] 4: [network.cpp::validate::2555] Error Code 4: Internal Error (fp16 precision has been set for a layer or layer output, but fp16 is not configured in the builder)
Traceback (most recent call last):
File “detection.py”, line 87, in
detector = ObjectDetector(model_path )
File “detection.py”, line 26, in init
self.sess = rt.InferenceSession(onnx_model_path, providers=providers)
File “/home/niraj/.local/lib/python3.6/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py”, line 335, in init
self._create_inference_session(providers, provider_options, disabled_optimizers)
File “/home/niraj/.local/lib/python3.6/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py”, line 379, in _create_inference_session
sess.initialize_session(providers, provider_options, disabled_optimizers)
onnxruntime.capi.onnxruntime_pybind11_state.EPFail: [ONNXRuntimeError] : 11 : EP_FAIL : TensorRT EP could not build engine for fused node: TensorrtExecutionProvider_TRTKernel_graph_torch-jit-export_13938907072320197802_7_0

The attempt is to use onnx-runtime for running a model atop tensorrt (if i’ve understood it correctly). Now I’m not 100% sure if it’s a Nano issue but I would really appreciate any guidance or help with this.

Update : The model works with onnx-runtime when used with CPUExecutionProvider. Only the TensorrtExecutionProvider is causing this error.

Do i have to set any flags before loading fp16 model using onnx-runtime + tensorrt ? am i missing something here?
I’m also attaching the code & model for reference.
model:
yolov5s_fp16.onnx (14.2 MB)

code:
detection.py (3.7 KB)

AastaLLL · January 11, 2022, 3:13am

Hi,

Please note that the TensorRT engine is not portable.
Have you tried to convert the model directly on the Nano?

Thanks.

nirajkale30 · January 21, 2022, 6:17pm

Yes, I tried & it worked!
Thanks! :)

system · February 4, 2022, 6:17pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can convert to INT32 but not with FP16 TensorRT	3	1042	November 29, 2022
Converting FCN8-ResNet18 from Pytorch to TensorRT for inference on Jetson Nano TensorRT tensorrt , jetson-inference , pytorch , python , onnx	3	2262	October 12, 2021
Why inference in jetson nano with fp16 is slower than fp32 Jetson Nano tensorrt , jetson-inference	9	1938	September 5, 2021
Failed to convert onnx to trt build_engine.py TensorRT	5	1414	May 25, 2022
Convert the TRT model with FP16 Jetson TX2 jetpack , tensorrt , jetson-inference	7	2459	October 18, 2021
Tiny Yolo v3 in Python for Jetson Nano Jetson Nano	10	3159	March 19, 2020
Tensorrt FP16 conversion issue TensorRT tensorrt , cuda , gstreamer , onnx , deep-learning , deepstream	8	2512	March 6, 2023
Inference fp16 engine in c++ get Nan output but inference fp32 engine can get correct result TensorRT	13	1328	October 10, 2023
Inference of Yolov3.onnx model TensorRT	0	1211	January 8, 2020
How to infer using tensorRT on jetson nano? Jetson Nano tensorrt , deep-learning	4	1016	October 15, 2021

ONNX Runtime Error: fp16 precision has been set for a layer or layer output, but fp16 is not configured in the builder

Related topics