Infer a layer with a specific precision

Hi! Is it possible to force a specific layer to run in a specific precision?
In my case, I want to convert an ONNX model to an INT8 TensorRT engine, but I get errors on the NMS layer (which, as far as I know, is not supported in INT8). Could I run this particular layer in FP16 precision, or even on the CPU?
Also, maybe I'm wrong and there is a TRT plugin for INT8 quantization of the NMS layer?
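
Roughly, this is what I have in mind; a minimal sketch with the TensorRT Python API, assuming a TensorRT 8.x build and that the NMS layer can be located by name (the `model.onnx` path and the `"NonMaxSuppression"` name match are placeholders):

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)
with open("model.onnx", "rb") as f:          # placeholder path
    if not parser.parse(f.read()):
        raise RuntimeError(parser.get_error(0))

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)        # INT8 for the bulk of the network
config.set_flag(trt.BuilderFlag.FP16)        # allow FP16 where we pin it
# Ask TensorRT to honor per-layer precision requests
# (older releases use trt.BuilderFlag.STRICT_TYPES instead)
config.set_flag(trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS)
# NOTE: a real INT8 build also needs config.int8_calibrator or Q/DQ scales

for i in range(network.num_layers):
    layer = network.get_layer(i)
    # Placeholder match: pick out the NMS layer(s) by name and pin them to FP16
    if "NonMaxSuppression" in layer.name:
        layer.precision = trt.float16
        layer.set_output_type(0, trt.float16)

engine_bytes = builder.build_serialized_network(network, config)
```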

Hi @dara.vinogradova,
Would you mind trying out the example?


But this plugin only supports FP16/FP32, am I wrong?
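
If the plugin really is FP16/FP32 only, the CPU fallback I mentioned would look roughly like this: build the engine without the NMS node so it outputs raw boxes and scores, then run NMS on the host. A minimal NumPy sketch (greedy NMS, boxes in `[x1, y1, x2, y2]` format assumed):

```python
import numpy as np

def nms_cpu(boxes, scores, iou_threshold=0.5):
    """Greedy NMS over [x1, y1, x2, y2] boxes; returns indices of kept boxes."""
    x1, y1, x2, y2 = boxes[:, 0], boxes[:, 1], boxes[:, 2], boxes[:, 3]
    areas = (x2 - x1) * (y2 - y1)
    order = scores.argsort()[::-1]           # highest score first
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        # Intersection of the top box with the remaining candidates
        xx1 = np.maximum(x1[i], x1[order[1:]])
        yy1 = np.maximum(y1[i], y1[order[1:]])
        xx2 = np.minimum(x2[i], x2[order[1:]])
        yy2 = np.minimum(y2[i], y2[order[1:]])
        inter = np.maximum(0.0, xx2 - xx1) * np.maximum(0.0, yy2 - yy1)
        iou = inter / (areas[i] + areas[order[1:]] - inter)
        # Drop candidates that overlap the kept box too much
        order = order[1:][iou < iou_threshold]
    return np.array(keep)
```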