Description
We have implemented a C++ application that runs inference on an object detection CNN.
Some of the activations following the convolutions are custom and implemented using IPluginV2Ext.
We would like these custom plugins to support both FP16 and FP32 inference, depending on whether or not the builder is configured for FP16.
What is the best way to implement this so that the best tactic is also selected when the engine is built?
I've done some googling but cannot find a definitive answer, so I'm hoping to get some clarification here.
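For reference, here is a minimal sketch of the direction I'm considering, covering only the precision-related IPluginV2Ext overrides. The class name MyActivationPlugin and the launcher launchMyActivation are placeholders, and the remaining plugin boilerplate (clone, serialization, getOutputDimensions, the creator, etc.) is omitted, so the class stays abstract as written:

```cpp
#include <NvInfer.h>
#include <cuda_fp16.h>
#include <cstdint>

using namespace nvinfer1;

// Hypothetical CUDA kernel launcher, templated on element type
// (instantiated for float and __half in a separate .cu file).
template <typename T>
void launchMyActivation(const T* in, T* out, int64_t count, cudaStream_t stream);

class MyActivationPlugin : public IPluginV2Ext
{
public:
    // Advertise both precisions so the builder can time FP16 tactics as well.
    bool supportsFormat(DataType type, PluginFormat format) const override
    {
        return (type == DataType::kFLOAT || type == DataType::kHALF)
            && format == PluginFormat::kLINEAR;
    }

    // The output type simply follows the input type.
    DataType getOutputDataType(int index, const DataType* inputTypes,
                               int nbInputs) const override
    {
        return inputTypes[0];
    }

    // Remember which type the builder actually selected for this layer.
    void configurePlugin(const Dims* inputDims, int nbInputs,
                         const Dims* outputDims, int nbOutputs,
                         const DataType* inputTypes, const DataType* outputTypes,
                         const bool* inputIsBroadcast, const bool* outputIsBroadcast,
                         PluginFormat floatFormat, int maxBatchSize) override
    {
        mType = inputTypes[0];
        mVolume = 1;
        for (int i = 0; i < inputDims[0].nbDims; ++i)
            mVolume *= inputDims[0].d[i];
    }

    // Dispatch to the kernel instantiation matching the configured type.
    int enqueue(int batchSize, const void* const* inputs, void** outputs,
                void* workspace, cudaStream_t stream) override
    {
        const int64_t count = static_cast<int64_t>(batchSize) * mVolume;
        if (mType == DataType::kHALF)
        {
            launchMyActivation(static_cast<const __half*>(inputs[0]),
                               static_cast<__half*>(outputs[0]), count, stream);
        }
        else
        {
            launchMyActivation(static_cast<const float*>(inputs[0]),
                               static_cast<float*>(outputs[0]), count, stream);
        }
        return 0;
    }

private:
    DataType mType{DataType::kFLOAT};
    int64_t mVolume{0};
};
```

On the build side I'm simply enabling FP16 with config->setFlag(BuilderFlag::kFP16) and leaving the precision choice to the builder. Is this enough for it to time both FP16 and FP32 tactics through the plugin, or do I also need kSTRICT_TYPES / per-layer setPrecision?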
Environment
TensorRT Version: 7.2.3.4
GPU Type: RTX 3060 Mobile, RTX 3060 SUPER Desktop
Nvidia Driver Version: 516.59
CUDA Version: 11.2
CUDNN Version: Unknown
Operating System + Version: Ubuntu 18.04
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): NVIDIA Docker runtime