TensorRT with BART

Can we get an example of how to use TensorRT for a BART model in PyTorch?
TensorRT does not support some of the ops in BART, so we would love a path that falls back to PyTorch for the unsupported ops.
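
For illustration, the kind of graceful fallback I mean is similar to what ONNX Runtime's TensorRT execution provider does, where ops TensorRT cannot handle run on another execution provider instead of failing; ideally we would have something similar that falls back to PyTorch. A rough sketch of the ONNX Runtime variant (the model path, input name, and shapes below are placeholders, assuming the model has already been exported to ONNX):

import numpy as np
import onnxruntime as ort

# Placeholder path to an ONNX export of BART (assumption)
sess = ort.InferenceSession(
    "bart-base.onnx",
    providers=[
        "TensorrtExecutionProvider",  # TensorRT runs the subgraphs it supports
        "CUDAExecutionProvider",      # ops TensorRT cannot handle fall back to CUDA
        "CPUExecutionProvider",
    ],
)

# Dummy token ids; in practice these come from the BART tokenizer
input_ids = np.ones((1, 16), dtype=np.int64)
outputs = sess.run(None, {"input_ids": input_ids})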

Thanks,
Efrat

Hi,
Could you share the ONNX model and the script, if you have not already, so that we can assist you better?
In the meantime, you can try a few things:

  1. Validate your model with the snippet below:

check_model.py

import sys
import onnx

# Usage: python check_model.py <path-to-your-onnx-model>
filename = sys.argv[1]
model = onnx.load(filename)

# Raises an exception if the model is structurally invalid
onnx.checker.check_model(model)
  2. Try running your model with the trtexec command:
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec
If you are still facing the issue, please share the trtexec "--verbose" log for further debugging.
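
For example (the model path is a placeholder; keep whatever shape flags you normally pass, and redirect the output to capture the full log):

trtexec --onnx=your_model.onnx --verbose > trtexec_verbose.log 2>&1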
Thanks!

Hi, I am trying to convert the facebook/bart-base model to TensorRT.
1) check_model.py is OK.
2) trtexec --explicitBatch --onnx=Bart/temp5/Bart/bart-base/Bart-bart-base.onnx --minShapes=input_ids:16x1 --maxShapes=input_ids:16x32 --optShapes=input_ids:16x16 --buildOnly --saveEngine=gpt.trt

Core dump when loading the ONNX file:
[01/12/2022-09:32:11] [W] --explicitBatch flag has been deprecated and has no effect!
[01/12/2022-09:32:11] [W] Explicit batch dim is automatically enabled if input model is ONNX or if dynamic shapes are provided when the engine is built.
[01/12/2022-09:32:13] [W] [TRT] parsers/onnx/onnx2trt_utils.cpp:364: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[01/12/2022-09:32:15] [W] [TRT] parsers/onnx/onnx2trt_utils.cpp:392: One or more weights outside the range of INT32 was clamped
[01/12/2022-09:32:15] [W] [TRT] parsers/onnx/onnx2trt_utils.cpp:392: One or more weights outside the range of INT32 was clamped
[01/12/2022-09:32:15] [W] [TRT] parsers/onnx/onnx2trt_utils.cpp:392: One or more weights outside the range of INT32 was clamped
[01/12/2022-09:32:15] [W] [TRT] parsers/onnx/onnx2trt_utils.cpp:392: One or more weights outside the range of INT32 was clamped
[01/12/2022-09:32:18] [W] [TRT] parsers/onnx/onnx2trt_utils.cpp:392: One or more weights outside the range of INT32 was clamped
[01/12/2022-09:32:18] [W] [TRT] parsers/onnx/onnx2trt_utils.cpp:392: One or more weights outside the range of INT32 was clamped
[01/12/2022-09:33:05] [W] [TRT] Output type must be INT32 for shape outputs
[01/12/2022-09:33:05] [W] [TRT] Output type must be INT32 for shape outputs
[01/12/2022-09:33:05] [W] [TRT] Output type must be INT32 for shape outputs
[01/12/2022-09:33:05] [W] [TRT] Output type must be INT32 for shape outputs
[01/12/2022-09:33:05] [W] [TRT] Output type must be INT32 for shape outputs
[01/12/2022-09:33:08] [W] [TRT] Myelin graph with multiple dynamic values may have poor performance if they differ. Dynamic values are:
[01/12/2022-09:33:08] [W] [TRT] (# 1 (RESHAPE 16 (# 1 (RESHAPE 16 (* E0 (MIN 1 E0)) | 16 E0 zeroIsPlaceholder)) | 16 E0 zeroIsPlaceholder)) where E0=(+ (# 1 (SHAPE input_ids)) -1)
[01/12/2022-09:33:08] [W] [TRT] (# 1 (SHAPE input_ids))
Full log attached: trt.log (1.1 MB)

Environment:
onnxruntime: 1.8.0
onnx: 1.9.0
transformers: 4.15.0
torch: 1.10.1+cu113
TensorRT: 8.2.1
Driver: 470.82.01, CUDA: 11.4
GPU: T4
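
For completeness, the ONNX file was produced with a torch.onnx.export call roughly like the sketch below (this is an assumption about the export: a single input_ids input, dynamic batch and sequence axes, opset 13; the actual script may differ):

import torch
from transformers import BartModel, BartTokenizer

# return_dict=False and use_cache=False so tracing sees a plain tuple of tensors
model = BartModel.from_pretrained(
    "facebook/bart-base", return_dict=False, use_cache=False
).eval()
tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")

dummy_input_ids = tokenizer("hello world", return_tensors="pt")["input_ids"]

torch.onnx.export(
    model,
    (dummy_input_ids,),
    "Bart-bart-base.onnx",
    input_names=["input_ids"],
    output_names=["last_hidden_state"],
    # assumed dynamic axes, matching the min/opt/max shapes passed to trtexec
    dynamic_axes={"input_ids": {0: "batch", 1: "sequence"}},
    opset_version=13,
)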

@NVES, do you have any suggestions?