Description
use nemo output dynamic shape onnx, use trtexec output dynamic engine, use c++ API deserialize trt engine, while got error in enqueue.
Environment
TensorRT Version:8.2.3.0
GPU Type: a100
Nvidia Driver Version: 450.102
CUDA Version: 11.0
CUDNN Version: 8
Operating System + Version: ubuntu 20.04
Python Version (if applicable): use cpp
TensorFlow Version (if applicable):
PyTorch Version (if applicable): use nemo 1.4
Baremetal or Container (if container which image + tag):nvidia/cuda:11.0.3-cudnn8-devel-ubuntu20.04
Relevant Files
Please attach or include links to any models, data, files, or scripts necessary to reproduce your issue. (Github repo, Google Drive, Dropbox, etc.)
Steps To Reproduce
Please include:
- with nemo output a conformer model, export it with torch.export input shape(B,D,T), B T is dynamic axis
- convert onnx model into engine via trtexec, trtexec --onnx=/conformer.onnx --saveEngine=/conformer.trt --minShapes=audio_signal:1x80x100 --optShapes=audio_signal:16x80x1200 --maxShapes=audio_signal:16x80x1200 --shapes=audio_signal:16x80x1200 --workspace=10240
- use c++ deserialize conformer.trt, setBindingDimensions(signal_binding, Dim3(B, D,T)); then context->enqueueV2(buffer,stream, nullptr);
- ERROR:1: [runner.cpp::execute::416] Error Code 1: Cuda Runtime (invalid argument)