TensortRT execute with variable batch size gave incorrect results

jon.richardsm6bq8 · November 9, 2021, 5:13pm

Description

Using code very similar to MNIST UFF sample.

Engine created with
builder->setMaxBatchSize(32);
Then run inference on 32 input tiles gives expected result.
context->execute(32, &internalConfig->buffers[0]);
Then run inference on 31 input tiles gives expected result.
context->execute(31, &internalConfig->buffers[0]);
Then back to 32 input tiles gives the wrong result for input 32
context->execute(32, &internalConfig->buffers[0]);

I can provide code if required but initially just asking if this was a known issue.
If I always run inference on the whole batch size then it’s all fine.
Just seems a waste when I don’t have enough tiles for a whole batch.

Environment

TensorRT Version: 7.1.3.0
GPU Type: Jetson NX
Nvidia Driver Version:
CUDA Version: 10.2.89
CUDNN Version: 8.0.0.180
Operating System + Version: Jetpack 4.4.1
Python Version (if applicable):
TensorFlow Version (if applicable): 1.15
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag):

NVES · November 9, 2021, 7:08pm

Hi,
Can you try running your model with trtexec command, and share the “”–verbose"" log in case if the issue persist
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec

You can refer below link for all the supported operators list, in case any operator is not supported you need to create a custom plugin to support that operation

github.com

onnx/onnx-tensorrt/blob/main/docs/operators.md

<!--- SPDX-License-Identifier: Apache-2.0 -->

# Supported ONNX Operators

TensorRT 8.4 supports operators up to Opset 17. Latest information of ONNX operators can be found [here](https://github.com/onnx/onnx/blob/master/docs/Operators.md)

TensorRT supports the following ONNX data types: DOUBLE, FLOAT32, FLOAT16, INT8, and BOOL

> Note: There is limited support for INT32, INT64, and DOUBLE types. TensorRT will attempt to cast down INT64 to INT32 and DOUBLE down to FLOAT, clamping values to `+-INT_MAX` or `+-FLT_MAX` if necessary.

See below for the support matrix of ONNX operators in ONNX-TensorRT.

## Operator Support Matrix

| Operator                  | Supported  | Supported Types | Restrictions                                                                                                           |
|---------------------------|------------|-----------------|------------------------------------------------------------------------------------------------------------------------|
| Abs                       | Y          | FP32, FP16, INT32 |
| Acos                      | Y          | FP32, FP16 |
| Acosh                     | Y          | FP32, FP16 |
| Add                       | Y          | FP32, FP16, INT32 |

This file has been truncated. show original

Also, request you to share your model and script if not shared already so that we can help you better.

Meanwhile, for some common errors and queries please refer to below link:

Thanks!

Topic		Replies	Views
Tensorrt Execution Provider TensorRT tensorrt , cudnn , onnx	1	733	November 27, 2023
Inference TensorRT randomly returns nan TensorRT tensorrt	2	512	April 27, 2023
Error occurred while running the Tensorrt samples: [reformat.cpp::executeCutensor::385] TensorRT tensorrt	3	1168	December 12, 2023
Different engines give different inference results when using the same onnx model and giving the same input TensorRT	4	886	December 31, 2023
ONNX to TensorRT conversion (FP16 or FP32) results in integer outputs being mapped to near negative infinity (~2e-45) TensorRT tensorrt , cuda , onnx , aws , natural-language-processing-nlp , nlp	3	3184	June 6, 2022
Tensorrt with implicit batchsize TensorRT tensorrt , cudnn	3	507	January 14, 2024
ONNX batchsize setting and buffer.h assert error TensorRT	3	1160	March 23, 2021
TensorRT runtime batch processing in C++ TensorRT tensorrt	5	1531	September 8, 2021
ONNX to TensorRT Python module doesn't generate dynamic batch size engine TensorRT tensorrt , cudnn , onnx	3	1055	October 20, 2023
Converted model is broken if half precision with dynamic batch size and batch size is greater than 1 TensorRT	11	2265	October 18, 2024

TensortRT execute with variable batch size gave incorrect results

Description

Environment

Related topics