TensorRT with implicit batch size

Description

I am confused about implicit-batch-size inference. I intend to improve the overall throughput of a CNN inference task.
The documentation suggests using batching, but the API shows that the batch argument is deprecated on the enqueue function, and enqueueV3 works only in explicit-batch mode (if I do not set the [NetworkDefinitionCreationFlag::kEXPLICIT_BATCH] flag, the engine fails to build).
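For reference, here is a minimal sketch of the explicit-batch build path that TensorRT 8.x expects (the ONNX file name, input tensor name, and shapes are illustrative assumptions, not taken from the post):

```cpp
#include <NvInfer.h>
#include <NvOnnxParser.h>
#include <cstdio>

// Minimal logger required by the TensorRT builder.
class Logger : public nvinfer1::ILogger {
    void log(Severity severity, const char* msg) noexcept override {
        if (severity <= Severity::kWARNING) std::printf("%s\n", msg);
    }
};

// Sketch: build an explicit-batch engine whose first (batch) dimension is dynamic.
// "model.onnx" and the tensor name "input" are assumptions for illustration.
nvinfer1::ICudaEngine* buildEngine(Logger& logger) {
    auto builder = nvinfer1::createInferBuilder(logger);

    // Implicit batch is deprecated in TRT 8.x; kEXPLICIT_BATCH is required.
    const auto flags = 1U << static_cast<uint32_t>(
        nvinfer1::NetworkDefinitionCreationFlag::kEXPLICIT_BATCH);
    auto network = builder->createNetworkV2(flags);

    auto parser = nvonnxparser::createParser(*network, logger);
    parser->parseFromFile("model.onnx",
        static_cast<int>(nvinfer1::ILogger::Severity::kWARNING));

    auto config = builder->createBuilderConfig();

    // Optimization profile for the dynamic batch axis: min/opt/max = 1/4/8.
    // Note: profiles are set on *input* tensors only; output shapes are derived.
    auto profile = builder->createOptimizationProfile();
    profile->setDimensions("input", nvinfer1::OptProfileSelector::kMIN,
                           nvinfer1::Dims4{1, 3, 224, 224});
    profile->setDimensions("input", nvinfer1::OptProfileSelector::kOPT,
                           nvinfer1::Dims4{4, 3, 224, 224});
    profile->setDimensions("input", nvinfer1::OptProfileSelector::kMAX,
                           nvinfer1::Dims4{8, 3, 224, 224});
    config->addOptimizationProfile(profile);

    auto serialized = builder->buildSerializedNetwork(*network, *config);
    auto runtime = nvinfer1::createInferRuntime(logger);
    return runtime->deserializeCudaEngine(serialized->data(), serialized->size());
}
```

This is a sketch only; it needs a TensorRT installation and a GPU to actually run, and error checking is omitted for brevity.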

Environment

TensorRT Version: 8.5.1
GPU Type: xavier nx
Nvidia Driver Version:
CUDA Version: 11.4
CUDNN Version: 8.6
Operating System + Version: n.a.
Python Version (if applicable): n.a.
TensorFlow Version (if applicable): n.a.
PyTorch Version (if applicable): n.a.
Baremetal or Container (if container which image + tag): n.a.

I exported the network with only the batch size (the first dimension) as a dynamic axis. When I build the engine with different optimization profiles, such as 1/4/8 or 3/3/3 for the batch dimension, the output dimensions never update and remain 1xCxHxW, even though I also set the profile for the output tensor. I would expect that, at least with a 3/3/3 batch profile, the output batch dimension should be 3. Any idea?
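For what it's worth, with a dynamic batch axis the engine itself reports -1 for that dimension; the concrete output shape is only resolved at runtime, after the input shape has been set on the execution context (and profiles are only specified for inputs, never outputs). A minimal sketch, assuming the TensorRT 8.5 name-based tensor API and illustrative tensor names "input"/"output":

```cpp
#include <NvInfer.h>
#include <cuda_runtime_api.h>

// Sketch: resolve the dynamic output shape at runtime (TRT 8.5 tensor API).
// Tensor names "input"/"output" and batch size 3 are illustrative assumptions.
void inferWithBatch(nvinfer1::ICudaEngine* engine, void* dIn, void* dOut,
                    cudaStream_t stream) {
    auto context = engine->createExecutionContext();

    // Until a shape is set, the dynamic batch dim is reported as -1.
    // Set the actual input shape for this inference (batch = 3 here).
    context->setInputShape("input", nvinfer1::Dims4{3, 3, 224, 224});

    // Only now does the output shape resolve, e.g. to 3 x C x H x W.
    nvinfer1::Dims outDims = context->getTensorShape("output");
    (void)outDims;  // use outDims to size the output buffer in real code

    // Bind device buffers by tensor name and launch with enqueueV3.
    context->setTensorAddress("input", dIn);
    context->setTensorAddress("output", dOut);
    context->enqueueV3(stream);
}
```

Again, this is a sketch requiring TensorRT and CUDA to run; buffer allocation and error handling are left out.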

I figured out that this was due to a PyTorch export bug. Sorry for the disturbance.
