How does a TensorRT plugin (IPluginV2) get enough workspace in explicit batch mode?

Description

In implicit batch mode, the TensorRT engine has a parameter, maxBatchSize. We set it via IBuilder::setMaxBatchSize, and my plugin calculates the workspace it needs in IPluginV2::getWorkspaceSize. TensorRT guarantees that the batchSize argument passed to IPluginV2::enqueue is no greater than maxBatchSize.
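As a concrete illustration of the implicit-batch contract described above, here is the kind of sizing arithmetic such a plugin might do inside getWorkspaceSize. The per-sample element count and the 256-byte alignment are hypothetical example values, not from the original post:

```cpp
#include <cstddef>

// Round n up to a multiple of a (256 bytes is a common CUDA alignment).
static std::size_t alignUp(std::size_t n, std::size_t a) {
    return (n + a - 1) / a * a;
}

// Mirrors what an IPluginV2::getWorkspaceSize(int maxBatchSize) override
// might compute: one aligned float scratch buffer per batch item.
// The allocation scales with maxBatchSize, so any enqueue() call with a
// larger batch would overflow this workspace.
std::size_t getWorkspaceSizeForBatch(int maxBatchSize, std::size_t elemsPerSample) {
    std::size_t perSample = alignUp(elemsPerSample * sizeof(float), 256);
    return perSample * static_cast<std::size_t>(maxBatchSize);
}
```

This is exactly the assumption that breaks in explicit batch mode: the formula is only safe while TensorRT enforces batchSize <= maxBatchSize.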

However, in explicit batch mode, the batchSize parameter of IExecutionContext::enqueue has no effect (at least since TensorRT 8.5.3), and IPluginV2::enqueue can receive a batchSize argument greater than maxBatchSize.

This is the problem: the workspace size is calculated from maxBatchSize, but at execution time nothing guarantees that batchSize <= maxBatchSize (maybe there is a way I don't know about), so there is a risk of a buffer overflow.

My question is: as a plugin author, how do you request a large enough workspace when getWorkspaceSize has only one parameter, maxBatchSize (still true as of TensorRT 10.7.0)? I'd be glad to hear your suggestions.
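For context: TensorRT's dynamic-shape plugin interface, IPluginV2DynamicExt, replaces the bare maxBatchSize parameter of getWorkspaceSize with full tensor descriptors, so the workspace can be sized from the tensor dimensions (which include the batch dimension in explicit batch mode). Below is a minimal sketch of that sizing logic; the Dims and PluginTensorDesc structs are simplified stand-ins for the nvinfer1 types, used only so the sketch compiles without the TensorRT headers:

```cpp
#include <cstddef>

// Simplified stand-ins for nvinfer1::Dims / nvinfer1::PluginTensorDesc,
// included only so this sketch is self-contained.
struct Dims { int nbDims; int d[8]; };
struct PluginTensorDesc { Dims dims; };

// Sketch of what an IPluginV2DynamicExt::getWorkspaceSize override might do:
// derive the scratch size from the descriptor's dimensions (the batch
// dimension is d[0] in explicit batch mode) rather than from a separately
// tracked maxBatchSize. The float scratch buffer is a hypothetical example.
std::size_t getWorkspaceSizeDynamic(const PluginTensorDesc* inputs, int nbInputs) {
    if (nbInputs < 1) return 0;
    const Dims& d = inputs[0].dims;
    std::size_t elems = 1;
    for (int i = 0; i < d.nbDims; ++i)
        elems *= static_cast<std::size_t>(d.d[i]);
    return elems * sizeof(float);
}
```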

Hi @mengzking ,
Quoting from the docs:
Some TensorRT algorithms require additional workspace on the GPU. The method IBuilderConfig::setMemoryPoolLimit() controls the maximum amount of workspace that can be allocated and prevents algorithms that require more workspace from being considered by the builder. At runtime, the space is allocated automatically when creating an IExecutionContext. The amount allocated is no more than is required, even if the amount set in IBuilderConfig::setMemoryPoolLimit() is much higher. Applications should, therefore, allow the TensorRT builder as much workspace as they can afford; at runtime, TensorRT allocates no more than this and typically less. The workspace size may need to be limited to less than the full device memory size if device memory is needed for other purposes during the engine build.

Does this help?
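For reference, the builder-side configuration the quoted passage refers to looks roughly like this (the 1 GiB ceiling is an arbitrary example value; TensorRT allocates only what the engine actually needs when the IExecutionContext is created):

```cpp
#include "NvInfer.h"

// Give TensorRT a generous workspace ceiling at build time, per the
// quoted documentation; actual runtime allocation is no more than needed.
void configureWorkspace(nvinfer1::IBuilderConfig& config) {
    config.setMemoryPoolLimit(nvinfer1::MemoryPoolType::kWORKSPACE, 1ULL << 30);
}
```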