I have some models running well on TX2 devices, and now I want to move to an NX device. I did INT8 calibration for my models from ONNX. When I load the INT8 models on my NX device, I get an error on assert(engine->getNbBindings() == 2). A few lines of the code are copied below.
engine = runtime->deserializeCudaEngine(modelStream, size);
assert(engine != nullptr);
context = engine->createExecutionContext();
assert(context != nullptr);
assert(engine->getNbBindings() == 2);
What may cause this kind of issue? How should I debug it? Are any additional code changes needed for the new INT8 models?
The error indicates that the number of binding buffers is not equal to 2.
Binding buffers correspond to the tensors marked as inputs and outputs.
For a classification model, one image input and one softmax output are expected.
So the sample checks that the binding count equals 2.
However, this is a model-dependent check.
If your model has more outputs, please modify the assertion to the corresponding number.
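To see why the assertion fails, you can enumerate the engine's bindings and print their names and shapes instead of hard-coding the count. Below is a minimal debugging sketch, assuming the TensorRT 7.x C++ API where getNbBindings(), getBindingName(), getBindingDimensions(), and bindingIsInput() are available on the deserialized engine (these calls are deprecated or removed in newer TensorRT releases):

```cpp
#include <iostream>
#include "NvInfer.h"

// Print every binding so you can see how many inputs/outputs the
// engine actually has, then adjust the assertion accordingly.
void dumpBindings(const nvinfer1::ICudaEngine* engine)
{
    const int nbBindings = engine->getNbBindings();
    std::cout << "Number of bindings: " << nbBindings << std::endl;
    for (int i = 0; i < nbBindings; ++i)
    {
        const nvinfer1::Dims dims = engine->getBindingDimensions(i);
        std::cout << "  [" << i << "] " << engine->getBindingName(i)
                  << (engine->bindingIsInput(i) ? " (input)" : " (output)")
                  << " dims:";
        for (int d = 0; d < dims.nbDims; ++d)
            std::cout << " " << dims.d[d];
        std::cout << std::endl;
    }
}
```

Calling this right after deserializeCudaEngine() shows whether the engine exposes extra output bindings; a detection model exported without a decode plugin will typically report one input plus several outputs, so the == 2 check fails even though deserialization succeeded.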
It looks like an engine model issue. At first, I used YOLOv5's export.py tool to convert my YOLOv5 PyTorch model to ONNX format, then used GitHub - qq995431104/Pytorch2TensorRT: CUDA10.0, CUDNN7.5.0, TensorRT184.108.40.206 to do INT8 calibration and generate an engine file. I always got the "assert(engine->getNbBindings() == 2)" error with this engine file. But I had no issue generating another GoogLeNet model with this tool.
Then I used GitHub - wang-xinyu/tensorrtx: Implementation of popular deep learning networks with TensorRT network definition API to do INT8 calibration and generate an engine file for my YOLOv5 model. That engine file works.
I don't know why this happens.
GoogLeNet is a classifier with one data input and one prob output, so the expected binding number is two, which meets the condition.
For YOLOv5, however, the output needs a customized parser.
It is implemented as a plugin library in the second link you shared.