Description
When i build a tensorrt engine from inceptionResnetv2 onnx model in int8 precision;There is so called horizontal layer fusion in convolutions with same input tensor and weight kernel size;(The red rectangles below)
It seems that fused convolution output as [64, 208, 35, 35], and deliver to the next convolutions directly; So these convolutions’ input channels were changed to 208 ? or there is data offset in the convolution input data?
Environment
TensorRT Version: 8.2
GPU Type: A10
CUDA Version: 11.2

