Thanks for the concise reproducers. There are two problems:
- The examples use floating-point shape tensors as network inputs, but shape-tensor I/O is limited to Int32. This limitation is buried in the C++ documentation for ITensor::isShapeTensor:
//! If a tensor is a shape tensor and becomes an engine input or output,
//! then ICudaEngine::isShapeBinding will be true for that tensor.
//! Such a shape tensor must have type Int32.
Shape tensors are tensors whose values are used to compute the dimensions of other tensors. The formal rules on what counts as a shape tensor are at https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#exe_shape_tensors (in 8.4 they can be float too, as long as they are not I/O tensors).
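For intuition, a shape tensor plays the role of `shape_tensor` in the sketch below: its element values, not its position in the data flow, determine another tensor's dimensions. This is a pure-NumPy analogy, not TensorRT API:

```python
import numpy as np

# Ordinary "execution" tensor holding data values.
data = np.arange(12, dtype=np.float32)

# Analogue of a shape tensor: its VALUES are consumed as dimensions.
# Note the Int32 dtype, which is what shape-tensor I/O requires.
shape_tensor = np.array([3, 4], dtype=np.int32)

# The values of shape_tensor define the shape of the result.
reshaped = data.reshape(tuple(shape_tensor))
# reshaped.shape == (3, 4)
```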
- TensorRT did not diagnose violation of the restriction, and instead plowed ahead until the assertion failure.
It’s too late to relax the Int32 restriction (problem 1) in TensorRT 8.4; the missing diagnostic (problem 2) we’ll fix. Since floating-point shape-tensor I/O won’t be available, I was wondering if you have a way to avoid it in the networks of real interest.
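If the float values only ever carry integral dimensions, one host-side workaround is to cast them to Int32 before they become a network input. A minimal NumPy sketch of that preprocessing step (the example values are hypothetical, and the actual TensorRT binding call is omitted):

```python
import numpy as np

# Shape values produced upstream as float32 (hypothetical example data).
float_shape = np.array([1.0, 3.0, 224.0, 224.0], dtype=np.float32)

# Confirm the values are integral so the cast is lossless.
assert np.all(float_shape == np.round(float_shape))

# Cast on the host; Int32 is the dtype shape-tensor I/O requires.
int32_shape = float_shape.astype(np.int32)
# int32_shape is now suitable to feed as an Int32 shape-tensor input.
```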