I am trying to implement sparse convolution (sparse matrix, dense kernel) in TensorRT, but there seems to be a fundamental limitation:
Dense 1d matrix:
00001110000
Sparse 1d matrix (features, indices):
1 4
1 5
1 6
Dense 1d kernel:
111
Dense result:
001232100
Sparse result:
1 2
2 3
3 4
2 5
1 6
Dense 1d matrix:
00010101000
Sparse 1d matrix:
1 3
1 5
1 7
Dense 1d kernel:
111
Dense result:
011212110
Sparse result:
1 1
1 2
2 3
1 4
2 5
1 6
1 7
So the output matrix's shape depends not only on the shape of the input matrices, but also on the data in them (specifically, on the data in the input indices matrix).
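A minimal sketch (plain C++, not TensorRT code) of the sparse 1D convolution over (features, indices) pairs, assuming "valid" convolution as in the examples above; it shows why the number of output non-zeros is data-dependent:

```cpp
#include <cstddef>
#include <map>
#include <vector>

struct SparseTensor {
    std::vector<float> features;
    std::vector<int>   indices;
};

SparseTensor sparseConv1d(const SparseTensor& in, const std::vector<float>& kernel,
                          int inputLength) {
    const int k = static_cast<int>(kernel.size());
    std::map<int, float> acc;  // output index -> accumulated value
    for (std::size_t n = 0; n < in.indices.size(); ++n) {
        // An input non-zero at index i touches outputs i-k+1 .. i;
        // out-of-range output positions are dropped (valid convolution).
        for (int t = 0; t < k; ++t) {
            const int out = in.indices[n] - t;
            if (out >= 0 && out <= inputLength - k)
                acc[out] += in.features[n] * kernel[t];
        }
    }
    SparseTensor result;
    for (const auto& [idx, val] : acc) {  // std::map keeps indices sorted
        result.features.push_back(val);
        result.indices.push_back(idx);
    }
    return result;  // 5 non-zeros for the first example, 7 for the second
}
```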
The only solution to this problem is to pre-calculate the maximum output tensor shape and pad the extra elements with -1 (a sketch of this padding step follows the two samples below):
Sample 1 (features, indices):
1 4
1 5
1 6
-1 -1
-1 -1
-1 -1
-1 -1
-1 -1
-1 -1
Sample 2 (features, indices):
1 1
1 2
2 3
1 4
2 5
1 6
1 7
-1 -1
-1 -1
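For kernel size K, each input non-zero can touch at most K distinct output positions, so a safe upper bound for the output size is numInputNonZeros * K (3 * 3 = 9 in both samples above, which is the assumption this sketch makes). Reusing the SparseTensor struct from the first sketch:

```cpp
// Pad a sparse result up to the data-independent worst case so that the
// output shape can be fixed ahead of time. Padded entries are marked -1,
// matching the samples above.
void padToMaxSize(SparseTensor& t, std::size_t numInputNonZeros, std::size_t kernelSize) {
    const std::size_t maxSize = numInputNonZeros * kernelSize;  // 3 * 3 = 9 here
    t.features.resize(maxSize, -1.0f);
    t.indices.resize(maxSize, -1);
}
```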
But now every consumer of the sparse output of the sparse convolution must be aware of these padded elements, i.e. it must be able to slice these tensors back to their real size (a sketch of that slicing follows the samples):
Sample 1 (features, indices):
1 4
1 5
1 6
Sample 2 (features, indices):
1 1
1 2
2 3
1 4
2 5
1 6
1 7
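For illustration, this is roughly what every such consumer would have to do with the padded representation: find the first padded entry and shrink both arrays to that length (again reusing the SparseTensor struct from the first sketch):

```cpp
// Slice a padded sparse tensor back to its real size by locating the
// first padded entry (index == -1).
void sliceToRealSize(SparseTensor& t) {
    std::size_t realSize = t.indices.size();
    for (std::size_t n = 0; n < t.indices.size(); ++n) {
        if (t.indices[n] == -1) { realSize = n; break; }
    }
    t.features.resize(realSize);  // 3 for Sample 1, 7 for Sample 2
    t.indices.resize(realSize);
}
```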
What I request: let the shapes of the arguments of the enqueue(...) function be the sizes of the SLICED tensors (3 and 7 in this case), while the sizes returned by getOutputDimensions(...) remain the "maximum padded" tensor shapes (9 in this case). This way the static TensorRT memory allocator still has all the information it needs to allocate memory before engine execution starts, but consumers of the sparse convolution output receive tensors without any padding.
An example of an API change that would allow returning sliced tensor shapes: in enqueue(...), change const nvinfer1::PluginTensorDesc* outputDesc to nvinfer1::PluginTensorDesc* outputDesc.
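To make the request concrete, here is a hypothetical sketch of how a plugin could use such an API. kKernelSize is an assumed constant for this sketch, and the non-const outputDesc is exactly the proposed change: the current IPluginV2DynamicExt::enqueue declares it const, so this does not compile against today's TensorRT headers.

```cpp
#include <NvInfer.h>
#include <cuda_runtime_api.h>

constexpr int kKernelSize = 3;  // assumed constant for this sketch

// getOutputDimensions still reports the data-independent worst case
// (input non-zeros * kernel size, i.e. 9 above), so the static TensorRT
// allocator can reserve enough memory before execution.
nvinfer1::DimsExprs getOutputDimensions(
    int outputIndex, const nvinfer1::DimsExprs* inputs, int nbInputs,
    nvinfer1::IExprBuilder& exprBuilder)
{
    nvinfer1::DimsExprs out = inputs[0];
    out.d[0] = exprBuilder.operation(nvinfer1::DimensionOperation::kPROD,
                                     *inputs[0].d[0],
                                     *exprBuilder.constant(kKernelSize));
    return out;
}

// enqueue with a NON-const outputDesc (the requested change): once the
// kernel knows the actual number of output non-zeros, the plugin reports
// the sliced size so downstream layers never see the padding.
int enqueue(const nvinfer1::PluginTensorDesc* inputDesc,
            nvinfer1::PluginTensorDesc* outputDesc,  // was: const nvinfer1::PluginTensorDesc*
            const void* const* inputs, void* const* outputs,
            void* workspace, cudaStream_t stream)
{
    int realCount = 0;
    // ... launch the sparse convolution kernel here; it fills outputs[0]
    //     and computes the actual non-zero count into realCount ...
    outputDesc[0].dims.d[0] = realCount;  // e.g. 3 or 7 instead of 9
    return 0;
}
```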
This is highly desirable because I am trying to convert a large existing PyTorch model to TensorRT, and it is hard to make every layer work properly with padded tensors as inputs.