Problems with converting Einsum operations to the tensorrt

madmldev32 · June 2, 2022, 11:24am

Problem

Hello,

I am trying to convert LOFTR pytorch model to tensorrt. The model is just combination of CNN with Transformers, but the implementation uses a lot of einsum operation so this is what I think gives me several errors. Because I tested the entire pipeline on other models and it worked well, so I don’t think I have errors in the code.

This is the process.

I converted model to the onnx and tested it using onnxruntime, it works well. Initially I used opset = 15, but now decreased to 12. Results are same for both versions.

with torch.no_grad():
        torch.onnx.export(model,
                          input_dict, 
                          out_filename, 
                          do_constant_folding=True,
                          verbose=True, 
                          opset_version=12)

This is the code for onnx conversion.

Then I use the generated file for trt conversion. During parsing the onnx file I get the following error,

[TRT] [E] [layers.cpp::EinsumLayer::5821] Error Code 3: API Usage Error (Parameter check failed at: optimizer/api/layers.cpp::EinsumLayer::5821, condition: nbInputs > 0 && nbInputs <= MAX_EINSUM_NB_INPUTS
)

but it finishes the parsing and continues to build the model.

plan = builder.build_serialized_network(network, config)

with trt.Runtime(TRT_LOGGER) as runtime:
	engine = runtime.deserialize_cuda_engine(plan)

I use this script, for generation but it produced error of

network.cpp::validate::2726] Error Code 4: Internal Error (Network must have at least one output

After this I was curious to look if the model is parsed correctly at all and when I print

print(network.num_outputs)

after model parsing, it returns 0. I also added following line to my model

network.mark_output(network.get_layer(network.num_layers - 1).get_output(0))

This produces the following error

[TRT] [E] 2: [checkSanity.cpp::checkSanity::94] Error Code 2: Internal Error (Assertion tensors.size() == g.tensors.size() failed.

I got stuck here, I am not sure which one I have to fix, whether the error is in parsing or in generating step.

Environment

TensorRT Version: 8.4.0.6
GPU Type: RTX 2080
Nvidia Driver Version: 460.73.01
CUDA Version: 11.2
Operating System + Version: “Debian GNU/Linux 10”
Python Version (if applicable): 3.9
PyTorch Version (if applicable): 1.11.0

NVES · June 2, 2022, 11:37am

Hi,
Request you to share the ONNX model and the script if not shared already so that we can assist you better.
Alongside you can try few things:

validating your model with the below snippet

check_model.py

import sys
import onnx
filename = yourONNXmodel
model = onnx.load(filename)
onnx.checker.check_model(model).
2) Try running your model with trtexec command.

In case you are still facing issue, request you to share the trtexec “”–verbose"" log for further debugging
Thanks!

madmldev32 · June 2, 2022, 11:52am

Thank you for the quick reply.

I tested the model using the 1st approach check_model.py and didnt print any errors.
for the model its uploaded here
https://drive.google.com/file/d/1Vai-DCioKvd0KYmVWlE6Wyo7w_8RR_Mr/view?usp=sharing

madmldev32 · June 3, 2022, 11:43am

Hello,
Sorry for rushing but did you find anything useful ? Maybe you also have some question I could clarify.

madmldev32 · June 6, 2022, 10:24am

If anyone also has a similar problem.

torch.einsum("nlhd,nhdv,nlh->nlhv", x1,x2,x3)

this kind of einsum operation is not supported yet and is not parsed by tensorrt exec. I changed this with custom operations and it worked.

spolisetty · June 7, 2022, 4:53am

Hi,

Yes. TRT’s einsum doesn’t support more than two inputs, the above error is trying to convey the same. We will improve this error message in future releases.

Thank you.

will.stone · February 2, 2023, 4:47pm

Hi @madmldev32, I’ve modified the einsum operations, but I am still getting a lot of other issues popping up with other parts of the model implementation when I attempt to convert to ONNX format. Were the einsum replacements really all you changed?

skumar5 · June 17, 2024, 11:55pm

Could you help me with the custom operation? I am facing the same issue except in my case, I can’t modify the source code itself because I am using a .pth model as the source.
I am performing .pth → .onnx → .trt conversion.

Topic		Replies	Views
No importer registered for op: Einsum TensorRT	5	1984	June 8, 2020
Questions about custom plugin? TensorRT	10	1008	July 7, 2021
LSTM ONNX to TensorRT mismatched outputs TensorRT tensorrt	3	1080	September 29, 2022
Unable to convert model to TensorRT when do_constant_folding=False TensorRT cudnn	1	650	February 29, 2024
[TensorRT] Converted ONNX model inference error TensorRT tensorrt	5	2207	October 12, 2021
Parseq tensorrt conversion takes for ever to complete TensorRT cudnn	1	123	August 30, 2024
Unable to convert ONNX model to TensorRT TensorRT tensorrt , pytorch , onnx	6	3645	September 30, 2020
ONNX to TensorRT conversion TensorRT	3	814	July 6, 2023
Problem converting TensorFlow 2-> ONNX model to TensorRT Engine (efficientdet_d0) TensorRT	8	1540	November 17, 2022
[TensorRT] INTERNAL ERROR: Assertion failed: validateInputsCutensor(src, dst) TensorRT	3	2954	October 12, 2021

Problems with converting Einsum operations to the tensorrt

Problem

Environment

check_model.py

Related topics