Error Code 10: Internal Error (Could not find any implementation for node {ForeignNode[lm_head.bias.../Cast]}.)


Hi there,

I want to convert a statically quantized transformer model to TensorRT. It is a CodeGenForCausalLM from the transformers library. I used the following command to convert the model:

 trtexec --onnx=trt/decoder_model_quantized.onnx --int8 --minShapes=input_ids:1x1,attention_mask:1x1 --maxShapes=input_ids:1x512,attention_mask:1x512 --saveEngine=model-quantized.onnx.plan --device=0 --allowGPUFallback --useCudaGraph

The error is as follows:

[03/22/2023-14:01:53] [I] [TRT] Local timing cache in use. Profiling results in this builder pass will not be stored.
[03/22/2023-14:02:03] [W] [TRT] Myelin graph with multiple dynamic values may have poor performance if they differ. Dynamic values are: 
[03/22/2023-14:02:03] [W] [TRT]  (- 0 (CAST_F_TO_I (FLOOR (DIV_F (MUL_ADD_F -1 (CAST_I_TO_F sequence_length) 0) 1))))
[03/22/2023-14:02:03] [W] [TRT]  sequence_length
[03/22/2023-14:02:04] [W] [TRT] Skipping tactic 0x0000000000000000 due to Myelin error: [canonicalize_axis] Operation /transformer/ln_f/Constant_1_output_0_QuantizeLinear has out of range axis value 0.
[03/22/2023-14:02:04] [E] Error[10]: [optimizer.cpp::computeCosts::3728] Error Code 10: Internal Error (Could not find any implementation for node {ForeignNode[lm_head.bias.../Cast]}.)
[03/22/2023-14:02:04] [E] Error[2]: [builder.cpp::buildSerializedNetwork::751] Error Code 2: Internal Error (Assertion engine != nullptr failed. )
[03/22/2023-14:02:04] [E] Engine could not be created from network
[03/22/2023-14:02:04] [E] Building engine failed
[03/22/2023-14:02:04] [E] Failed to create engine from model or file.
[03/22/2023-14:02:04] [E] Engine set up failed


Docker Image:
TensorRT Version 8.501
Python 3.8.10
transformers 4.26.1
optimum 1.7.1
onnx 1.12.0
onnxruntime-gpu 1.14.1
pytorch-quantization 2.1.2
pytorch-triton 2.0.0+b8b470bc59
torch 1.13.1
torch-tensorrt 1.3.0
torchtext 0.13.0a0+fae8e8c
torchvision 0.15.0a0

Steps To Reproduce

  1. Convert the transformer model to ONNX via optimum-cli.
  2. Quantize the model (we did this exactly as in the quantization guide).
  3. Run the model as suggested by trtexec.
  4. Run the command from above.
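
Before handing the quantized model to trtexec, it can help to sanity-check that it still runs under onnxruntime with the same dynamic shapes. A minimal sketch, assuming the file name from the command above and that the graph takes `input_ids` and `attention_mask` of shape batch x sequence (both assumptions taken from the trtexec shape profile, not verified against the actual model):

```python
import numpy as np

def make_feed(batch, seq_len):
    """Build dummy int64 inputs matching the trtexec shape profile
    (minShapes=1x1 ... maxShapes=1x512)."""
    return {
        "input_ids": np.ones((batch, seq_len), dtype=np.int64),
        "attention_mask": np.ones((batch, seq_len), dtype=np.int64),
    }

# Usage (assumes onnxruntime-gpu from the environment listed below):
#   import onnxruntime as ort
#   sess = ort.InferenceSession("trt/decoder_model_quantized.onnx",
#                               providers=["CUDAExecutionProvider"])
#   outputs = sess.run(None, make_feed(1, 512))
```

If this already fails under onnxruntime, the problem is in the export or quantization step rather than in TensorRT.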

Thank you very much for any help!

Hi,
We recommend that you check the supported features at the link below.

You can refer to the link below for the full list of supported operators.
For unsupported operators, you need to create a custom plugin to support the operation.
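
To see which operators the exported graph actually uses, and compare them against the supported-operator list, here is a minimal sketch using the `onnx` Python package (already in the environment above); the model path is assumed from the trtexec command:

```python
from collections import Counter

def count_ops(graph):
    """Count how often each op type appears in an ONNX graph's node list."""
    return Counter(node.op_type for node in graph.node)

# Usage (assumes the quantized model file from the original post):
#   import onnx
#   model = onnx.load("trt/decoder_model_quantized.onnx")
#   for op, n in count_ops(model.graph).most_common():
#       print(op, n)
```

Any op type printed that is not on the supported-operator list is a candidate for a custom plugin.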


Thank you for your reply.

It seems that lm_head.bias.../Cast is not in the list of supported operators, right?