Runtime.deserialize_cuda_engine returns None, how to fix it?

787939533 · June 16, 2022, 10:15am

Description

runtime.deserialize_cuda_engine(serialized_engine) returns None.
I have searched the forum, but found no suitable solution.

Environment

TensorRT Version:
GPU Type: GTX1660
Nvidia Driver Version: 465.89
CUDA Version: 11.3
CUDNN Version: 8.4.0
Operating System + Version: win10
Python Version (if applicable): 3.8
TensorFlow Version (if applicable):
PyTorch Version (if applicable): 1.11.0+cu113
Baremetal or Container (if container which image + tag):

Code below:

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)

def build_engine(onnx_file_path):
    explicit_batch_flag = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
    with trt.Builder(logger) as builder, \
            builder.create_network(explicit_batch_flag) as network, \
            builder.create_builder_config() as config:
        parser = trt.OnnxParser(network, logger)

        # Parse the ONNX file once and print any parser errors.
        print('Beginning ONNX file parsing')
        success = parser.parse_from_file(onnx_file_path)
        for idx in range(parser.num_errors):
            print(parser.get_error(idx))
        if not success:
            raise RuntimeError('Failed to parse ' + onnx_file_path)
        print("Complete parsing of ONNX file")

        #config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE,1<<20)
        config.max_workspace_size = 1 << 30

        builder.max_batch_size = 1
        if builder.platform_has_fast_fp16:
            config.set_flag(trt.BuilderFlag.FP16)

        print('Building an engine...')
        '''last_layer = network.get_layer(network.num_layers-1)
        network.mark_output(last_layer.get_output(0))'''
        # build_serialized_network returns None when the build fails.
        engine = builder.build_serialized_network(network, config)
        print("Completed create Engine")

    return engine

engine = build_engine("7_class_cuda.onnx")

# The with block closes the file, so no explicit f.close() is needed.
with open("sample.engine", "wb") as f:
    f.write(engine)

with open("sample.engine", "rb") as f:
    serialized_engine = f.read()

runtime = trt.Runtime(logger)
engine_ = runtime.deserialize_cuda_engine(serialized_engine)  # returns None here

NVES · June 16, 2022, 10:37am

Hi,
Please share the ONNX model and the script, if not shared already, so that we can assist you better.
Meanwhile, you can try a few things:

1) Validate your model with the snippet below:

check_model.py

import onnx
filename = "yourONNXmodel"  # replace with the path to your ONNX model
model = onnx.load(filename)
onnx.checker.check_model(model)

2) Try running your model with the trtexec command.

If you are still facing issues, please share the trtexec "--verbose" log for further debugging.
Thanks!
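
For reference, a minimal sketch of step 2 driven from Python, assuming trtexec is on the PATH. The helper names trtexec_command and run_trtexec and the log file name are hypothetical; the flags used (--onnx, --saveEngine, --verbose, --fp16) are standard trtexec options.

```python
import shutil
import subprocess


def trtexec_command(onnx_path, engine_path, fp16=False):
    """Return the trtexec argument list that builds an engine with verbose logging."""
    cmd = [
        "trtexec",
        f"--onnx={onnx_path}",
        f"--saveEngine={engine_path}",
        "--verbose",
    ]
    if fp16:
        cmd.append("--fp16")
    return cmd


def run_trtexec(onnx_path, engine_path, log_path="trtexec_verbose.log", fp16=False):
    """Run trtexec, write its combined output to log_path, and return the exit code."""
    if shutil.which("trtexec") is None:
        raise FileNotFoundError("trtexec not found on PATH")
    cmd = trtexec_command(onnx_path, engine_path, fp16)
    with open(log_path, "w") as log:
        result = subprocess.run(cmd, stdout=log, stderr=subprocess.STDOUT)
    return result.returncode
```

Calling run_trtexec("7_class_cuda.onnx", "sample.engine") would leave the verbose log in trtexec_verbose.log, which is the file to attach to the thread.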

787939533 · June 17, 2022, 3:40am

Hi,
It seems my ONNX model is too large to upload (it is 119 MB). Do you have any suggestion for how I can share it?

However, everything works if I don't load the file from "sample.engine" (you can find this part in my code) and instead directly use the engine obtained from my build_engine function.

spolisetty · June 17, 2022, 3:08pm

Hi,

Which version of TensorRT are you using? Please use the latest TensorRT version.
Also, please refer to the following sample and make sure your script is correct.

https://github.com/NVIDIA/TensorRT/blob/main/samples/python/introductory_parser_samples/onnx_resnet50.py


Thank you.

787939533 · June 20, 2022, 1:24am

My TensorRT version is 8.4.1.5. Actually, my problem can be simplified to this: I can't read the engine back from a binary file.
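
One way to narrow this down is to confirm that the bytes read back from disk are exactly the bytes the builder produced. The following is only a sketch: write_engine and read_engine are hypothetical helper names, and it assumes the serialized engine supports the buffer protocol (as bytes objects do, and as the object written with f.write(engine) in the original script must).

```python
import hashlib


def write_engine(serialized, path):
    """Write a serialized engine to path and return (size_in_bytes, sha256_hex)."""
    if serialized is None:
        raise RuntimeError("the builder returned None; check the build log")
    view = memoryview(serialized)
    with open(path, "wb") as f:
        f.write(view)
    return view.nbytes, hashlib.sha256(view).hexdigest()


def read_engine(path, expected_size, expected_sha256):
    """Read the engine file back and verify it matches what was written."""
    with open(path, "rb") as f:
        data = f.read()
    if len(data) != expected_size:
        raise IOError(f"size mismatch: expected {expected_size}, got {len(data)}")
    if hashlib.sha256(data).hexdigest() != expected_sha256:
        raise IOError("checksum mismatch: the file differs from the built engine")
    return data
```

If both checks pass, the file itself is intact and the failure lies in deserialization rather than in the file I/O.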

spolisetty · June 29, 2022, 9:46am

Hi,

Have you tried the given sample code?

787939533 · July 4, 2022, 9:25am

Sorry for the late reply. Yes, I have tried that code. Unfortunately, when I retried it, it didn't work: same code and same process, but a different result. @spolisetty

spolisetty · July 4, 2022, 11:16am

Could you please share the latest script along with the model and error logs for better debugging?

spolisetty · July 8, 2022, 1:25pm

Hi,

Based on the error, it seems that your plugin library is not being registered for some reason.
Could you please check whether your plugin is registered properly, or force plugin initialization using trt.init_libnvinfer_plugins(None, '')?
https://docs.nvidia.com/deeplearning/tensorrt/api/python_api/infer/Plugin/IPluginRegistry.html
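
A sketch of forcing plugin registration before deserializing, with an explicit check for a None result. load_engine is a hypothetical helper, and its trt parameter is an assumption added only so the function can be exercised with a stand-in module; by default it imports tensorrt. The runtime is returned together with the engine so that it stays alive as long as the engine is in use.

```python
def load_engine(engine_path, trt=None):
    """Register the bundled plugins, then deserialize the engine at engine_path."""
    if trt is None:
        import tensorrt as trt  # imported lazily; requires a TensorRT install

    # VERBOSE logging makes TensorRT print the reason when deserialization fails.
    logger = trt.Logger(trt.Logger.VERBOSE)

    # Register the plugins shipped with TensorRT in the default ("") namespace.
    trt.init_libnvinfer_plugins(logger, "")

    with open(engine_path, "rb") as f:
        serialized = f.read()

    runtime = trt.Runtime(logger)
    engine = runtime.deserialize_cuda_engine(serialized)
    if engine is None:
        raise RuntimeError(
            "deserialize_cuda_engine returned None; see the TensorRT log output"
        )
    return runtime, engine
```

Usage would be runtime, engine = load_engine("sample.engine"); a RuntimeError then replaces the later AttributeError on a None engine.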

How are you installing TensorRT?

Also, the following similar issue may help you:

github.com/onnx/onnx-tensorrt (issue opened 22 Dec 2020 by jinfagang, closed 14 Mar 2022)

[TRT] INVALID_ARGUMENT: getPluginCreator could not find plugin InstanceNormalization_TRT version 1
[12/22/2020-12:46:22] [E] [TRT] safeDeserializationUtils.cpp (322) - Serialization Error in load: 0 (Cannot deserialize plugin since corresponding IPluginCreator not found in Plugin Registry)
[12/22/2020-12:46:22] [E] [TRT] INVALID_STATE: std::exception

How do I find this plugin when `onnx2trt` generates an engine file that contains a custom plugin used in onnx-tensorrt?

787939533 · July 14, 2022, 8:37am

Sorry for the late reply. I did try trt.init_libnvinfer_plugins(None, ''), but the program still crashes sometimes. @spolisetty

spolisetty · July 15, 2022, 1:39pm

Hi,

How are you installing TensorRT? This issue may also be due to an incorrect setup.
Could you please try reinstalling, following the official installation steps?

Thank you.
