About pytorch2onnx2TRT: serialize and deserialize

I converted my model from PyTorch to ONNX, then to TRT, and it works well.
But when I try to save the engine as a '.engine' file with the C++ TRT API and load it back, I get errors like this:
getPluginCreator could not find plugin ResizeNearest version 001 namespace
Cannot deserialize plugin ResizeNearest
Someone says this is because the ResizeNearest plugin is implemented inside the TensorRT ONNX parser.
link:
https://devtalk.nvidia.com/default/topic/1050845/tensorrt/tensorrt-5-1-c-api-cannot-deserialize-retinanet-trt-engine/post/5333756/#5333756

I get the .onnx as follows:

from onnx import helper
helper.make_node(
    'Upsample',
    mode='nearest',
    scales=[1.0, 1.0, 2, 2],
    inputs=inputs,
    outputs=[layer_name],
    name=layer_name,
)...
and I got:
onnx::Upsample[mode = 'nearest', scales = [1, 1, 2, 2]]...

How can I solve this problem?

Hello, can you provide details on the platforms you are using?

Linux distro and version
GPU type
nvidia driver version
CUDA version
CUDNN version
Python version [if using python]
Tensorflow version
TensorRT version

Also, a small repro of the source/model that demonstrates the error you are seeing will help us debug with you.

Under a WIN10 environment:
CUDA: cuda_9.0.176_win10
cuDNN: cudnn-9.0-windows10-x64-v7.5.0.56

I exported the .onnx model from Python 3.5, PyTorch 1.0 (with some modifications to the export code, because the upsample operation caused a fault; link: GitHub - NVIDIA/retinanet-examples: Fast and accurate object detection with end-to-end GPU optimization).
I ran this ONNX model on VS2015 with TensorRT 5.0.4.3 and got the right result; it works well.

ONNX IR version: 0.0.3
Opset version: 9
Producer name: pytorch
Producer version: 0.4
Domain:
Model version: 0
Doc string:

But when I try to save the model as follows:

void onnxToTRTModel(const std::string& modelFile,
                    unsigned int maxBatchSize,
                    IHostMemory*& trtModelStream)
{
    int verbosity = (int) nvinfer1::ILogger::Severity::kWARNING;
    IBuilder* builder = createInferBuilder(gLogger);
    nvinfer1::INetworkDefinition* network = builder->createNetwork();
    auto parser = nvonnxparser::createParser(*network, gLogger);
    if (!parser->parseFromFile(locateFile(modelFile, directories).c_str(), verbosity))
    {
        string msg("failed to parse onnx file");
        gLogger.log(nvinfer1::ILogger::Severity::kERROR, msg.c_str());
        exit(EXIT_FAILURE);
    }

    builder->setMaxBatchSize(maxBatchSize);
    builder->setMaxWorkspaceSize(1 << 20);
    samplesCommon::enableDLA(builder, gUseDLACore);
    ICudaEngine* engine = builder->buildCudaEngine(*network);
    assert(engine);
    parser->destroy();
    trtModelStream = engine->serialize();
    engine->destroy();
    network->destroy();
    builder->destroy();

    /* save trtModelStream as .engine */
    std::fstream file;
    file.open("./serialize_engine_output.engine", ios::binary | ios::out);
    file.write((const char*) trtModelStream->data(), trtModelStream->size());
    file.close();
}

and try to load the model as follows:


IRuntime* runtime = createInferRuntime(gLogger);
std::fstream file;
file.open("./serialize_engine_output.engine", ios::binary | ios::in);
file.seekg(0, ios::end);
int length_ = file.tellg();
file.seekg(0, ios::beg);
std::unique_ptr<char[]> data_(new char[length_]);  // char[] so delete[] is used
file.read(data_.get(), length_);
file.close();
ICudaEngine* engine = runtime->deserializeCudaEngine(data_.get(), length_, nullptr);


Then the fault appears as follows:

ERROR: 00007FFBC2DCF2D0ResizeNearest version 001 namespace
ERROR: Cannot deserialize plugin ResizeNearest
ERROR: 00007FFBC2DCF2D0ResizeNearest version 001 namespace
ERROR: Cannot deserialize plugin ResizeNearest
ERROR: 00007FFBC2DCF2D0ResizeNearest version 001 namespace
ERROR: Cannot deserialize plugin ResizeNearest
ERROR: 00007FFBC2DCF2D0ResizeNearest version 001 namespace
ERROR: Cannot deserialize plugin ResizeNearest