Why does the program pause when deserializeCudaEngine is called?

I use engine->serialize() to serialize a Caffe model into a TensorRT engine, as follows.

  1. serialize caffemodel
    gieModelStream = engine->serialize();
    std::ofstream outfile(model_rt_file, std::ios::out | std::ios::binary);
    unsigned char* streamdata = (unsigned char*)gieModelStream->data();
    outfile.write((char*)streamdata, gieModelStream->size());
  2. read tensorRT model
    std::ifstream in_file(model_rt_file, std::ios::in | std::ios::binary);
    std::streampos begin, end;
    begin = in_file.tellg();
    in_file.seekg(0, std::ios::end);
    end = in_file.tellg();
    std::size_t size = end - begin;
    in_file.seekg(0, std::ios::beg);
    std::unique_ptr<unsigned char> engine_data(new unsigned char);
    in_file.read((char*)engine_data.get(), size);
    infer = createInferRuntime(gLogger);
    engine = infer->deserializeCudaEngine((const void*)engine_data.get(), size, &pluginFactory);

But when execution reaches engine = infer->deserializeCudaEngine((const void*)engine_data.get(), size, &pluginFactory), the program pauses and does not output any information. The problem occurs intermittently, about once in ten attempts. Has anybody met the same situation?


We haven't received a report on this topic before.
Not sure if there is something incorrect in how the serialized engine is written to or read back from the IO stream.

Could you share a complete source for us to reproduce this issue in our side?

Sorry for the late reply. For certain reasons I cannot share the complete code, but the code I pasted already tells the story. The problem appears to be GPU-dependent: it occurs on an NVIDIA Tesla P4 but never on an NVIDIA Tesla P40.


Could you try another model to check whether this issue is model-dependent?