I managed to parse a network from Caffe and serialize it.
Using the following program, I successfully saved the serialized result to the "serialized_engine.txt" file.
IHostMemory* serializedModel = engine->serialize();
…
// store model to disk
std::ofstream ofs("serialized_engine.txt", std::ios::out | std::ios::binary);
ofs.write(static_cast<const char*>(serializedModel->data()), serializedModel->size());
ofs.close();
But I don’t know how to use "serialized_engine.txt" and haven’t found anything useful in the examples.
Does anyone know how to turn "serialized_engine.txt" back into the serialized model (and then an engine) for inference?
Please take a look at the latest blog post below. The section "Reuse the TensorRT Engine" describes the deserialization process. In short, you need to read the binary file into memory (e.g. a character array), create an IRuntime object, and use its method to recreate the engine from the serialized data.
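For illustration, a minimal sketch of that flow is shown below. This is an assumption of the general shape, not the blog's exact code: gLogger stands for an existing nvinfer1::ILogger implementation you already have, and the exact deserializeCudaEngine signature depends on your TensorRT version (older releases take a third IPluginFactory* argument, which can be nullptr if no plugins are used).

#include <fstream>
#include <iterator>
#include <string>
#include "NvInfer.h"

// Read the whole serialized engine file into memory.
std::ifstream ifs("serialized_engine.txt", std::ios::in | std::ios::binary);
std::string buffer((std::istreambuf_iterator<char>(ifs)),
                   std::istreambuf_iterator<char>());
ifs.close();

// Recreate the engine from the serialized bytes.
nvinfer1::IRuntime* runtime = nvinfer1::createInferRuntime(gLogger);
nvinfer1::ICudaEngine* engine =
    runtime->deserializeCudaEngine(buffer.data(), buffer.size(), nullptr);
// On recent TensorRT versions, deserializeCudaEngine takes only data and size.

Once the engine is recreated, you can create an execution context from it and run inference exactly as if you had just built the engine from the Caffe model.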
In the code example within “Reuse the TensorRT Engine”, there is the following line:
string buffer = readBuffer(enginePath);
I assume the "readBuffer" function just takes the contents of the engine file and places them within the string buffer. Could you please elaborate a little on your preferred method of doing this?
You are correct. The purpose of the readBuffer function is to read a binary file and place its contents into a collection of characters. You can find its source code at the GitHub location below. For reference, I have also copy-pasted the function definition.
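In case the repository moves, a minimal sketch of such a readBuffer is given here. This is an assumption of the general shape based on the description above, not necessarily the exact code from the linked repository:

#include <fstream>
#include <sstream>
#include <string>

// Reads an entire binary file into a std::string; returns an empty
// string if the file cannot be opened.
std::string readBuffer(const std::string& path)
{
    std::string buffer;
    std::ifstream stream(path, std::ios::binary);
    if (stream)
    {
        std::ostringstream os;
        os << stream.rdbuf();
        buffer = os.str();
    }
    return buffer;
}

The returned string can then be passed to deserializeCudaEngine via buffer.data() and buffer.size(), as in the earlier example.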