Run TF-TRT graph through TF C++ API

Hello,
I want to use TF-TRT Python API to optimize the graph,
and then use TF C++ API for deployment on NVIDIA Xavier.
Is the TF C++ API capable of running a TF-TRT optimized graph?

Also, what is the preferred way of deploying TRT optimized model on Jetson?
According to item 8.1 of https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html
“Note: The UFF Parser which is used to parse a network in UFF format will be deprecated in a future release. The recommended method of importing TensorFlow models to TensorRT is using TensorFlow with TensorRT (TF-TRT).”
Meanwhile, item 7.2 says to use the TRT C++ API with UFF as an intermediate format.
Thanks.

Hello,

You can convert the model to TRT using TF-TRT and serialize it to a .plan file. Then deserialize the .plan file using the C++ API.
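To make the round trip concrete, here is a minimal pure-Python sketch of what a .plan file is: nothing more than the raw serialized engine bytes written to disk. The byte string below is a placeholder, not a real engine; a real plan would come from TF-TRT or the TensorRT builder.

```python
# Minimal sketch: a .plan file is just the raw serialized engine bytes.
# The byte string here is a placeholder standing in for engine->serialize();
# a real plan would be produced by TF-TRT or the TensorRT builder.
import os
import tempfile

engine_bytes = b"\x00placeholder-serialized-engine"

plan_path = os.path.join(tempfile.mkdtemp(), "model.plan")

# Write the plan, as planFile.write(...) does on the C++ side.
with open(plan_path, "wb") as f:
    f.write(engine_bytes)

# Read it back, as runtime->deserializeCudaEngine(...) would consume it.
with open(plan_path, "rb") as f:
    restored = f.read()

print(restored == engine_bytes)  # the bytes round-trip unchanged
```

The point is that the plan format is opaque: whichever side writes it, the C++ runtime just needs the exact same bytes back (and, per the note below, on the same GPU model and TensorRT version).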
See:
https://docs.nvidia.com/deeplearning/dgx/tf-trt-user-guide/index.html#tensorrt-plan
and
https://docs.nvidia.com/deeplearning/sdk/tensorrt-developer-guide/index.html#serial_model_c

Thanks.

Hi,
Thanks for the links, but I still don't understand.
Until now, I had TF and TRT on my workstation with a Tesla V100.
The workflow was:

  1. Build the model and parse it to UFF on the workstation:
     uff.from_tensorflow()
  2. Convert to an engine and then serialize it on one Xavier with the TRT C++ API:
     ParseUFF...
     ICudaEngine *engine = builder->buildCudaEngine(*network);
     IHostMemory *serializedEngine = engine->serialize();
     planFile.write((char *)serializedEngine->data(), serializedEngine->size());
     planFile.close();
  3. Deploy and deserialize on multiple Xaviers with the TRT C++ API:
     IRuntime *runtime = createInferRuntime(gLogger);
     ICudaEngine *engine = runtime->deserializeCudaEngine(modelData, modelSize, nullptr);

Now, if I use the TF-TRT API on my workstation with

trt.create_inference_graph
  1. Is the model serialized?

  2. Can I deserialize on Xavier directly with TRT C++ API?
    You write:
    Note: Serialized engines are not portable across platforms or TensorRT versions. Engines are specific to the exact GPU model they were built on (in addition to platforms and the TensorRT version).

  3. If not, can you please indicate the steps?
    a. How/where to build the engine?
    b. How/where to serialize?

I’m a little bit confused here …

Thanks

https://docs.nvidia.com/deeplearning/dgx/tf-trt-user-guide/index.html#tensorrt-plan
states that “This feature requires that your entire model converts to TensorRT”.
What about nets that don’t convert entirely, like SSD+Mobilenet?
Can I just load and run a TF-TRT optimized graph through the TF C++ API in a standard way? (I know that I need to build the TF C++ API from source on Xavier.)
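One rough way to see how much of a graph TF-TRT actually converted is to count TRTEngineOp nodes versus remaining native TF ops. This is a hedged sketch: the graph is mocked here as a plain list of op-type strings; with a real TensorFlow GraphDef you would look at node.op for each node in graph_def.node.

```python
# Sketch: after TF-TRT conversion, a graph that did NOT convert entirely
# (e.g. SSD+MobileNet, whose NMS ops typically stay in TensorFlow) still
# contains native TF ops alongside TRTEngineOp nodes. The graph is mocked
# as a list of op-type strings for illustration only.
mock_graph_ops = ["TRTEngineOp", "NonMaxSuppressionV3", "TRTEngineOp", "Identity"]

trt_segments = sum(op == "TRTEngineOp" for op in mock_graph_ops)
native_tf_ops = len(mock_graph_ops) - trt_segments

# Only a fully converted graph (no native TF ops left) can be exported as
# a single standalone .plan and run with the pure TRT C++ API; otherwise
# the TF runtime is still needed to execute the unconverted ops.
fully_converted = native_tf_ops == 0

print(trt_segments, native_tf_ops, fully_converted)
```

In this mock, two TRT segments coexist with two native TF ops, so the graph is not fully converted and cannot be deployed as a standalone plan.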

Can I just load and run TF-TRT optimized graph through TF C++ API in a standard way

Hi NVESJJ
Could you answer this question? Is there C++ sample code to run TF-TRT?

Hello NVIDIA,

Would it be possible to have C++ samples, as requested above?

Also, when we do the following builds of TensorFlow:

  • bazel build --config=tensorrt //tensorflow:libtensorflow.so
  • bazel build --config=tensorrt //tensorflow/tools/lib_package:libtensorflow

Do the generated C/C++ libraries have TensorRT support as well, or is it only the pip package?

Thanks in advance!

We are working on a C++ sample for TF-TRT.
Please stay tuned.

It will be published soon at https://github.com/tensorflow/tensorrt

Thanks for the update!

Hi,

I want to ask the same question: "Can I just load and run a TF-TRT optimized graph through the TF C++ API in a standard way?" Also, is there any update on the C++ sample for TF-TRT?

Is this example already online?
Actually, I ask because my model did not work with pure TensorRT 5 on the Jetson Nano, and I wanted to try it with TensorFlow.

It has been 6 months since this post and there is still no C++ example in the TF-TRT repository. Is this ever going to happen?

Yeah, I am starting to look into TF-TRT too. I was able to convert/build an object detection (OD API v2) model, but I can't see any clear C++ sample for running inference on it.