Will there be any advantage in inference speed if I use Python to execute the inference when the .plan was created with C++?

Hi

When creating a .plan file using C++, either:

  • UFF to plan, OR
  • tf2onnx to plan

Will there be any advantage in inference speed if I use Python to execute the inference?

To summarize:

  1. C++: UFF/tf2onnx to plan
  2. Python: running inference

Thank you

Hi,

This depends on your use case.

For inference, both the C++ and Python interfaces link to the same TensorRT library, which is implemented in CUDA, so the performance is similar.
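
For illustration (not from the original reply): a minimal sketch of running inference from Python on a .plan that was serialized from C++, assuming a TensorRT 7/8-era bindings API and pycuda for device memory; the file name and shapes are placeholders.

```python
import numpy as np
import pycuda.autoinit  # creates a CUDA context on import
import pycuda.driver as cuda
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# The serialized .plan format is the same regardless of whether it was
# built from the C++ or the Python API.
with open("model.plan", "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())

context = engine.create_execution_context()

# Assuming one input (binding 0) and one output (binding 1).
h_input = np.random.rand(1, 3, 224, 224).astype(np.float32)  # placeholder shape
h_output = np.empty(trt.volume(engine.get_binding_shape(1)), dtype=np.float32)
d_input = cuda.mem_alloc(h_input.nbytes)
d_output = cuda.mem_alloc(h_output.nbytes)

# Copy the input to the GPU, run the engine, copy the result back.
cuda.memcpy_htod(d_input, h_input)
context.execute_v2([int(d_input), int(d_output)])
cuda.memcpy_dtoh(h_output, d_output)
```

The device buffers can be reused across calls; if per-call Python overhead matters, execute_async_v2 with a CUDA stream avoids a synchronization on every invocation.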

Some users prefer Python for its rich preprocessing modules.
However, if you are going to write custom CUDA code, C++ will be the better choice.

Thanks.

Thank you for the quick answer. I have some other questions:

  1. Is the Python API a wrapper of the C++ API?

  2. In this GitHub repo: NVIDIA-AI-IOT/tf_trt_image_classification, I noticed that everything is done in Python except for the generation of the .plan file (UFF → plan), which is done in C++.

    So, why not just do everything in Python? What is the advantage of creating the plan file in C++ (performance, or just the CUDA code)?

  3. The NVIDIA TensorRT documentation says the following:

Does that mean that using C++ will be faster when doing inference?

Hi,

1. Yes.

2. TensorRT Python support was added after that sample was released, so the whole pipeline can now be done in Python (see the sketch after this list).

3. Python might be slightly slower due to the wrapper overhead.
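
Regarding point 2, here is a sketch of building the .plan entirely in Python from an ONNX model (not from the original reply; it assumes the TensorRT 7/8-era builder API, and "model.onnx" is a placeholder path):

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

# Parse the ONNX model into a TensorRT network (explicit-batch mode).
builder = trt.Builder(TRT_LOGGER)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, TRT_LOGGER)
with open("model.onnx", "rb") as f:  # placeholder path
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("failed to parse the ONNX model")

# Build the engine and serialize it to a .plan file.
config = builder.create_builder_config()
config.max_workspace_size = 1 << 30  # 1 GiB scratch space (pre-8.4 API)
engine = builder.build_engine(network, config)
with open("model.plan", "wb") as f:
    f.write(engine.serialize())
```

Newer TensorRT 8.x releases replace build_engine/max_workspace_size with build_serialized_network and memory-pool limits, but a .plan built this way is interchangeable with one built from C++ on the same GPU and TensorRT version.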

Thanks.
