Thank you for the quick answer. I have some other doubts:
Is the Python API a wrapper of the C++ API?
In this GIT: NVIDIA-AI-IOT/tf_trt_image_classification I have noticed that everything has been done using python except for the generation of the .plan file (UFF → plan); which has been done with C++.
So, why not just do everything with python? What is the advantage of creating the plan file in C++ (performance, or just because of the CUDA code ?