Jetson TX2 TensorFlow/TensorRT Workflow


I am trying to convert a trained TensorFlow model to run on my TX2 using TensorRT, but I’m a bit lost on the steps involved. I am registered for the March 8th webinar on this topic, but I was hoping I could get some clarification beforehand (and maybe I can even get something working in the meantime). Note that I have my TX2 set up with JetPack 3.2, so it has TensorRT 3.0 on it.

Here is my understanding of the steps involved:

  1. Train the network in TensorFlow and then freeze the network into a .pb file. This can be done on any machine.
  2. Convert the .pb file to a UFF file using the convert-to-uff script that ships with TensorRT. My understanding is that the resulting UFF file is still platform independent, so this step can also be done on any machine.
  3. From the UFF file, build a TensorRT engine. Based on my reading of the documentation, this step must be done on the TX2 itself, since the resulting serialized engine (I want to save it to a file so I can reuse that file for inference) is platform specific. Also, because this step must run on the TX2 and the new Python API isn’t supported on ARM, this means writing some C++ code to handle it.
  4. Do inference using the engine created in step #3. Once again, this means some C++ running on the TX2, although this should be as easy as just loading the engine and then calling the appropriate inference function.
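To make step 3 concrete, here is roughly what I'm picturing in C++. This is an untested sketch against the TensorRT 3.0 API; the file names ("model.uff", "model.engine"), tensor names ("input", "output"), and dimensions are all placeholders for my actual model:

```cpp
// Sketch of step 3: parse a UFF file, build a TensorRT engine on the TX2,
// and serialize it to disk. Assumes the TensorRT 3.0 C++ API.
#include <fstream>
#include <iostream>
#include "NvInfer.h"
#include "NvUffParser.h"

using namespace nvinfer1;

// TensorRT requires a logger implementation.
class Logger : public ILogger
{
    void log(Severity severity, const char* msg) override
    {
        if (severity != Severity::kINFO)
            std::cerr << msg << std::endl;
    }
} gLogger;

int main()
{
    auto parser = nvuffparser::createUffParser();
    // Register the model's input/output tensors; these names and
    // dimensions are placeholders -- use the ones from the frozen graph.
    parser->registerInput("input", DimsCHW(3, 224, 224));
    parser->registerOutput("output");

    IBuilder* builder = createInferBuilder(gLogger);
    INetworkDefinition* network = builder->createNetwork();
    if (!parser->parse("model.uff", *network, DataType::kFLOAT))
    {
        std::cerr << "Failed to parse UFF file" << std::endl;
        return 1;
    }

    builder->setMaxBatchSize(1);
    builder->setMaxWorkspaceSize(1 << 24);  // 16 MB of scratch space
    ICudaEngine* engine = builder->buildCudaEngine(*network);

    // Serialize the engine and write it to disk so it can be reloaded
    // later without rebuilding. The file is specific to this GPU and
    // TensorRT version, which is why this step runs on the TX2 itself.
    IHostMemory* serialized = engine->serialize();
    std::ofstream out("model.engine", std::ios::binary);
    out.write(static_cast<const char*>(serialized->data()),
              serialized->size());

    serialized->destroy();
    engine->destroy();
    network->destroy();
    builder->destroy();
    parser->destroy();
    return 0;
}
```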
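And step 4 as I understand it, again as an untested sketch (the binding names and buffer sizes are assumptions; a 3x224x224 input and a 1000-class output are just examples):

```cpp
// Sketch of step 4: deserialize the saved engine and run inference.
// Assumes the TensorRT 3.0 C++ API and the CUDA runtime.
#include <cuda_runtime_api.h>
#include <fstream>
#include <iostream>
#include <iterator>
#include <vector>
#include "NvInfer.h"

using namespace nvinfer1;

class Logger : public ILogger
{
    void log(Severity severity, const char* msg) override
    {
        if (severity != Severity::kINFO)
            std::cerr << msg << std::endl;
    }
} gLogger;

int main()
{
    // Read the serialized engine produced in step 3 back into memory.
    std::ifstream in("model.engine", std::ios::binary);
    std::vector<char> blob((std::istreambuf_iterator<char>(in)),
                           std::istreambuf_iterator<char>());

    IRuntime* runtime = createInferRuntime(gLogger);
    ICudaEngine* engine =
        runtime->deserializeCudaEngine(blob.data(), blob.size(), nullptr);
    IExecutionContext* context = engine->createExecutionContext();

    // Assumed sizes for a 3x224x224 input and a 1000-class output.
    const int inputSize = 3 * 224 * 224;
    const int outputSize = 1000;
    std::vector<float> input(inputSize), output(outputSize);

    // TensorRT expects device pointers, ordered by binding index.
    const int inIdx = engine->getBindingIndex("input");
    const int outIdx = engine->getBindingIndex("output");
    void* buffers[2];
    cudaMalloc(&buffers[inIdx], inputSize * sizeof(float));
    cudaMalloc(&buffers[outIdx], outputSize * sizeof(float));

    cudaMemcpy(buffers[inIdx], input.data(), inputSize * sizeof(float),
               cudaMemcpyHostToDevice);
    context->execute(1, buffers);  // synchronous inference, batch size 1
    cudaMemcpy(output.data(), buffers[outIdx], outputSize * sizeof(float),
               cudaMemcpyDeviceToHost);

    cudaFree(buffers[inIdx]);
    cudaFree(buffers[outIdx]);
    context->destroy();
    engine->destroy();
    runtime->destroy();
    return 0;
}
```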

Am I on the right track? Is there anything I’m missing? Thanks in advance for the help.

Hi dwd_pete,

It looks like you’re on the right track!

Assuming the operations in your model are supported by the version of TensorRT you are using, the steps you outlined should work.

Let me know if you have any other questions.



I would like to know if YOLO Darknet models built using TensorFlow are supported on Jetson.



We are not familiar with the YOLO TensorFlow model you mentioned.

You can check whether the layers in your model are supported by TensorRT in our documentation:

Additionally, here is a tutorial demonstrating the TensorFlow-to-TensorRT use case for your reference: