GPU difference inference

Yes, it can. You can use the sample app code in the NVIDIA-AI-IOT/tao-toolkit-triton-apps repository on GitHub, which deploys TAO Toolkit trained models to Triton,
or search the forum for related topics. For example,