For detectnet_v2 inference without deepstream, officially, you can leverage GitHub - NVIDIA-AI-IOT/tao-toolkit-triton-apps: Sample app code for deploying TAO Toolkit trained models to Triton
Or you can also leverage forum topic by other user: Run PeopleNet with tensorrt - #21 by carlos.alvarez