Description
Parallel inference of multiple models on the same input image. I have 2 models that need to run inference at the same time on each input frame. I tried multi-threading and multi-processing, but when inference runs on the GPU, the models execute sequentially (not in parallel). I also tried splitting inference between the DLA and the GPU, but the DLA does not support all layers, so inference time is very poor. Please tell me the best way to run parallel inference of multiple models on the same input image. Many thanks!!!
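For context, this is the dispatch pattern I am using in the multi-thread attempt: each model gets its own worker thread with its own input/output queue, and every frame is pushed to both workers. This is only a sketch with the actual inference stubbed out as plain Python callables (`infer_fn`); in the real code each worker would own its own TensorRT execution context and CUDA stream instead of the lambdas below. The names `make_worker` and `parallel_infer` are my own helpers, not library APIs.

```python
import threading
import queue

# Stand-ins for per-model inference. In the real setup each worker would
# hold its own TensorRT execution context and CUDA stream; here infer_fn
# is just a plain callable so the pattern itself is runnable anywhere.
def make_worker(name, infer_fn, in_q, out_q):
    def run():
        while True:
            frame = in_q.get()
            if frame is None:        # poison pill: shut the worker down
                break
            out_q.put((name, infer_fn(frame)))
    return threading.Thread(target=run, daemon=True)

def parallel_infer(frame, workers_io):
    """Push the same frame to every model's queue, then gather one
    result per model."""
    for in_q, _ in workers_io:
        in_q.put(frame)
    return dict(out_q.get() for _, out_q in workers_io)

# --- demo with two dummy "models" standing in for the two engines ---
q1_in, q1_out = queue.Queue(), queue.Queue()
q2_in, q2_out = queue.Queue(), queue.Queue()
make_worker("model_a", lambda f: f * 2, q1_in, q1_out).start()
make_worker("model_b", lambda f: f + 1, q2_in, q2_out).start()

results = parallel_infer(10, [(q1_in, q1_out), (q2_in, q2_out)])
print(results)  # {'model_a': 20, 'model_b': 11}
```

The threads themselves do run concurrently here; the problem I see is that once the stubs are replaced with real GPU inference, the two models' kernels still serialize on the device.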
Environment
TensorRT Version:
GPU Type:
Nvidia Driver Version: AGX Orin 64GB devkit
CUDA Version: 11.4
Operating System + Version: Ubuntu 20
Python Version (if applicable): Python 3.10