Is there a plan to support MIG on Orin AGX?

We are currently deploying multiple AI models on the AGX Orin using TensorRT, but we have observed a notable decline in inference performance after launching multiple servers. We believe that MIG (Multi-Instance GPU) could potentially be an excellent solution to address this issue. I was wondering if there are any plans to support MIG on the AGX Orin in the future? Alternatively, if there are any other recommendations or best practices for efficiently running multiple AI models on a single AGX Orin, we would be very grateful for your guidance. Thank you!

Hi,

MIG requires hardware support, which Orin does not have.

To work around the issue, you can try deploying the engines on different CUDA streams.
Work submitted to different streams shares the GPU resources in a time-sliced manner.
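For reference, here is a minimal, illustrative sketch of the idea (not taken from this thread): two dummy workloads stand in for two engines and are launched on separate non-blocking streams so the GPU can interleave them. The kernel, buffer sizes, and iteration counts are placeholders; with TensorRT you would instead pass each stream to that engine's execution context, e.g. `contextA->enqueueV3(streamA)`.

```cpp
// streams_demo.cu -- illustrative sketch only: two dummy "model" workloads
// sharing the GPU through separate CUDA streams. With TensorRT you would
// instead enqueue each engine on its own stream, e.g.
//   contextA->enqueueV3(streamA);  contextB->enqueueV3(streamB);
#include <cuda_runtime.h>
#include <cstdio>

// Stand-in for one model's inference work.
__global__ void dummyInference(float* data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        float v = data[i];
        for (int k = 0; k < 2000; ++k)   // burn some cycles
            v = v * 1.0001f + 0.5f;
        data[i] = v;
    }
}

int main()
{
    const int n = 1 << 20;
    float *bufA = nullptr, *bufB = nullptr;
    cudaMalloc(&bufA, n * sizeof(float));
    cudaMalloc(&bufB, n * sizeof(float));

    // One stream per model so the GPU can interleave their work
    // instead of serializing everything on the default stream.
    cudaStream_t streamA, streamB;
    cudaStreamCreateWithFlags(&streamA, cudaStreamNonBlocking);
    cudaStreamCreateWithFlags(&streamB, cudaStreamNonBlocking);

    dim3 block(256), grid((n + 255) / 256);
    for (int iter = 0; iter < 10; ++iter) {
        dummyInference<<<grid, block, 0, streamA>>>(bufA, n);  // "model A"
        dummyInference<<<grid, block, 0, streamB>>>(bufB, n);  // "model B"
    }

    cudaStreamSynchronize(streamA);
    cudaStreamSynchronize(streamB);
    printf("both streams finished: %s\n", cudaGetErrorString(cudaGetLastError()));

    cudaStreamDestroy(streamA);
    cudaStreamDestroy(streamB);
    cudaFree(bufA);
    cudaFree(bufB);
    return 0;
}
```

Compile with nvcc and check the timeline in Nsight Systems to confirm that the two streams actually overlap on your workload.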

Thanks.

Thank you for the information regarding the current lack of MIG support on the AGX Orin. We have some questions about the approach of assigning engines to different CUDA streams. If different tasks share the GPU resources in a time-sliced manner, each task's performance will still degrade under contention. Are there any strategies or mechanisms for allocating and scheduling compute resources (e.g., SMs) so that the performance drop for each task is minimized?

Hi,

If the tasks all run in the same process, you can check whether green contexts meet your requirements.
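In case it helps others: green contexts are a CUDA driver API feature (CUDA 12.4 and later) that splits the device's SMs into groups and lets you create a stream bound to each group, so each engine gets its own SM partition instead of relying purely on time-slicing. Below is a rough sketch under the assumption that your JetPack release ships a CUDA version with green-context support on Orin; the group count and the 4-SM minimum are placeholders to tune for your models.

```cpp
// green_ctx_demo.cpp -- rough sketch of SM partitioning with CUDA green
// contexts (driver API, CUDA 12.4+). The 2-group / 4-SM split below is a
// placeholder, not a tuned value.
#include <cuda.h>
#include <cstdio>

#define CHECK(call) do { CUresult r = (call); if (r != CUDA_SUCCESS) {      \
    const char* s; cuGetErrorString(r, &s);                                  \
    printf("%s failed: %s\n", #call, s); return 1; } } while (0)

int main()
{
    CHECK(cuInit(0));
    CUdevice dev;
    CHECK(cuDeviceGet(&dev, 0));

    // Query the device's total SM resource.
    CUdevResource smAll;
    CHECK(cuDeviceGetDevResource(dev, &smAll, CU_DEV_RESOURCE_TYPE_SM));
    printf("total SMs: %u\n", smAll.sm.smCount);

    // Split into two groups of at least 4 SMs each (rounded up to hardware
    // granularity); 'remaining' receives any leftover SMs.
    CUdevResource groups[2], remaining;
    unsigned int nbGroups = 2;
    CHECK(cuDevSmResourceSplitByCount(groups, &nbGroups, &smAll,
                                      &remaining, 0, /*minCount=*/4));

    // One green context + one stream per SM group; each TensorRT engine
    // would then enqueue on "its" stream (e.g. context->enqueueV3(stream)).
    for (unsigned int i = 0; i < nbGroups; ++i) {
        CUdevResourceDesc desc;
        CHECK(cuDevResourceGenerateDesc(&desc, &groups[i], 1));

        CUgreenCtx gctx;
        CHECK(cuGreenCtxCreate(&gctx, desc, dev, CU_GREEN_CTX_DEFAULT_STREAM));

        CUstream stream;
        CHECK(cuGreenCtxStreamCreate(&stream, gctx, CU_STREAM_NON_BLOCKING, 0));

        printf("group %u: %u SMs, stream %p\n",
               i, groups[i].sm.smCount, (void*)stream);
        // ... hand 'stream' to the corresponding model's inference loop ...
    }
    return 0;
}
```

All models still have to live in the same process for this to apply, as noted above, and SMs left in the leftover partition go unused unless you create another group for them.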

Thanks.
