Software Version
DRIVE OS 6.0.6
Target Operating System
Linux
QNX
Hardware Platform
DRIVE AGX Orin Developer Kit (940-63710-0010-100)
SDK Manager Version
2.1.0
Host Machine Version
native Ubuntu Linux 20.04 Host installed with SDK Manager
Issue Description
1. LLM Deployment Approach
I would like to understand the recommended method for running LLMs on DRIVE Orin:
- Option A: Using the NIM (NVIDIA Inference Microservices) framework
- Option B: Running LLMs directly with a generic inference approach, without NIM
2. Host System GPU Requirements
- Is a GPU required on the host system for development and deployment?
- Can we run the code directly on the SoC without a host GPU?
- What is the recommended development workflow (cross-compilation on the host vs. native builds on the target)?
3. AI Agent Framework Selection
- Which framework is best suited for AI Agent development and deployment on the DRIVE Orin SoC?
- Are there NVIDIA-recommended frameworks optimized for DRIVE Orin’s architecture?
- Which frameworks provide the best balance of performance, ease of deployment, and resource efficiency?