Can an NVIDIA Jetson AGX Orin 64GB developer kit run Deepseek-V3, given that it’s a MoE model that only activates 37B of its 671B total parameters per token during inference? Has anyone benchmarked this?
Hi,
Here are some suggestions for the common issues:
1. Performance
Please run the commands below before benchmarking a deep learning use case:
$ sudo nvpmodel -m 0
$ sudo jetson_clocks
2. Installation
Installation guide of deep learning frameworks on Jetson:
- TensorFlow: Installing TensorFlow for Jetson Platform - NVIDIA Docs
- PyTorch: Installing PyTorch for Jetson Platform - NVIDIA Docs
We also have containers that have frameworks preinstalled:
Data Science, Machine Learning, AI, HPC Containers | NVIDIA NGC
3. Tutorial
Deep learning tutorials to get started:
- Jetson-inference: Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson
- TensorRT sample: Jetson/L4T/TRT Customized Example - eLinux.org
4. Report issue
If these suggestions don’t help and you want to report an issue to us, please attach the model, the command/steps, and the customized app (if any) so we can reproduce it locally.
Thanks!
Hi @sprime01, sparse MoE still requires all of the model weights to be loaded in memory, so I’m going to go with ‘unlikely’. That said, we do have Deepseek-R1-Llama-70B running here.
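To make the ‘unlikely’ concrete, here is a rough back-of-envelope sketch (hypothetical figures: 4-bit quantization at ~0.5 bytes/param, ignoring KV cache, activations, and runtime overhead). MoE routing reduces per-token compute, not resident memory — all expert weights still have to be loaded:

```python
# Back-of-envelope weight-memory estimate.
# MoE saves compute per token, not memory: every expert's weights must be resident.

def weights_gib(num_params, bytes_per_param):
    """Approximate weight footprint in GiB (weights only, no KV cache/activations)."""
    return num_params * bytes_per_param / 1024**3

deepseek_v3_params = 671e9   # total parameters across all experts
llama_70b_params   = 70e9
int4 = 0.5                   # 4-bit quantization ~= 0.5 bytes/param (approximation)

print(f"Deepseek-V3 @ 4-bit: ~{weights_gib(deepseek_v3_params, int4):.0f} GiB")
print(f"Llama-70B   @ 4-bit: ~{weights_gib(llama_70b_params, int4):.0f} GiB")
print("Jetson AGX Orin:      64 GiB unified memory")
```

Even at 4-bit, the full 671B weight set is roughly 5x the Orin’s 64GB unified memory, while a 4-bit 70B model fits with room for KV cache.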
Thanks for following up and the info. 70B it is then
@dusty_nv New Jetson user here. I have a Jetson AGX Orin, and after flashing to the latest Jetpack I have a ~57G drive with 30GB left after the dustynv/mlc:r36.4.0 container is downloaded. Then it starts pulling the safetensors, and it looks like it will rapidly exceed the remaining drive space. Is that your experience as well? What are the recommended system requirements for running Deepseek-R1-Llama-70B?
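For a rough sense of why 30GB free runs out so fast, here is a hypothetical sizing sketch (assumes the runtime first pulls full-precision fp16 safetensors before quantizing; exact checkpoint sizes vary by repo and format):

```python
# Rough disk estimate for pulling a 70B-parameter checkpoint.
# Figures are approximations; actual repo sizes vary with format and metadata.

def checkpoint_gib(num_params, bytes_per_param):
    """Approximate on-disk checkpoint size in GiB."""
    return num_params * bytes_per_param / 1024**3

params = 70e9
print(f"fp16 safetensors: ~{checkpoint_gib(params, 2):.0f} GiB")    # full-precision download
print(f"4-bit quantized:  ~{checkpoint_gib(params, 0.5):.0f} GiB")  # after quantization
```

An fp16 70B checkpoint is on the order of 130 GiB, and even the 4-bit artifact exceeds 30GB of free space, so an external NVMe drive (or mounting the model cache there) is the practical route on a 57G root filesystem.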