Failed to MLC-compile mlc-ai/Llama-3.1-8B-Instruct-fp8-MLC on Jetson AGX orin

Hi,

Since the model compiles with q4 quantization but fails with fp8 (8-bit), the failure is likely caused by the fp8 model requiring more memory than the Jetson AGX Orin has available.

You can verify this by monitoring memory usage with tegrastats while the compilation runs:

$ sudo tegrastats
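
If it helps, here is one possible way to capture the peak RAM figure: log tegrastats to a file during the compile, then parse the `RAM used/totalMB` field out of the log afterwards. This is only a sketch; the log filename and the parsing pipeline are my own choices, not anything MLC-specific.

```shell
# Log tegrastats to a file while the compile runs (interval in ms), e.g.:
#   sudo tegrastats --interval 1000 --logfile tegrastats.log &
#   ... run the MLC compile ...
#   sudo tegrastats --stop
# Then pull the peak RAM usage (in MB) out of the log.
# tegrastats prints a "RAM <used>/<total>MB" field on each line.
peak=$(grep -o 'RAM [0-9]*/[0-9]*MB' tegrastats.log \
       | cut -d' ' -f2 | cut -d'/' -f1 | sort -n | tail -1)
echo "Peak RAM used: ${peak} MB"
```

If the peak approaches the board's total RAM before the failure, that would support the out-of-memory theory.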

Thanks.