AGX Orin 64GB computational limits?

Hi everyone. After some reflection I think I’ll buy an AGX Orin dev kit 64GB (to have something that will last for 3-4 years).
Now the main concern: what are its current limits? I mean, is there some kind of model (from NVIDIA NGC or HuggingFace) that is simply too big or too computationally intensive for the AGX?
I’d like to use BLOOM, Stable Diffusion, SantaCoder, and of course the NVIDIA AI stack at the highest capacity the AGX allows. What will not run on it?
Thanks, and sorry if my question sounds naive.

I can’t claim it has no limitations, but it is the most powerful AI edge device.


I’ll suggest that the biggest limitation isn’t in its computing power. You do need to be careful though to understand it is an integrated GPU (iGPU), and the APIs it can use are tied to its particular L4T (Ubuntu + NVIDIA drivers) release. Whatever you need, be certain that the flashed release supports the CUDA or other API releases it needs to work with.
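As a quick sanity check, here is a minimal sketch (assuming the standard L4T locations /etc/nv_tegra_release and /usr/local/cuda; adjust paths if your image differs) that reports which L4T release and CUDA toolkit are actually on the device before you commit to a framework version:

```python
# Minimal sketch: report the L4T release and CUDA toolkit version on a Jetson.
# Assumes the standard L4T locations (/etc/nv_tegra_release, /usr/local/cuda);
# adjust the paths if your flashed image differs.
import re
import subprocess
from pathlib import Path

def l4t_release() -> str:
    release_file = Path("/etc/nv_tegra_release")
    if not release_file.exists():
        return "not an L4T system (file missing)"
    # First line typically looks like: "# R35 (release), REVISION: 4.1, ..."
    text = release_file.read_text()
    match = re.search(r"R(\d+).*?REVISION:\s*([\d.]+)", text)
    return f"L4T R{match.group(1)}.{match.group(2)}" if match else text.strip()

def cuda_version() -> str:
    nvcc = Path("/usr/local/cuda/bin/nvcc")
    if not nvcc.exists():
        return "CUDA toolkit not found at /usr/local/cuda"
    out = subprocess.run([str(nvcc), "--version"], capture_output=True, text=True)
    match = re.search(r"release ([\d.]+)", out.stdout)
    return f"CUDA {match.group(1)}" if match else out.stdout.strip()

if __name__ == "__main__":
    print(l4t_release())
    print(cuda_version())
```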

Also, despite being an extraordinary GPU device at that size, it still is not the same as a high end discrete GPU (dGPU) you might find on a desktop. The iGPU uses the same memory as the CPU on an Orin, whereas a dGPU has its own memory. Memory consumption limits differ between dGPU and iGPU.
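You can see the shared-memory point in practice with a small sketch (assuming a CUDA-enabled PyTorch build is installed on the device): on an Orin the memory CUDA reports and the system RAM describe the same physical pool, whereas on a desktop dGPU they are separate.

```python
# Minimal sketch: compare what CUDA reports as device memory with system RAM.
# On an Orin (iGPU) these describe the same shared pool; on a desktop dGPU they don't.
# Assumes a CUDA-enabled PyTorch build is installed.
import torch

def meminfo_kib(field: str) -> int:
    # Parse a field such as "MemTotal" or "MemAvailable" from /proc/meminfo (values are in KiB).
    with open("/proc/meminfo") as f:
        for line in f:
            if line.startswith(field + ":"):
                return int(line.split()[1])
    raise KeyError(field)

free_gpu, total_gpu = torch.cuda.mem_get_info()  # bytes, as seen by CUDA
print(f"CUDA device memory : {total_gpu / 2**30:.1f} GiB total, {free_gpu / 2**30:.1f} GiB free")
print(f"System RAM         : {meminfo_kib('MemTotal') / 2**20:.1f} GiB total, "
      f"{meminfo_kib('MemAvailable') / 2**20:.1f} GiB available")
```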


Thank you for your reply.
As for the use case, I plan to use it in essentially 2 different scenarios:

  1. training new models from scratch (mainly some BERT-related ones)
  2. learning and experimenting with some cool models I can find online (e.g. from huggingface.com). Since some of these activities overlap with my job, I can’t use something like Colab; I need the gear to be on my desk.

Thanks!

For training, a desktop GPU is recommended, although the Orin is capable of training if there is sufficient memory. Even a 1080 Ti has 3584 CUDA cores, and although the Orin has overtaken a GTX 1060 (I think an Orin has 2048 cores), a desktop with a 1080 Ti or better will very quickly pull ahead for training. What makes many of the higher-end training systems more expensive, though, isn’t necessarily the number of CUDA cores…you’ll see that the Titan series (and more specialized GPUs) tend to have a lot more VRAM. That VRAM is quite fast compared to a Jetson’s RAM, and the Jetson only has available whatever is left over after the operating system takes its share. Still, if you were to use an Orin for training, it could probably do the job (something like a TX2 I would not recommend for training).
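As a rough back-of-the-envelope check before training on the shared 64GB, here is a sketch with assumed byte counts (fp32 weights, gradients, and Adam state; activations and framework overhead are deliberately ignored, so the real footprint will be noticeably higher):

```python
# Minimal sketch: rough memory estimate for training a model with Adam.
# Assumptions: fp32 weights and gradients (4 bytes each) plus two fp32 Adam
# moment buffers per parameter; activations and framework overhead are NOT
# counted, so the real footprint will be noticeably higher.
def training_memory_gib(num_params: float) -> float:
    bytes_per_param = 4 + 4 + 4 + 4   # weights + grads + Adam m + Adam v
    return num_params * bytes_per_param / 2**30

for name, params in [("BERT-base (~110M)", 110e6),
                     ("BERT-large (~340M)", 340e6),
                     ("A 7B-parameter LLM", 7e9)]:
    print(f"{name:22s} ~{training_memory_gib(params):6.1f} GiB (weights + grads + Adam state only)")
```

The point of the estimate is that BERT-sized models fit comfortably in the Orin’s shared memory, while multi-billion-parameter models leave little or no headroom once the OS and activations are accounted for.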
