How to run local llm with cuda 10.2 support

aniolekx · May 19, 2024, 8:29am

Hi, I recently bought a Jetson Nano Development Kit and tried running local models for text generation on it. For example, Ollama works, but without CUDA support, it’s slower than on a Raspberry Pi! The Jetson Nano costs more than a typical Raspberry Pi, but without CUDA support, it feels like a total waste of money.

Is there a way to run these models with CUDA 10.2 support?

AastaLLL · May 20, 2024, 7:18am

Hi,

Unfortunately, we don’t have experience with llm on Jetson Nano with CUDA 10.2.
However, we have a generative AI tutorial for the Orin series (including Orin Nano).

If this is an option for you, please give it a try.

Thanks.

aniolekx · May 20, 2024, 8:58am

All these solutions for Orin do not support Cuda 10.2, so it is slower than on Raspberry Pi 5

AastaLLL · May 22, 2024, 8:30am

Hi,

If you want to get CUDA support for Ollama, please try our new device.
Thanks.

dusty_nv · May 22, 2024, 1:38pm

@aniolekx if you follow this thread, Jetson support appears to be in ollama dating back to Nano / CUDA 10.2:

You may need to compile it from source. If you face issue, please file issues against the upstream ollama repo who is maintaining the project.

system · June 19, 2024, 5:11am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
No Existing base Libraries for Python/Cuda? Jetson Orin Nano python , generative_ai	2	123	June 12, 2024
Jetson orin nano local small models perform insanely slow Jetson Orin Nano generative_ai	2	235	June 6, 2024
LLM on Jetson Nano 4GB B01 Jetson Nano conversational-ai , generative_ai	13	680	August 12, 2024
Ollama and Jetson issue Jetson Orin NX jetson-inference , generative_ai	12	4529	March 20, 2024
Jetson orin nano insanely slow inference speed? Jetson Orin Nano generative_ai	3	458	May 6, 2024
LLM run on Jetson Orin Nano Jetson Orin Nano	3	7373	October 9, 2023
Orin Nano vs. LLM doable or not Jetson Orin Nano generative_ai	2	33	November 4, 2024
Help Needed to Update Ollama Container for Newer Model Support (JetPack 6.0 DP) Jetson Orin Nano cuda , jetson-inference , llama	7	213	November 5, 2024
Introducing Ollama Support for Jetson Devices Jetson Projects cuda , natural-language-processing-nlp , artificialintelligence , interactive , docker-machine-learning , generative_ai	29	7700	August 28, 2024
Want to run a Local LLM on Nvidia Jetson AGX Orin Jetson AGX Orin generative_ai	3	816	July 17, 2024

How to run local llm with cuda 10.2 support

Related topics