Hello I am working on jetson agx orin
For some decition making i want the local llm running in the jetson device, i want near real time performance , also there are other vision models already i am utilizing
What i wanted to know is what is the best inbuilt solution there to run llm effectively in jetson
I hered some options as
- Nano llm (Welcome to NanoLLM! — NanoLLM 24.7 documentation)
- vLLMs ( Introduction to GenAI on Jetson: How to Run LLMs and VLMs | Jetson AI Lab )
please advice which is the latest most effective way to run
