Hi all!
I would like to use Ollama along with nemotron-mini.
I decided to work with the nemotron-mini:4b-instruct-q4_K_M model pulled from Ollama. I can see that it sometimes has issues with tool handling, especially when multiple tools are defined. This wasn’t the case with the other models I have tried. It seems that the output JSON is missing the arguments part, which is why Ollama doesn’t handle the tool calling properly.
I’m also using ollama-python, with the following example as my base solution: Ollama Python library 0.4 with function calling improvements · Ollama Blog. Due to memory constraints (8 GB Orin Nano), I can’t run a larger model.
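For context, here is a minimal sketch of the workaround I’m considering in the meantime: post-processing the model’s raw tool-call JSON and filling in an empty arguments object when it is missing. `normalize_tool_call` is a hypothetical helper name; the `"name"`/`"arguments"` keys follow the tool-call shape used in the Ollama blog example.

```python
import json


def normalize_tool_call(raw: str) -> dict:
    """Parse a raw tool-call JSON string emitted by the model and
    ensure it carries an 'arguments' field (defaulting to an empty
    dict) so downstream tool dispatch doesn't fail.

    Hypothetical helper, not part of ollama-python itself."""
    call = json.loads(raw)
    call.setdefault("arguments", {})
    return call


# Example: nemotron-mini sometimes emits a call without arguments
call = normalize_tool_call('{"name": "get_current_weather"}')
print(call)  # {'name': 'get_current_weather', 'arguments': {}}
```

I’m not sure this is the right layer to patch it at, though, which is part of my question.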
Could you please guide me with the required/recommended steps to proceed?
Thank you!