Gemma 3 and Gemma 3n on Jetson Orin Nano Super

I’ve been running Gemma 3 4B on Jetson for quite a while, and it has been reliable.

Today, I gave Gemma 3n e2b a quick spin using Ollama, and the results were surprisingly good.

It handled memory efficiently, delivered solid tokens-per-second performance, and the response quality was impressive.

If you’re working with Jetson, I definitely recommend giving Gemma 3n e2b a try. It feels like it was made for the Jetson Nano.
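
If you want to try it yourself, here is a minimal sketch, assuming Ollama is already installed on the device:

# pull the weights, then start an interactive session
ollama pull gemma3n:e2b
ollama run gemma3n:e2b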


How fast is it compared to Gemma 3 4B?

Gemma 3n e2b
total duration: 48.044328503s
load duration: 264.648734ms
prompt eval count: 31 token(s)
prompt eval duration: 228.477798ms
prompt eval rate: 135.68 tokens/s
eval count: 787 token(s)
eval duration: 47.549470783s
eval rate: 16.55 tokens/s

Gemma 3 4B
total duration: 1m9.660771616s
load duration: 241.261546ms
prompt eval count: 96 token(s)
prompt eval duration: 203.591012ms
prompt eval rate: 471.53 tokens/s
eval count: 699 token(s)
eval duration: 1m9.132128939s
eval rate: 10.11 tokens/s
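
(These are the timing stats Ollama prints when a prompt is run with the --verbose flag; the eval rate is just eval count divided by eval duration, e.g. 787 tokens / 47.55 s ≈ 16.55 tokens/s for e2b.) To reproduce:

# print token counts and throughput after each response
ollama run gemma3n:e2b --verbose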


I tried running gemma3n e2b and e4b, both using Ollama on Jetson. Since we cannot run it directly (Ollama was not ARM compatible on my setup), I ran it using container images that are ARM compatible:

git clone https://github.com/dusty-nv/jetson-containers
bash jetson-containers/install.sh
jetson-containers run $(autotag ollama)

# in another terminal, run the ollama client
user@hostname:~$ jetson-containers run $(autotag ollama) ollama run gemma3n:e4b

Namespace(packages=['ollama'], prefer=['local', 'registry', 'build'], disable=[''], user='dustynv', output='/tmp/autotag', quiet=False, verbose=False)

-- L4T_VERSION=36.4.4 JETPACK_VERSION=6.2.1 CUDA_VERSION=12.6

-- Finding compatible container image for ['ollama']

dustynv/ollama:0.6.8-r36.4-cu126-22.04

V4L2_DEVICES: --device /dev/video0 --device /dev/video1

### ARM64 architecture detected

### Jetson Detected

SYSTEM_ARCH=tegra-aarch64

+ docker run --runtime nvidia --env NVIDIA_DRIVER_CAPABILITIES=compute,utility,graphics -it --rm --network host --shm-size=8g --volume /tmp/argus_socket:/tmp/argus_socket --volume /etc/enctune.conf:/etc/enctune.conf --volume /etc/nv_tegra_release:/etc/nv_tegra_release --volume /tmp/nv_jetson_model:/tmp/nv_jetson_model --volume /var/run/dbus:/var/run/dbus --volume /var/run/avahi-daemon/socket:/var/run/avahi-daemon/socket --volume /var/run/docker.sock:/var/run/docker.sock --volume /home/students/jetson-containers/data:/data -v /etc/localtime:/etc/localtime:ro -v /etc/timezone:/etc/timezone:ro --device /dev/snd -e PULSE_SERVER=unix:/run/user/1000/pulse/native -v /run/user/1000/pulse:/run/user/1000/pulse --device /dev/bus/usb --device /dev/video0 --device /dev/video1 --device /dev/i2c-0 --device /dev/i2c-1 --device /dev/i2c-2 --device /dev/i2c-4 --device /dev/i2c-5 --device /dev/i2c-7 --device /dev/i2c-9 -v /run/jtop.sock:/run/jtop.sock --name jetson_container_20250821_222743 dustynv/ollama:0.6.8-r36.4-cu126-22.04 ollama run gemma3n:e4b

Error: llama runner process has terminated: signal: killed

user@hostname:~$ jetson-containers run $(autotag ollama) ollama run gemma3n:e2b

Namespace(packages=['ollama'], prefer=['local', 'registry', 'build'], disable=[''], user='dustynv', output='/tmp/autotag', quiet=False, verbose=False)

-- L4T_VERSION=36.4.4 JETPACK_VERSION=6.2.1 CUDA_VERSION=12.6

-- Finding compatible container image for ['ollama']

dustynv/ollama:0.6.8-r36.4-cu126-22.04

V4L2_DEVICES: --device /dev/video0 --device /dev/video1

### ARM64 architecture detected

### Jetson Detected

SYSTEM_ARCH=tegra-aarch64

+ docker run --runtime nvidia --env NVIDIA_DRIVER_CAPABILITIES=compute,utility,graphics -it --rm --network host --shm-size=8g --volume /tmp/argus_socket:/tmp/argus_socket --volume /etc/enctune.conf:/etc/enctune.conf --volume /etc/nv_tegra_release:/etc/nv_tegra_release --volume /tmp/nv_jetson_model:/tmp/nv_jetson_model --volume /var/run/dbus:/var/run/dbus --volume /var/run/avahi-daemon/socket:/var/run/avahi-daemon/socket --volume /var/run/docker.sock:/var/run/docker.sock --volume /home/students/jetson-containers/data:/data -v /etc/localtime:/etc/localtime:ro -v /etc/timezone:/etc/timezone:ro --device /dev/snd -e PULSE_SERVER=unix:/run/user/1000/pulse/native -v /run/user/1000/pulse:/run/user/1000/pulse --device /dev/bus/usb --device /dev/video0 --device /dev/video1 --device /dev/i2c-0 --device /dev/i2c-1 --device /dev/i2c-2 --device /dev/i2c-4 --device /dev/i2c-5 --device /dev/i2c-7 --device /dev/i2c-9 -v /run/jtop.sock:/run/jtop.sock --name jetson_container_20250821_225026 dustynv/ollama:0.6.8-r36.4-cu126-22.04 ollama run gemma3n:e2b

>>> hyy! how are you

Error: Post "http://0.0.0.0:11434/api/chat": EOF
Then I checked the logs:

user@hostname:~$ docker logs jetson_container_20250821_215028

Starting ollama server

OLLAMA_HOST 0.0.0.0

OLLAMA_LOGS /data/logs/ollama.log

OLLAMA_MODELS /data/models/ollama/models

ollama server is now started, and you can run commands here like 'ollama run gemma3'

root@ubuntu:/# jetson-containers run $(autotag ollama) ollama run gemma3n:e2b

bash: autotag: command not found

bash: jetson-containers: command not found

root@ubuntu:/# ollama run gemma3n:e2b

>>> hyy! how are you?

Error: Post "http://0.0.0.0:11434/api/chat": EOF

root@ubuntu:/#
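
For what it's worth, "signal: killed" on the e4b run usually means the kernel's OOM killer terminated the llama runner (e4b needs more memory than the board had free), and the EOF on /api/chat tends to follow the same kind of crash. A quick sanity check, assuming the container defaults shown in the log above:

# inside the container: the server log path printed at startup
tail -n 100 /data/logs/ollama.log

# on the host: look for OOM-killer entries and check free memory
sudo dmesg | grep -i oom
free -h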

I am not using Docker. I just run the following command on Jetson:

curl -fsSL https://ollama.com/install.sh | sh

Then it works well on Jetson. I run it under the root account and set the listen address (OLLAMA_HOST) to 0.0.0.0. Done!
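
If it helps, here is one way to set the listen address on the native install without a root shell; a sketch assuming the default systemd unit that the install script creates (ollama.service):

# add an environment override to the service
sudo systemctl edit ollama
#   in the editor that opens, add:
#   [Service]
#   Environment="OLLAMA_HOST=0.0.0.0"
sudo systemctl daemon-reload
sudo systemctl restart ollama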