Hello,
I am trying to run NanoLLM on a Jetson Orin Nano 8GB (JetPack 6.1) with the following command:
jetson-containers run $(autotag nano_llm) \
python3 -m nano_llm.chat --api=mlc \
--model Efficient-Large-Model/VILA-2.7b \
--max-context-len 256 \
--max-new-tokens 32
However, I encounter the following error:
Traceback (most recent call last):
File "/usr/lib/python3.10/runpy.py", line 196, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/usr/lib/python3.10/runpy.py", line 86, in _run_code
exec(code, run_globals)
File "/opt/NanoLLM/nano_llm/chat/__main__.py", line 32, in <module>
model = NanoLLM.from_pretrained(
File "/opt/NanoLLM/nano_llm/nano_llm.py", line 91, in from_pretrained
model = MLCModel(model_path, **kwargs)
File "/opt/NanoLLM/nano_llm/models/mlc.py", line 60, in __init__
quant = MLCModel.quantize(self.model_path, self.config, method=quantization, max_context_len=max_context_len, **kwargs)
File "/opt/NanoLLM/nano_llm/models/mlc.py", line 276, in quantize
subprocess.run(cmd, executable='/bin/bash', shell=True, check=True)
File "/usr/lib/python3.10/subprocess.py", line 526, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command 'python3 -m mlc_llm.build --model /data/models/mlc/dist/models/VILA-2.7b --quantization q4f16_ft --target cuda --use-cuda-graph --use-flash-attn-mqa --sep-embed --max-seq-len 256 --artifact-path /data/models/mlc/dist/VILA-2.7b/ctx256 --use-safetensors ' died with <Signals.SIGKILL: 9>.
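For what it's worth, my understanding is that a process dying with SIGKILL (signal 9) on Linux is often the kernel OOM killer terminating it, rather than a crash in the code itself. If it helps with diagnosis, this is a generic check I can run on the host right after the failure (nothing NanoLLM-specific):

# Look for OOM-killer entries in the kernel log after the crash
sudo dmesg | grep -iE 'out of memory|killed process'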
According to the tutorial on Jetson AI Lab, the VILA-2.7b model should work on the Jetson Orin Nano.
Watching the Jetson Power GUI while the command runs, I can see memory usage climb to the full 8GB just before the process dies. Given that the build was killed with signal 9, could the quantization step be running out of memory and getting killed by the OOM killer?
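In case it is memory pressure, I am planning to try the usual Jetson memory-saving steps (disabling zram, adding NVMe swap, and stopping the desktop GUI) before rerunning. The /ssd path below is just where my NVMe drive is mounted, so treat it as an example:

sudo systemctl disable nvzramconfig     # disable zram (takes effect after reboot)
sudo fallocate -l 16G /ssd/16GB.swap    # create a 16 GB swap file on NVMe
sudo mkswap /ssd/16GB.swap
sudo swapon /ssd/16GB.swap
sudo init 3                             # switch to console mode to free GUI memory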
I would greatly appreciate any insights or solutions to resolve this issue. Thank you in advance for your assistance!