NIM does not support llama-3.1-8b-instruct and llama-3.1-70b-instruct on GH200 On-Prem deployment

sjbbihan · November 7, 2024, 10:42am

NIM does not support llama-3.1-8b-instruct on GH200.
Below is the command I tried

docker run -it --rm \
    --gpus all \
    --shm-size=16GB \
    -e NGC_API_KEY \
    -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" \
    -u $(id -u) \
    -p 8000:8000 \
    nvcr.io/nim/meta/llama-3.1-8b-instruct:latest

Error

The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested
exec /opt/nvidia/nvidia_entrypoint.sh: exec format error

Also does not work with Llama3.1-70B

Status: Downloaded newer image for nvcr.io/nim/meta/llama-3.1-70b-instruct:latest
WARNING: The requested image's platform (linux/amd64) does not match the detected host platform (linux/arm64/v8) and no specific platform was requested
exec /opt/nvidia/nvidia_entrypoint.sh: exec format error

±----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| No running processes found |
±----------------------------------------------------------------------------------------+

neal.vaidya · November 7, 2024, 7:51pm

Hi @sjbbihan, unfortunately NIM does not support deployment on ARM platforms at the moment, including Grace CPUs. We’re working to close this gap in future releases.

Topic		Replies	Views
NIM Llama3 8B Instruct - Running container with "CUDA_ERROR_NO_DEVICE" cuDNN docker , nim , llama3-8b-instruct	1	91	March 28, 2025
NIM with llama-3-8b models stuck without any error Models nim , llama3-8b-instruct , llama	0	195	November 15, 2024
How to fix 0 compatible profiles? Where to get compatible profiles? Models nim , llama-31-8b-instruct , llama	4	635	November 26, 2024
Aunch NVIDIA NIM (llama3-8b-instruct) for LLMs locally Access/Accounts nim , llama3-8b-instruct	3	178	November 8, 2024
0 Compatible Profiles for Llama 3.1 70B Models nim , llama-31-70b-instruct	6	685	October 28, 2024
Unable to use version of LLAMA 3.1 greater than 1.2.1 on DGX Cloud Slurm Cluster Models nim , llama-31-70b-instruct , llama	1	73	March 13, 2025
Unable to Run NIM on H100 GPU Due to Profile Compatibility Issue Despite Sufficient GPU Resources Models nim , llama-31-8b-instruct , llama	1	272	November 12, 2024
Issues while starting NIM container in A10 VM Models nim , llama3-8b-instruct	4	218	September 4, 2024
/opt/nim/start-server.sh: line 61: 32 Killed python3 -m vllm_nvext.entrypoints.openai.api_server Container: CUDA	0	311	July 9, 2024
CUDA fail start. Local NIM Containers run failed CUDA Setup and Installation nim , llama-31-405b-instruct , llama	2	275	September 20, 2024

NIM does not support llama-3.1-8b-instruct and llama-3.1-70b-instruct on GH200 On-Prem deployment

Related topics