Hey,
I'm trying to launch nim/meta/llama-3.1-405b-instruct on a machine with 8x A100 80GB SXM4 GPUs, and I get an error saying the profile doesn't match the machine, even though it looks like an exact match.
What am I missing?
"
== NVIDIA Inference Microservice LLM NIM ==
NVIDIA Inference Microservice LLM NIM Version 1.1.2
Model: nim/meta/llama-3.1-405b-instruct
Container image Copyright (c) 2016-2024, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
The use of this model is governed by the NVIDIA AI Foundation Models Community License Agreement (found at NVIDIA Agreements | Enterprise Software | NVIDIA AI Foundation Models Community License Agreement).
ADDITIONAL INFORMATION: Llama 3.1 Community License Agreement, Built with Llama.
INFO 11-28 18:28:32.556 ngc_profile.py:222] Running NIM without LoRA. Only looking for compatible profiles that do not support LoRA.
INFO 11-28 18:28:32.556 ngc_profile.py:224] Detected 0 compatible profile(s).
ERROR 11-28 18:28:32.556 utils.py:21] Profile ‘tensorrt_llm-a100-fp16-tp8-latency’ is incompatible with detected hardware. Please check the system information below and select a compatible profile.
SYSTEM INFO
- Free GPUs:
- [20b2:10de] (0) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
- [20b2:10de] (1) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
- [20b2:10de] (2) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
- [20b2:10de] (3) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
- [20b2:10de] (4) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
- [20b2:10de] (5) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
- [20b2:10de] (6) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
- [20b2:10de] (7) NVIDIA A100-SXM4-80GB (A100 80GB) [current utilization: 0%]
"