Topics tagged llama-31-8b-instruct

Topic	Replies	Views	Activity
Example ran out of memory on dgxspark DGX Spark / GB10 llama-31-8b-instruct , llama-31-70b-instruct , llama	1	38	December 18, 2025
Certificate verify failed while installing NIM models Models cuda , nim , llm , llama-31-8b-instruct , llama	1	51	December 15, 2025
Cannot enable --enable-auto-tool-choice and --tool-call-parser Models nim , llama-31-8b-instruct , llama	1	72	December 15, 2025
Announcing new VLLM container & 3.5X increase in Gen AI Performance in just 5 weeks of Jetson AGX Thor Launch Jetson Thor jetson , llama-31-8b-instruct , llama , nemotron	46	2362	December 14, 2025
Second NIM container won't start due to less than desired GPU memory utilization DGX Spark / GB10 docker , nim , llama-31-8b-instruct , llama	10	224	December 3, 2025
Question Regarding Draft Model Support AnythingLLM via NVIDIA NIM DGX Spark / GB10 nim , llama-31-8b-instruct , llama	2	69	November 19, 2025
NVIDIA NIM: LLAMA-4 (Maverick) Image for performance Benchmarking on Nvidia H200 GPUs Models cuda , nim , llama-31-8b-instruct , llama	2	159	November 17, 2025
Title: 401 Unauthorized when calling NVIDIA Integrate API (/v1/chat/completions) from container (API key works for /v1/models but fails for chat) NVIDIA Blueprints cuda , kernel , ubuntu , llama-31-8b-instruct , llama-31-70b-instruct , llama	0	41	November 6, 2025
Vllm client connection refused Jetson Thor ai , llama-31-8b-instruct , llama	10	104	October 31, 2025
Trying to Run DAPT-Continual PreTraining on Chip_design Data NVIDIA NeMo cuda , nemo , llama-31-8b-instruct , llama	2	82	August 5, 2025
Speculative decoding using vLLM on the Nvidia Jetson AGX Orin 64GB dev kit Jetson Projects generative_ai , llama-31-8b-instruct , llama	0	207	March 9, 2025
VSS local deployment single gpu: Failed to load VIA stream handler - Guardrails / CA-RAG setup failed Visual AI Agent nim , llama-31-8b-instruct , llama	4	175	July 4, 2025
Example-hybrid-rag NVIDIA AI Workbench nim , llama-31-8b-instruct , llama	7	193	June 2, 2025
Model _ request Model Does not exist error NIM on RTX AI PCs and Workstations nim , llama-31-8b-instruct , llama	0	66	May 31, 2025
SOTA inference speed using SGlang and EAGLE-3 speculative decoding on the NVIDIA Jetson AGX Orin Jetson Projects llama-31-8b-instruct , llama	2	907	March 23, 2025
Batch processing using NVIDIA NIM \| Docker \| Self-hosted Models python , nim , llama3-8b-instruct , llama-31-8b-instruct , llama	11	656	January 29, 2025
Running LMdeploy inference engine on the NVIDIA Jetson AGX Orin Devkit Jetson Projects jetson , llama-31-8b-instruct , llama	2	203	January 14, 2025
Failed to MLC-compile mlc-ai/Llama-3.1-8B-Instruct-fp8-MLC on Jetson AGX orin Jetson AGX Orin generative_ai , llama-31-8b-instruct , llama	5	294	January 13, 2025
Jetson Orin Nano Super Dev Kit Performance Jetson Orin Nano cudnn , gemma-2-9b-it , llama-31-8b-instruct , llama	6	1108	January 28, 2025
How to fix 0 compatible profiles? Where to get compatible profiles? Models nim , llama-31-8b-instruct , llama	4	667	November 26, 2024
Boosting LLM Inference Speed Using Speculative Decoding in MLC-LLM on Nvidia Jetson AGX Orin Jetson Projects generative_ai , llama-31-8b-instruct , llama	0	256	November 23, 2024
NIM TensorRT-LLM on H100 NVL Models nim , llama-31-8b-instruct , llama	2	264	November 22, 2024
Unable to Run NIM on H100 GPU Due to Profile Compatibility Issue Despite Sufficient GPU Resources Models nim , llama-31-8b-instruct , llama	1	294	November 12, 2024
NIM does not support llama-3.1-8b-instruct and llama-3.1-70b-instruct on GH200 On-Prem deployment Models nim , llama-31-8b-instruct , llama	1	325	November 7, 2024
Reusing a stored model (llama-3.1-8b-instruct) with a proper profile Models nim , llama-31-8b-instruct , llama	0	208	October 30, 2024
LoRA swapping inference Llama-3.1-8b-instruct \| Exception: lora format could not be determined Models nim , llama3-8b-instruct , llama-31-8b-instruct , llama	4	264	October 22, 2024
Running Ollama / llama3.1 on Jetson AGX Xavier 16gb is it possible? how-to? Jetson AGX Xavier generative_ai , llama-31-8b-instruct	8	2607	October 19, 2024
NIM API key not Found Models nim , llama-31-8b-instruct , llama	4	822	September 21, 2024
API connect Models nim , llama-31-8b-instruct , llama	1	259	September 20, 2024
Problem with installation of Llama 3.1 8b NIM Models nim , llama3-8b-instruct , llama-31-8b-instruct , llama	1	653	September 4, 2024