Hi,
I'm trying to run a downloadable NIM (llama-3.1-8b-instruct-RTX) on an RTX 4090.
What am I doing wrong? Compose output:
llama-3.1-8b-instruct-RTX-1 | raise ValueError(f"The model {request.model} does not exist.")
llama-3.1-8b-instruct-RTX-1 | ValueError: The model meta/llama-3.1-8b-instruct-RTX does not exist.
llama-3.1-8b-instruct-RTX-1 | await self.middleware_stack(scope, receive, send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/middleware/errors.py", line 187, in __call__
llama-3.1-8b-instruct-RTX-1 | raise exc
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/middleware/errors.py", line 165, in __call__
llama-3.1-8b-instruct-RTX-1 | await self.app(scope, receive, _send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/middleware/cors.py", line 85, in __call__
llama-3.1-8b-instruct-RTX-1 | await self.app(scope, receive, send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/middleware/exceptions.py", line 62, in __call__
llama-3.1-8b-instruct-RTX-1 | await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 62, in wrapped_app
llama-3.1-8b-instruct-RTX-1 | raise exc
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 51, in wrapped_app
llama-3.1-8b-instruct-RTX-1 | await app(scope, receive, sender)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/routing.py", line 715, in __call__
llama-3.1-8b-instruct-RTX-1 | await self.middleware_stack(scope, receive, send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/routing.py", line 735, in app
llama-3.1-8b-instruct-RTX-1 | await route.handle(scope, receive, send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/routing.py", line 288, in handle
llama-3.1-8b-instruct-RTX-1 | await self.app(scope, receive, send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/routing.py", line 76, in app
llama-3.1-8b-instruct-RTX-1 | await wrap_app_handling_exceptions(app, request)(scope, receive, send)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 62, in wrapped_app
llama-3.1-8b-instruct-RTX-1 | raise exc
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/_exception_handler.py", line 51, in wrapped_app
llama-3.1-8b-instruct-RTX-1 | await app(scope, receive, sender)
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/starlette/routing.py", line 73, in app
llama-3.1-8b-instruct-RTX-1 | response = await f(request)
llama-3.1-8b-instruct-RTX-1 | ^^^^^^^^^^^^^^^^
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/fastapi/routing.py", line 301, in app
llama-3.1-8b-instruct-RTX-1 | raw_response = await run_endpoint_function(
llama-3.1-8b-instruct-RTX-1 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/fastapi/routing.py", line 212, in run_endpoint_function
llama-3.1-8b-instruct-RTX-1 | return await dependant.call(**values)
llama-3.1-8b-instruct-RTX-1 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/nim_llm_sdk/entrypoints/openai/api_server.py", line 657, in create_chat_completion
llama-3.1-8b-instruct-RTX-1 | ) = self.openai_serving_chat._maybe_get_adapters(request)
llama-3.1-8b-instruct-RTX-1 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
llama-3.1-8b-instruct-RTX-1 | File "/opt/nim/llm/.venv/lib/python3.12/site-packages/vllm/entrypoints/openai/serving_engine.py", line 235, in _maybe_get_adapters
llama-3.1-8b-instruct-RTX-1 | raise ValueError(f"The model {request.model} does not exist.")
llama-3.1-8b-instruct-RTX-1 | ValueError: The model meta/llama-3.1-8b-instruct-RTX does not exist.
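
In case it's useful: the error is raised by the OpenAI-compatible layer rejecting the model name in my request, so I figure I can check what the container actually registered via GET /v1/models. A minimal check from the host (assuming the default port mapping of 8000; the model id in the second request is a placeholder for whatever /v1/models returns):

# Ask the running NIM which model id(s) it actually serves
curl -s http://localhost:8000/v1/models

# Retry the chat completion with the exact "id" reported above
curl -s http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "<id-from-v1-models>", "messages": [{"role": "user", "content": "hello"}]}'

Sending meta/llama-3.1-8b-instruct-RTX in the request body produces the traceback above, so I suspect the served model id differs from the one I'm sending, but I don't know where the mismatch comes from.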