"404 Page Not Found" Error When api used as openai

siyuan-l · January 24, 2025, 2:04pm

How do I use my API key in an OpenAI-compatible way?

neal.vaidya · January 24, 2025, 8:11pm

Hi @siyuan-l – take a look at one of the examples on build.nvidia.com, like the one here in the Python tab: llama-3.1-405b-instruct Model by Meta | NVIDIA NIM

Your code should look something like the following:

from openai import OpenAI

# Point the OpenAI client at NVIDIA's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",
    api_key="$API_KEY_REQUIRED_IF_EXECUTING_OUTSIDE_NGC",
)

# Request a streamed chat completion; the model ID must be spelled exactly.
completion = client.chat.completions.create(
    model="meta/llama-3.1-405b-instruct",
    messages=[{"role": "user", "content": "Write a limerick about the wonders of GPU computing."}],
    temperature=0.2,
    top_p=0.7,
    max_tokens=1024,
    stream=True,
)

# Print each streamed piece of text as it arrives; skip chunks that carry no choices.
for chunk in completion:
    if chunk.choices and chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="")

If you’re seeing a 404 error, it might be because the base_url or the model name isn’t specified correctly.
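
One way to narrow down which of the two is wrong is to compare your settings against what the endpoint itself reports. The following is only a rough sketch, not an official NVIDIA recipe: `check_request_settings` and `list_model_ids` are hypothetical helper names, and it assumes the endpoint answers the OpenAI client's standard `models.list()` call.

```python
# Base URL taken from the example in this post.
EXPECTED_BASE_URL = "https://integrate.api.nvidia.com/v1"


def check_request_settings(base_url, model, available_models):
    """Return a list of problems that commonly lead to a 404 (hypothetical helper)."""
    problems = []
    # A missing "/v1" suffix or a different host sends requests to the wrong route.
    if base_url.rstrip("/") != EXPECTED_BASE_URL:
        problems.append(f"base_url is {base_url!r}, expected {EXPECTED_BASE_URL!r}")
    # Model IDs must match exactly, including the publisher prefix such as "meta/".
    if model not in available_models:
        problems.append(f"model {model!r} is not among the IDs the endpoint lists")
    return problems


def list_model_ids(client):
    """Collect model IDs through the OpenAI client's models.list() call.

    Assumption: the endpoint implements the model listing route. This needs
    network access and a valid API key.
    """
    return [model.id for model in client.models.list()]
```

With the `client` from the example, `check_request_settings(str(client.base_url), "meta/llama-3.1-405b-instruct", list_model_ids(client))` should return an empty list when both settings look right. If the model you want is missing from the list, copying the exact ID from its page on build.nvidia.com is probably the quickest fix.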

siyuan-l · January 25, 2025, 2:40am

Thanks for your help!
This method works with LLaMA-3.1-405B-Instruct, but it doesn’t work with LLaMA-3.2-90B-Instruct. Is it because the latter hasn’t been adapted yet?

benq.engr · February 22, 2026, 7:35am

I believe the NVIDIA API endpoint doesn’t support OpenAI’s ‘v1/responses’ endpoint, which gives me a 404 in some applications that are hard-coded to call ‘v1/responses’. Could anyone confirm this, so I don’t have to resort to trial and error?
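
For anyone who wants to check this themselves, here is a minimal sketch that posts to a given route and reports the HTTP status code. `endpoint_url` and `probe_endpoint` are hypothetical helper names, and the request bodies mentioned below are assumptions based on the public OpenAI API shapes, not something NVIDIA has confirmed for this endpoint.

```python
import json
import urllib.error
import urllib.request


def endpoint_url(base_url, path):
    """Join the base URL and a route without doubling or dropping slashes."""
    return f"{base_url.rstrip('/')}/{path.lstrip('/')}"


def probe_endpoint(base_url, path, api_key, payload):
    """POST a JSON payload to base_url/path and return the HTTP status code."""
    request = urllib.request.Request(
        endpoint_url(base_url, path),
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    try:
        with urllib.request.urlopen(request, timeout=30) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # Error responses (for example 404) arrive as exceptions; return their code.
        return err.code
```

Calling `probe_endpoint("https://integrate.api.nvidia.com/v1", "chat/completions", key, {"model": model_id, "messages": [{"role": "user", "content": "hi"}]})` and then the same base URL with `"responses"` and `{"model": model_id, "input": "hi"}` lets you compare the two routes. A 404 on the second call while the first succeeds with the same model ID would suggest the route is missing, although a 404 can also come from an unknown model ID.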
