Cannot connect to endpoint https://integrate.api.nvidia.com/v1

hovancon1998 · February 17, 2025, 2:12pm

Hi team,

I am trying to use NVIDIA NIM via the OpenAI SDK, as shown below:

from openai import OpenAI

client = OpenAI(
  base_url = "https://integrate.api.nvidia.com/v1",
  api_key = "$API_KEY_REQUIRED_IF_EXECUTING_OUTSIDE_NGC"
)

completion = client.chat.completions.create(
  model="meta/llama-3.3-70b-instruct",
  messages=[{"role":"user","content":"Write a limerick about the wonders of GPU computing."}],
  temperature=0.2,
  top_p=0.7,
  max_tokens=1024,
  stream=True
)

for chunk in completion:
  if chunk.choices[0].delta.content is not None:
    print(chunk.choices[0].delta.content, end="")

But the request runs for a very long time and never returns a response. Can you help me with this case?

Thank you!

sophwats · February 17, 2025, 3:06pm

Hi @hovancon1998

The NVIDIA API catalog offers a no-cost trial experience of NVIDIA NIM, and you may experience extended wait times during periods of high load. To ensure consistent performance, we recommend the following options:

Self-host the API on your cloud provider or on-prem. Research and test use is free under the ‘NVIDIA Developer Program’ access. Please note that your organization must have an NVIDIA AI Enterprise license for production use.
Use the serverless NIM API on Hugging Face with pay-per-use pricing. The NVIDIA AI Enterprise license is included with this option, so you don’t need a separate license.
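
If you stay on the trial endpoint, one way to cope with slow periods is to set a client-side timeout and retry with a growing delay, so a request fails fast instead of appearing to hang. The following is only a minimal sketch, not an official NVIDIA recommendation. The helper names (backoff_delays, call_with_retries, ask_nim), the NVIDIA_API_KEY environment variable, and the 30-second timeout are assumptions chosen for illustration; timeout and max_retries are existing options of the OpenAI Python client.

```python
import os
import time


def backoff_delays(retries, base=2.0, cap=30.0):
    """Return exponential backoff delays in seconds: base, 2*base, 4*base, ... capped at `cap`."""
    return [min(cap, base * (2 ** i)) for i in range(retries)]


def call_with_retries(fn, retries=3, base=2.0, retry_on=(Exception,), sleep=time.sleep):
    """Call fn(); on a retryable error, wait and try again, up to `retries` extra attempts."""
    delays = backoff_delays(retries, base)
    for attempt in range(retries + 1):
        try:
            return fn()
        except retry_on:
            if attempt == retries:
                raise  # out of attempts: surface the last error to the caller
            sleep(delays[attempt])


def ask_nim(prompt):
    """Hypothetical helper: send one chat request with a timeout and retries."""
    # Imported here so the helpers above can be used without the SDK installed.
    from openai import OpenAI, APIConnectionError, APITimeoutError

    client = OpenAI(
        base_url="https://integrate.api.nvidia.com/v1",
        api_key=os.environ["NVIDIA_API_KEY"],  # assumed environment variable name
        timeout=30.0,   # assumed value; much shorter than the SDK default
        max_retries=0,  # turn off the SDK's own retries; handled by call_with_retries
    )

    def request():
        completion = client.chat.completions.create(
            model="meta/llama-3.3-70b-instruct",
            messages=[{"role": "user", "content": prompt}],
            temperature=0.2,
            top_p=0.7,
            max_tokens=1024,
        )
        return completion.choices[0].message.content

    # Retry only on timeouts and connection problems, not on every error.
    return call_with_retries(
        request, retries=3, retry_on=(APITimeoutError, APIConnectionError)
    )
```

Calling ask_nim("Write a limerick about the wonders of GPU computing.") would then either return the text or raise after the final attempt, which makes it easier to tell a slow endpoint from a wrong API key or a network problem. Retrying does not remove the queueing on the trial service; it only keeps the client from waiting on a single stalled request.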
