Getting 429 Too many request for NIM cloud api

sumit.mehta · June 10, 2025, 4:41pm

Hi Team,

We are currently experiencing 429 (Too Many Requests) response codes when using the NVIDIA NIM cloud API. Previously, the same integration was working smoothly in our project. However, we are now encountering this issue even with as few as 10 requests in a loop.

Could you please confirm if there have been any recent changes in rate limits or if there is an ongoing issue on your end? This will help us make the necessary adjustments in our implementation.

API Endpoint: https://integrate.api.nvidia.com/v1/chat/completions

Looking forward to your response.

Below attached a curl request to 10 request :

sophwats · June 10, 2025, 5:03pm

Hi,

We are currently seeing a range of issues with NGC services, including API Endpoints.

You can track the issue here https://status.ngc.nvidia.com/.

If your error persists once the issue has been resolved please reach back out and we will look into this for you.

Best,

Sophie

sumit.mehta · June 11, 2025, 7:55am

Hi Sophie,

The status shows as issues with NGC services are resolved but I still see issue when hitting multiple request to Nvidia API endpoints, I get 429 response code (Too many requests).

Please update if Nvidia NGC services are completely up and operational.

NGC Status

Response :
{“id”:“chat-2a7b029f403741aba5f99022486bcdc5”,“object”:“chat.completion”,“created”:1749628365,“model”:“meta/llama-3.1-8b-instruct”,“choices”:[{“index”:0,“message”:{“role”:“assistant”,“content”:“Hello! How can I assist you today?”},“logprobs”:null,“finish_reason”:“stop”,“stop_reason”:null}],“usage”:{“prompt_tokens”:11,“total_tokens”:20,“completion_tokens”:9},“prompt_logprobs”:null}{“status”:429,“title”:“Too Many Requests”}
{“status”:429,“title”:“Too Many Requests”}{“status”:429,“title”:“Too Many Requests”}{“status”:429,“title”:“Too Many Requests”}
{“status”:429,“title”:“Too Many Requests”}{“status”:429,“title”:“Too Many Requests”}{“status”:429,“title”:“Too Many Requests”}{“status”:429,“title”:“Too Many Requests”}{“status”:429,“title”:“Too Many Requests”}

sophwats · June 12, 2025, 2:40pm

Hi @sumit.mehta,

I’m trying to get clarity on whether there have been changes to the trial API for the model you are using.

In the mean time, you could try adding a sleep command in your loop iterations.

(I see the same ‘429 Too many requests’ error when I run your code.)

Please note that the API Endpoints are only to be used for experimentation, development, testing and research. NVIDIA NIM FAQ

Best,

Sophie

Topic		Replies	Views
Not connect to endpoint https://integrate.api.nvidia.com/v1 Access/Accounts nim , llama	1	1330	February 17, 2025
NVIDIA NIM API / openai.API: Error code: 402,Cloud credits expired - Please contact NVIDIA representatives Models nim , llama-31-405b-instruct , llama	8	668	January 19, 2025
NVIDIA NIM API invoked by Langchain returns statuscode 500 Access/Accounts nim , llama-31-70b-instruct , llama	1	355	September 4, 2024
Get started quickly with the NIM framework, and an error occurred when trying to reproduce it NGC GPU Cloud nemo	6	1138	April 26, 2024
NIM HTTP API Inference (Run Anywhere) Taking Extremely Long! Models nim , llama-31-70b-instruct , llama-31-405b-instruct , llama	1	548	September 11, 2024
Inferencing models from api taking very long Models jetson , nim , mistral-large , deepseek , nemotron	1	173	December 19, 2025
Model Limits Models nim	4	3947	May 25, 2025
Getting Started With NVIDIA NIM Tutorial Issues with NGC Registry Access/Accounts ubuntu , nim , llm , llama3-8b-instruct	7	2222	July 24, 2024
API connect Models nim , llama-31-8b-instruct , llama	1	374	September 20, 2024
Error 504 when calling the model through api Access/Accounts nim	4	207	December 20, 2025

Getting 429 Too many request for NIM cloud api

Related topics