Hello NVIDIA team,
I am currently using the NVIDIA API (NIM) for a project and I have reached the default rate limit of 40 Requests Per Minute .Could you please assist in reviewing my account and increasing the quota?
Thank you for your support.
Hello NVIDIA team,
I am currently using the NVIDIA API (NIM) for a project and I have reached the default rate limit of 40 Requests Per Minute .Could you please assist in reviewing my account and increasing the quota?
Thank you for your support.
i think that you don’t understud that is free test plan …. it’s not fair use to request 200 rpm … if you planed to used it in concurent workflow you should think about purchase some cloud hardware like nvidia allready purpose or maybe stop conccurency
in fact current 40 rpm is allready enougth to test model and they don’t need to upgrade that because api is allready dying every day because too much people are on it, so if you need more power you have to take a look at Brev.dev Console that allready exist and will help you and nvidia
hope that you understud FREE TESTING is not made for hardcore usage
please take care about the ToS and the QoS from NIM API
You’re basically doing unpaid PR for NVIDIA across multiple threads.
Just to be clear: repeating the same defense everywhere won’t get you any free instances or special treatment.
Also, 40 RPM isn’t enough to test even a basic concurrent workflow.
So yes, people asking for more quota are making a valid point.
-------- Messaggio originale --------
basicly 40 RPM is really enoutgh to test models you are not supposed to build strong and powerfull system with that , as all of other sayed you are doing multiple agent at same time so you can easly seen that you concept is working until reached the 40 RPM so if you want to get more , use your credit card instead of killing the free NIM API, please learn what is fair use
You’re missing the point.
With 40 RPM I can’t even handle a simple static landing page edit in OpenClaw
one user action already triggers multiple requests (generation, validation, retries). It hits the limit instantly.
This isn’t about building “powerful systems”, it’s about reaching the minimum threshold to test a real workflow. Right now, it doesn’t.
-------- Messaggio originale --------
you missing something NIM free plan is not made for daly usage is only made to test models ;) not for daly usage , as i told you take a look at Brev.dev Console <3
also you can maybe try to setup you openclaw to only use one agent at time even multiple at time
Hi there,
Thanks for the detailed request.
I can elevate this internally to the team, as many others have had similar requests, but we cannot approve rate-limit increases directly from the forum.
I know that’s not the answer you are looking for, but I will do my best to ensure forum posters are heard in the decision-making process.
Thanks,
Aharpster
Hello NVIDIA Team,
I would like to kindly request a rate limit increase for my NVIDIA NIM API key as well.
Current limit: ~40 RPM
Requested: 200 RPM (or the next available tier for individual developer use)
I am using NIM models (MiniMax M2.7, Kimi K2.5, Qwen 3.5 VLM, etc.) through LiteLLM + Claude Code extension in VS Code for personal AI development and testing.
The current limit causes frequent 429 errors and significantly slows down the workflow.
This is for personal / non-commercial use only.
Thank you in advance for considering my request!
Best regards,
engielll
thanks to read other topics that has been rejected :)