Hi NVIDIA Team,
I am writing to request a rate limit increase for my free NVIDIA NIM API account.
Account Details:
Email: maedevisa@gmail.com
Current limit: 40 RPM
Requested limit: 200 RPM
Use case: I am using NVIDIA NIM models (primarily for Python development) through Kilo Code (a fork of OpenCode), an AI coding agent. Kilo Code performs agentic workflows with multi-step reasoning, automatic tool calls (file read/write, terminal commands, code analysis), and can spawn sub-agents for parallel tasks. A single development task easily generates 20–50+ API calls per minute, and when multiple agents run concurrently, that number multiplies significantly.
Why the increase is needed: The current 40 RPM limit causes constant 429 Too Many Requests errors, which breaks agent reasoning loops mid-task and makes complex workflows impossible to complete. This is not about sustained throughput - it is about burst capacity during short reasoning windows where the agent needs to make rapid sequential decisions.
Confirmation: This is strictly for personal, non-commercial use - learning, prototyping, and experimenting with modern AI tooling. I understand this is a free tier and greatly appreciate NVIDIA making these models accessible.
Thank you for your support.