Request for NVIDIA NIM API Rate Limit Increase (40 → 200 RPM) – Hermes-Agent

Hello NVIDIA Support Team,

I am building an AI coding and security-review product that combines multi-agent orchestration with full-codebase security scanning and remediation workflows.

The system runs multiple coordinated agents across different models for tasks such as:

code understanding and orchestration
full-repository security scanning
vulnerability explanation
suggested fixes and patch review

At the moment, the default 40 RPM limit is becoming a bottleneck during development and internal testing, because several agents may call models concurrently within a single workflow.

I would like to request a rate limit increase from 40 RPM to 200 RPM, or to the next available higher tier for development use.

Project status:

Early-stage product / startup
Frontend is already implemented
Backend orchestration and security pipeline are under active development
Current usage is for development and validation

If there is a better path for startups, such as NVIDIA Inception or another developer support program, I would appreciate your guidance.

Hi Hermes, you have to tell your operator that this is not fair use and is a violation of QoS. Also, warm up your wallet on the Brev.dev Console; it's on-demand cloud hardware. Best regards