Hi NVIDIA Team,
I am currently developing an automated localization and translation system for an MMORPG game project. To handle complex game lore, dialogue contexts, and technical terms, I am heavily utilizing the NVIDIA NIM APIs—specifically benchmarking the openai/gpt-oss-120b and meta/llama-3.2-90b-vision-instruct models.
Since game localization requires processing large batches of text dialogue data simultaneously through a recursive multi-agent translation pipeline, my system triggers many concurrent API requests. As a result, I am constantly hitting the default 40 RPM threshold and getting “429 Too Many Requests” errors, which stalls the entire synchronization process.
To support my development and batch-processing needs, could you please help increase my global rate limit from 40 RPM to 200 RPM?
Here are my account configuration details:
- Organization ID: 32Bitz-NQMP
- API Key (last 4 characters): C0BZ
Thank you very much for your time and support!