Rate Limit Increase Request: Developer Research & Model Fine-Tuning

Hello NVIDIA API Support Team,

I am an individual developer working on a personal research project involving large-scale instruction/response dataset generation for a specific domain-expert fine-tuning pipeline. I am requesting a rate limit increase to help complete a one-time generation phase for my local model development.

Project Context: I am building a high-precision domain-specific dataset (~100,000 entries) to fine-tune a local 7B parameter model. My pipeline currently runs on moonshotai/kimi-k2-thinking, which is excellent for this task but has high per-request latency.

Technical Hurdle: Because the Kimi-K2 model has long internal “thinking” cycles, I need to run high concurrency (20 workers) to maintain a reasonable generation timeline. However, the 40 RPM ceiling on the developer tier is causing frequent 429 errors, which significantly extend the generation runtime.
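For reference, the generation loop is throttled roughly like this (a minimal sketch, not my actual pipeline code; the limiter class, worker function, and the stubbed model call are illustrative):

```python
import threading
import time

class RpmLimiter:
    """Shared limiter that spaces requests so total rate stays under an RPM cap."""

    def __init__(self, rpm):
        self.interval = 60.0 / rpm          # minimum spacing between requests
        self.lock = threading.Lock()
        self.next_slot = time.monotonic()   # earliest time the next request may go out

    def acquire(self):
        with self.lock:
            now = time.monotonic()
            wait = max(0.0, self.next_slot - now)
            self.next_slot = max(now, self.next_slot) + self.interval
        if wait > 0:
            time.sleep(wait)

def worker(limiter, jobs, results):
    # Each worker pulls prompts until the shared job list is empty.
    while True:
        try:
            prompt = jobs.pop()
        except IndexError:
            break
        limiter.acquire()
        # The real model call to the endpoint would go here; stubbed for the sketch.
        results.append(f"generated:{prompt}")

limiter = RpmLimiter(rpm=40)                # the current developer-tier ceiling
jobs = [f"prompt-{i}" for i in range(3)]
results = []
threads = [threading.Thread(target=worker, args=(limiter, jobs, results))
           for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(len(results))
```

Even with many workers, the shared limiter serializes request start times, so the 429s at 40 RPM come from the ceiling itself rather than from burst behavior.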

Request Details:

I am requesting a temporary increase from 40 RPM to 200 RPM. This is a personal, non-commercial project focused on exploring the limits of specialized dataset generation; the increase would allow me to finish this one-time data generation phase without 429-related interruptions.

Thank you very much for your time and for supporting individual developers on your platform.

Best regards, Joseph Saghbini.

To assist with this request, my NVIDIA Cloud Account ID is 0928046057701069. This is for the moonshotai/kimi-k2-thinking model on the https://integrate.api.nvidia.com/v1 endpoint.

Hi @TomNVIDIA and @sophwats,

Following up on my request (posted ~48 hours ago) for a temporary rate limit increase to 200 RPM for the moonshotai/kimi-k2-thinking model.

I am currently hitting consistent 429 errors at the 40 RPM limit, which is stalling my research dataset generation (~100k entries).

Account Details for Review:

  • NVIDIA Cloud Account ID: 0928046057701069

  • Email: jsaghbini@outlook.com

  • Context: High-concurrency (20 workers) needed to offset the long thinking cycles of this specific model.

  • Duration: This is a one-time burst for a personal research project.

Thank you for your help in supporting individual developers on the platform!

Hi @Aharpster,

Can you please chime in here?

Hi @Aharpster, thank you for looking into this,

To clarify the request: I am an individual developer working on a personal research project (dataset generation). Because the kimi-k2-thinking model has high-latency reasoning cycles, the standard 40 RPM limit is causing constant 429 errors even with low throughput.

Account Details:

  • NVIDIA Cloud Account ID: 0928046057701069

  • Email: jsaghbini@outlook.com

  • Target: Increase from 40 RPM to 200 RPM for a one-time burst generation (~100k entries).

I already have exponential backoff logic in place to ensure I stay within the new limit. Happy to provide any other details you need!
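A minimal sketch of the kind of backoff logic mentioned above (the request function, exception type, and delay values are illustrative placeholders, not the pipeline's actual code):

```python
import random
import time

def call_with_backoff(request, max_retries=6, base_delay=1.0):
    """Retry `request` on rate-limit failures with exponential backoff plus jitter."""
    for attempt in range(max_retries):
        try:
            return request()
        except RuntimeError:  # stand-in for a 429 rate-limit error
            if attempt == max_retries - 1:
                raise  # retries exhausted; surface the error
            # Double the delay each attempt, with a little jitter to avoid
            # synchronized retries across workers.
            delay = base_delay * (2 ** attempt) + random.uniform(0, 0.25)
            time.sleep(delay)

# Demo: a request that fails twice with a simulated 429, then succeeds.
state = {"calls": 0}

def flaky_request():
    state["calls"] += 1
    if state["calls"] < 3:
        raise RuntimeError("429 Too Many Requests")
    return "ok"

print(call_with_backoff(flaky_request, base_delay=0.01))
```

With a raised ceiling, this keeps transient 429s from stalling the run while ensuring the sustained rate stays under the limit.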