I’ve been using the NVIDIA NIM API to run inference with the LLaMA 3.3 70B-Instruct model.
Recently, I have encountered repeated issues where the API returns: Error code: 504 (Gateway Timeout)
This has happened multiple times over the past three days, and in addition, I have noticed that the generation speed is significantly slower than usual.
Has there been any update, server-side changes, or system load issues on the NIM platform recently?
Any help or clarification would be greatly appreciated. Thank you!
While using the NVIDIA NIM API with deepseek-ai/deepseek-r1, I’ve repeatedly received 504 Gateway Timeout errors in the last three days. I’ve also observed that the model’s generation speed has slowed down considerably.
Please let me know if you are still having issues with the deepseek-ai NIM API and/or the Llama-3.3-70b-instruct NIM API and I will get our NIM team to look into it.