Is it currently possible to deploy our own models on NVIDIA's cloud and use NIM for inference?

Currently, there are many models on the NIM website that can be used for inference directly through HTTP requests, like llama3-70b-instruct, stable-diffusion-3-medium, etc. I would like to ask whether it is possible to deploy our own pretrained models on NVIDIA's cloud in the same way, and then perform inference via HTTP requests from anywhere.
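For context, this is roughly how I call the hosted models today. A minimal sketch, assuming the OpenAI-compatible chat completions endpoint on integrate.api.nvidia.com and a placeholder API key (replace NVIDIA_API_KEY with your own):

```python
import requests

# Placeholder credential -- substitute your actual API Catalog key.
NVIDIA_API_KEY = "nvapi-..."

# Send a chat completion request to a hosted API Catalog model over HTTP.
response = requests.post(
    "https://integrate.api.nvidia.com/v1/chat/completions",
    headers={
        "Authorization": f"Bearer {NVIDIA_API_KEY}",
        "Accept": "application/json",
    },
    json={
        "model": "meta/llama3-70b-instruct",
        "messages": [{"role": "user", "content": "Hello!"}],
        "max_tokens": 128,
        "temperature": 0.5,
    },
    timeout=60,
)
response.raise_for_status()

# Print the generated reply.
print(response.json()["choices"][0]["message"]["content"])
```

What I am asking is whether an analogous endpoint could serve a model that we upload ourselves.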

Hi @901228 – at the moment we don't support users deploying their own models on the API Catalog. Are there any particular pre-trained models you'd be interested in?
