We have the choice of VITA and GPT.
How can I use a different endpoint, for example from build.nvidia.com, or any other OpenAI-compatible model?
If you have another OpenAI-compatible model, you can run the following:
docker run … -e OPENAI_API_KEY=<OPENAI_API_KEY> \
  -e VIA_VLM_OPENAI_MODEL_DEPLOYMENT_NAME=gpt-4o \
  -e VLM_MODEL_TO_USE=openai-compat
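If you want to target build.nvidia.com specifically, it can help to confirm the API key and model name work with a plain OpenAI-compatible client before passing them to the container. This is only a sketch: the base_url and model name below are assumptions for build.nvidia.com's OpenAI-compatible API, so substitute the values for your own endpoint.

# Sanity check of an OpenAI-compatible endpoint before handing the key to VIA.
# The base_url and model name are assumptions (build.nvidia.com's OpenAI-compatible
# API and one of its hosted models); replace them with your provider's values.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://integrate.api.nvidia.com/v1",
    api_key=os.environ["OPENAI_API_KEY"],
)

resp = client.chat.completions.create(
    model="meta/llama-3.1-8b-instruct",  # assumed model; use your deployment name instead
    messages=[{"role": "user", "content": "Reply with the single word: ok"}],
)
print(resp.choices[0].message.content)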
You can also load a custom model:
- Mount the directory containing the inference.py file and the optional model files in the container.
- Set the MODEL_PATH env variable to the mount path. MODEL_PATH should point to the directory containing the inference.py file in the container.
- Set VLM_MODEL_TO_USE=custom.
ls <MODEL_DIR_ON_HOST>
inference.py
docker run … -v <MODEL_DIR_ON_HOST>:<MODEL_DIR_IN_CONTAINER> \
  -e MODEL_PATH=<MODEL_DIR_IN_CONTAINER> \
  -e VLM_MODEL_TO_USE=custom …
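The interface that VIA expects from inference.py is defined in the user guide referenced below; the skeleton here is only a hypothetical illustration, and the class name, method names, and signatures are assumptions, not VIA's actual API.

# Hypothetical skeleton of an inference.py for a custom VLM.
# NOTE: the class and method names below are assumptions for illustration only;
# the interface VIA actually requires is specified in the VIA DP User Guide.
from typing import List


class CustomVLMInference:
    def __init__(self, model_dir: str):
        # Load weights/processors from the mounted model directory (MODEL_PATH).
        self.model_dir = model_dir

    def generate(self, prompt: str, frames: List[bytes]) -> str:
        # Run the model on the sampled video frames plus the text prompt and
        # return the generated caption/answer as a string.
        return "placeholder response"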
You can refer to pages 21 and 23 of the VIA DP User Guide for more details.