I have LaunchPad access with 8 NVIDIA H100 NVL GPUs. I can deploy and use VSS with NVILA, but I cannot get VSS to work with the gpt-4o model in place of NVILA. I followed the instructions in the NVIDIA documentation ("Configure the VLM", Video Search and Summarization Agent). I am attaching images of the overrides file, which I modified to use gpt-4o through an Azure OpenAI API key.
Below are the commands:
OPENAI_API_KEY='XXXXXXXXXXXXXXX'
Warning FailedMount 2m10s (x2177 over 3d1h) kubelet MountVolume.SetUp failed for volume "secret-ngc-api-key-volume" : secret "ngc-api-key-secret" not found
Yes, I created and used the NGC API key when deploying VSS with NVILA. That deployment succeeds, but when I change the model to gpt-4o the deployment fails.
Thanks for pointing that out. It resolved the error while deploying the services. I can now access the UI, but when I try to use the summarization API I get the error below:
Hi @ina.khandelwal, could you try with a sample video packaged inside the VSS container? We have tried this and it works well.
Summarize the bridge video using the Gradio UI (this adds the file to the backend).
From the Swagger UI, execute the GET /files API to get the asset ID of the bridge file.
Then call the summarize API from the Swagger UI. Before running it, change “id” to the asset ID of the bridge file and remove “api_type” from the request body.
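For anyone who prefers a script over the Swagger UI, the same steps could look roughly like the sketch below. It is only an illustration under several assumptions that are not confirmed in this thread: the base URL and port are hypothetical, the GET /files response is assumed to be a JSON object with a "data" list whose entries carry "id" and "filename" fields, and the summarize endpoint is assumed to be POST /summarize accepting the same JSON body that Swagger shows. Adjust these to match your deployment.

```python
import json
import urllib.request

# Hypothetical address of the VSS backend; replace with your own host and port.
BASE_URL = "http://localhost:8100"


def find_asset_id(files_response, name_fragment):
    """Return the id of the first file whose name contains name_fragment.

    Assumes the GET /files response looks like
    {"data": [{"id": "...", "filename": "..."}, ...]}.
    """
    for entry in files_response.get("data", []):
        if name_fragment in entry.get("filename", ""):
            return entry["id"]
    return None


def build_summarize_payload(template, asset_id):
    """Copy the request body, set "id" to the asset id and drop "api_type"."""
    payload = dict(template)
    payload["id"] = asset_id
    payload.pop("api_type", None)
    return payload


def get_json(url):
    """Send a GET request and decode the JSON response."""
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def post_json(url, payload):
    """Send a JSON POST request and decode the JSON response."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def summarize_bridge(template):
    """Look up the bridge file and request a summary (endpoint paths assumed)."""
    files = get_json(BASE_URL + "/files")
    asset_id = find_asset_id(files, "bridge")
    if asset_id is None:
        raise RuntimeError("bridge file not found; summarize it once in the UI first")
    return post_json(BASE_URL + "/summarize", build_summarize_payload(template, asset_id))
```

Calling `summarize_bridge(body)` with the example request body copied from the Swagger UI would then send it with the corrected “id” and without “api_type”. Nothing is sent until that function is called.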
The UI is working fine, but I want to use the APIs, and I get that error when calling the summarization API. It should work for other videos as well, yet the API does not work even for the default videos.
After you switch the VLM to gpt-4o, does the UI still work?
We suggest that you first load the “bridge video” using the Gradio UI, which makes it easy to obtain the file ID. Then you can use the API to get the summary.
Do you mean that the API fails even with our “bridge video”?
The UI works fine for all videos, but when I call the summarization API for any video, default or external, the response does not show me the content.