When using the VideoQuery agent to apply prompts to the video feed with the VLM, like in the Live LLaVA example from Live LLaVA - NVIDIA Jetson AI Lab, the refresh rate seems to be low. Is this a limitation of the Jetson AGX Orin or the camera?
After maximizing performance, I am getting around 0.6 FPS when using Llava-7b. The benchmarks are around 1.43 FPS for Llava-7b. I should have also specified that I am using the Jetson AGX Orin Developer Kit 32GB version. Would ~0.6 FPS be around the estimated performance?
I downloaded the MLC container, but I did not know how to run the llava-v1.5-7B model in the MLC container. I use the following command from the Live LLaVA tutorial and I’m getting approximately 0.6 FPS after maximizing performance.