Develop Generative AI-Powered Visual AI Agents for the Edge

Originally published at: https://developer.nvidia.com/blog/develop-generative-ai-powered-visual-ai-agents-for-the-edge/

An exciting breakthrough in AI technology—Vision Language Models (VLMs)—offers a more dynamic and flexible method for video analysis. VLMs enable users to interact with image and video input using natural language, making the technology more accessible and adaptable. These models can run on the NVIDIA Jetson Orin edge AI platform or discrete GPUs through NIMs.…

I’m looking for reasoning VLMs that I can deploy privately. So far the only solution that works for me is google gemini 2.0. https://harpagan.com/ - Visual AI Research Agent Harpagan.com