Build an Agentic Video Workflow with Video Search and Summarization

Originally published at: Build an Agentic Video Workflow with Video Search and Summarization | NVIDIA Technical Blog

Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system that can answer questions about video and image content? This presents a far more complex task.  Traditional video analytics tools struggle due to their limited functionality and a narrow focus on predefined…