Visual Language Intelligence and Edge AI 2.0

jwitsoe · May 3, 2024, 6:25pm

Originally published at: Visual Language Intelligence and Edge AI 2.0 | NVIDIA Technical Blog

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest model comes with ~3B parameters. It is fully open source (including model checkpoints and even training code and training data). In this post, we describe how VILA performs against other…

jasonlu1 · May 3, 2024, 9:20pm

We observe very strong video understanding capability of VILA1.5 models. It is fully open sourced, feel free to try it out!

Topic		Replies	Views
Visual Language Models on NVIDIA Hardware with VILA Technical Blog	2	253	May 3, 2024
Develop Generative AI-Powered Visual AI Agents for the Edge Technical Blog	2	45	February 15, 2025
NVIDIA Jetson Orin Nano 개발자 키트, “슈퍼” 부스트 Technical Blog - South Korea jetson	1	46	December 20, 2024
Bringing Generative AI to Life with NVIDIA Jetson Technical Blog	0	426	October 19, 2023
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost Technical Blog jetson	2	153	December 19, 2024
New VILA-1.5 multimodal vision/language models released in 3B, 8B, 13B, 40B Jetson Projects generative_ai	0	1477	May 3, 2024
NVIDIA Jetson으로 생성형 AI에 생명을 불어넣다 Technical Blog - South Korea korean	0	558	October 26, 2023
Building a Multimodal AI Agent: Integrating Vision-Language Models in NVIDIA Isaac Sim with Jetson Orin AGX Jetson Projects camera , jetson-inference , isaacsim , generative_ai , jetson-platform-services	0	267	October 5, 2024
Vila를 사용하는 nvidia 하드웨어의 시각적 언어 모델 Technical Blog - South Korea	1	139	May 17, 2024
Deploying Accelerated Llama 3.2 from the Edge to the Cloud Technical Blog llama	1	69	September 25, 2024

Visual Language Intelligence and Edge AI 2.0

Related topics