Jetbot Voice-Activated Copilot Tools with Nvidia RIVA and NanoLLM Container for ROS2 Robot - version 2.0

jenhungho · September 21, 2024, 4:25am

Jetbot Voice-Activated Copilot Tools: Empowering Your ROS2 Robot with Voice-Activated Copilot Functionality

Experience the power of voice control for your ROS2 robot with the Jetbot Voice-Activated Copilot Tools. This project leverages the capabilities of the Nvidia RIVA ASR-TTS service, enabling your robot to understand and respond to spoken commands.

In this version 2 update, in addition to the features of V1 Jetbot Voice To Action Tools such as natural chat greetings, Lidar-assisted self-driving for object avoidance, and real-time person following, you can further enhance your robot’s interactions. Version 2 introduces support for multiple AI models, including LLM and VLM chat support, and hosts ROS2 under the NanoLLM Docker container to simplify setup procedures.

Key Features:

Jetbot ASR Processor: Enables your robot to decode human voice messages using the Nvidia RIVA ASR service client ROS2 node.
Jetbot TTS Processor: Converts chat-vision LLM and VLM response text into speech using Nvidia RIVA TTS services, which is then played via the robot’s speaker. This feature enhances the interaction between the robot and humans, making it more engaging and user-friendly.
Jetbot ASR Agent: Allows you to build a simple 1D convolutional neural network (CNN) model for text classification to predict human voice intentions and pipe corresponding LLM chat, VLM vision, and actions that the robot should take.
Jetbot Voice Tools Copilot: Executes the actions corresponding to the voice commands posted via ROS2 topic from the Jetbot ASR Agent. Supported actions include:
- Large Language Model (LLM) Chat: Empower your Jetbot to respond using LLM chat. By default, it utilizes the meta-llama/Llama-2-7b-chat-hf model hosted in a ROS2 node.
- Vision-Language Model (VLM) Robot Camera Image Description: Enable your Jetbot to describe images captured by its camera. By default, it employs the Efficient-Large-Model/VILA1.5-3b model hosted in a ROS2 node.
- Lidar-assisted self-driving for safe navigation and object avoidance.
- Real-time object detection for seamless person following interactions.
- Basic robot navigation commands such as moving forward/backward and turning left/right.

Code:

Demos:
Jetbot Voice Activated Copilot Vision, Chat, and Robot Actions Demo - YouTube

dusty_nv · October 3, 2024, 10:33pm

Awesome update to your project @jenhungho, that’s great progress and your agent architecture has become quite advanced! Thanks for sharing your work and look forward to seeing where you head with this!

Topic		Replies	Views
Jetbot Voice to Action Tools with Jetson ASR Deep Learning Interface Library for ROS2 Robot Jetson Projects tensorrt , ros , opencv , jetson-inference , audio , docker , python , deep-learning	2	913	September 25, 2024
Jetson AI Lab - Agent Controller LLM Jetson Projects generative_ai	1	1130	April 30, 2024
Accelerating AI Modules for ROS and ROS 2 on NVIDIA Jetson Platform Technical Blog	5	888	February 9, 2022
NVIDIA Jetson으로 생성형 AI에 생명을 불어넣다 Technical Blog - South Korea korean	0	559	October 26, 2023
Which docker nano_llm:ros with Jetpack 6.1 L4T 36.4.0 Jetson Orin Nano generative_ai , llm	2	45	January 23, 2025
Empowering Robots to See and Understand: A Vision-Language Model Powered by Jetson AGX and Isaac Sim Jetson Projects camera , cuda , ubuntu , jetson-inference , generative_ai	1	366	December 17, 2024
Jetbot Tools utilize Jetson Inference DNN Vision Library for ROS2 Robot Jetson Projects camera , opencv , jetson-inference , python	1	798	September 25, 2023
Develop Generative AI-Powered Visual AI Agents for the Edge Technical Blog	2	52	February 15, 2025
Bringing Generative AI to Life with NVIDIA Jetson Technical Blog	0	429	October 19, 2023
Using Generative AI to Enable Robots to Reason and Act with ReMEmbR Technical Blog	1	32	September 23, 2024

Jetbot Voice-Activated Copilot Tools with Nvidia RIVA and NanoLLM Container for ROS2 Robot - version 2.0

Related topics