Integrate voice commands so the robot can reply by voice

Dear All,

I have completed the demo with JetBot as instructed.

How can I integrate voice commands so that the robot can reply back by voice?

I am using the Jetson Nano Developer Kit.

Thanks for your help.

This is complicated. I have actually come pretty close to that, but your first step is to get sound working. See the threads related to sound processing over I2S, which requires a new dtb.

For speaking back:
The easiest way to get a Linux machine like the Nano to speak is to install "espeak":

sudo apt-get install espeak
You can then run a shell command:
"espeak 'hello world'" and it speaks. It is, however, quite robotic.

A better system for speech is Festival, especially if you go through the trouble to download additional high-quality voices from various research centers.

sudo apt-get install festival
You can then:
festival -b '(SayText "hello world")'
or perhaps:
echo "Hello World" | festival --tts

You can then call these commands from a Python script using the "subprocess" module, or use the libraries/servers that come with the packages to call them directly from C. Check the open-source sites and documentation for those packages for more details.
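As a minimal sketch of the subprocess approach (assuming `espeak` and/or `festival` are on your PATH; the helper names here are my own, not from either package):

```python
import shutil
import subprocess

def festival_saytext(text):
    """Build the Scheme expression Festival evaluates to speak the text."""
    # Escape double quotes so the Scheme string stays well-formed.
    return '(SayText "%s")' % text.replace('"', '\\"')

def speak(text):
    """Speak text with Festival if installed, otherwise fall back to espeak."""
    if shutil.which("festival"):
        # Equivalent to: echo "text" | festival --tts
        subprocess.run(["festival", "--tts"], input=text.encode(), check=True)
    elif shutil.which("espeak"):
        subprocess.run(["espeak", text], check=True)
    else:
        raise RuntimeError("neither festival nor espeak is installed")
```

On the robot you would just call `speak("hello world")` from whatever script drives the JetBot.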

Voice recognition is harder, because the good models (Google Assistant, Amazon Alexa, Apple Siri, Microsoft Cortana) aren't publicly available outside their respective ecosystems.
Those models all benefit from previous research, of which the "Sphinx" system is probably the best documented.
You could download the software and an appropriate language model from Carnegie Mellon:
This will let you build tools and libraries that you can, in turn, use from your own program.
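As a hedged sketch of "use it from your own program": the keyword-to-intent routing below is plain Python and runnable anywhere; the commented-out part assumes CMU's `pocketsphinx` Python bindings (package name and `LiveSpeech` API are an assumption about your setup, so verify against the pocketsphinx docs):

```python
def match_intent(phrase, intents):
    """Return the first intent whose keywords all appear in the phrase."""
    words = set(phrase.lower().split())
    for intent, keywords in intents.items():
        if set(keywords) <= words:
            return intent
    return None

INTENTS = {
    "move_forward": ["go", "forward"],
    "stop": ["stop"],
    "greet": ["hello"],
}

# Assumed pocketsphinx usage (needs the package, a language model, and a mic):
# from pocketsphinx import LiveSpeech
# for phrase in LiveSpeech():
#     print(match_intent(str(phrase), INTENTS))
```

The idea is that the recognizer only has to produce rough text; your own code decides what the robot does with it.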

@snarky Have you tested adding high-quality voices to Festival?

I tried it last night on the Nano, and it didn’t work. For some reason festival wouldn’t recognize the voices I installed (all the CMU Arctic ones). Then I tried essentially the same thing on a desktop machine, and it worked fine.

I was going to try again on the Nano (using just one CMU voice, like I did on my PC), but then I came across espeak. For my needs I actually kinda like its British Female voice (the F3 one).

It works great on the Nano, and it seems highly configurable.
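For reference, the configurability I mean is espeak's command-line flags; here is a small sketch that builds the command for the F3 variant (the specific speed/pitch values are just my own taste, not anything official):

```python
# Hedged sketch: build an espeak command line for the British female F3 variant.
# Flag meanings: -v voice, -s speed (words per minute), -p pitch (0-99).
def espeak_cmd(text, voice="en+f3", speed=150, pitch=60):
    return ["espeak", "-v", voice, "-s", str(speed), "-p", str(pitch), text]

print(" ".join(espeak_cmd("Hello from the robot")))
# On the Nano, run it with: subprocess.run(espeak_cmd("..."), check=True)
```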

For the audio I used the Adafruit speaker bonnet. It plugs right onto the Nano, and the dtb from the following instructions worked fine.

I copied the dtb over AFTER the SDK OS image build process finished, right before the image was written. I tested it with JetPack 4.2; I haven't tested the latest, JetPack 4.2.1.

Here is a link to the speaker bonnet I used

I went with the 3W 4 Ohm speaker set, but it's a little quiet even at 100%, so I might have to make some changes. I really want to use a much higher-powered "glass" speaker that would look really cool and is waterproof, but it needs more power.

I haven’t decided if I’ll have voice recognition.

I'm going to play around with Hey-Jetson, but I don't know how well it works on the Nano. It seems like it was mostly tested on the TX2, which is faster, and it doesn't mention the TX1 at all. The Nano is roughly half a TX1.

The Snips NLU platform might be interesting for your purposes; I personally tried the home assistant app.
You can create your own app so that the robot replies according to your intent, etc.
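If you go the Snips route, intents arrive as JSON messages over MQTT; the payload shape below is an assumption from memory (check it against what your Snips console actually publishes), but parsing it is plain standard-library Python:

```python
import json

def parse_intent(payload):
    """Pull the intent name and slot values out of a hermes-style message."""
    msg = json.loads(payload)
    name = msg["intent"]["intentName"]
    slots = {s["slotName"]: s["value"]["value"] for s in msg.get("slots", [])}
    return name, slots

# Example payload (shape assumed; verify against your own Snips output).
example = json.dumps({
    "intent": {"intentName": "MoveRobot"},
    "slots": [{"slotName": "direction", "value": {"value": "forward"}}],
})
print(parse_intent(example))
```

You would hook a function like this up to your MQTT client's message callback and dispatch to robot actions from the returned intent name.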

Install Snips on Jetson:

Let me know if that works out for you!

Festival is sensitive to the path where you put the voices. I’ve used custom voices with Festival in multiple versions, and have had to move the voices around to get them to be recognized. I haven’t used Festival on the Nano, though. I’ve used it with the Xavier on an earlier Jetpack, and on other Ubuntu and even Arch Linux systems.
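Since the path is the usual culprit, here is a small helper to see which voices are actually sitting where Festival can find them; the directories listed are the usual Debian/Ubuntu locations (an assumption, adjust for your install prefix):

```python
import os

# Usual Festival voice locations on Debian/Ubuntu (an assumption).
CANDIDATE_DIRS = [
    "/usr/share/festival/voices",
    "/usr/local/share/festival/voices",
]

def find_voices(extra_dirs=()):
    """Return (voice_name, path) pairs under <dir>/<language>/<voice>."""
    found = []
    for base in list(CANDIDATE_DIRS) + list(extra_dirs):
        if not os.path.isdir(base):
            continue
        for lang in sorted(os.listdir(base)):
            lang_dir = os.path.join(base, lang)
            if not os.path.isdir(lang_dir):
                continue
            for voice in sorted(os.listdir(lang_dir)):
                found.append((voice, os.path.join(lang_dir, voice)))
    return found

for name, path in find_voices():
    print(name, path)
```

Inside Festival itself, I believe `(voice.list)` will show which voices it recognized, which is a good cross-check against what this script finds on disk.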

Thank you so much!

I will try it today.