Two channels of audio input with ML models

Hi,

I am new to the Jetson Nano. I need to use two channels of audio input on the board for my project.

Based on the link here (Models — NVIDIA NeMo), MarbleNet can be used for voice activity detection (VAD). I am interested in this and have the following questions (I hope to design an own-voice VAD device):

  1. Can the Jetson Nano connect to a binaural analog microphone?
  2. Are there any existing examples of how to run two ML models (one per audio channel) simultaneously on the Jetson Nano? (See the sketch after this list for the kind of setup I have in mind.)
  3. Are there any existing models for speaker identification (to realize the own-voice part)? I think it should be similar to MarbleNet, since both use MFCC features.
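For question 2, to make it concrete, here is a minimal, untested sketch of what I mean: capture a two-channel stream and run one ONNX model per channel in parallel. The model file name (vad.onnx), its input shape, and the feature extraction are all placeholders; a real MarbleNet pipeline would compute MFCCs first.

```python
# Sketch: capture a 2-channel stream and run one ONNX VAD model per channel.
# Spawning threads per block is simplified; a real app would use persistent
# worker threads and queues to stay real-time safe.
import threading
import numpy as np
import sounddevice as sd
import onnxruntime as ort

SAMPLE_RATE = 16000
FRAME = 1024  # samples per audio block

# One inference session per channel.
sessions = [ort.InferenceSession("vad.onnx") for _ in range(2)]

def run_vad(session, mono_block, results, idx):
    # Placeholder preprocessing: a real pipeline would compute MFCCs here.
    feats = mono_block.astype(np.float32)[np.newaxis, :]
    name = session.get_inputs()[0].name
    results[idx] = session.run(None, {name: feats})[0]

def callback(indata, frames, time, status):
    # indata has shape (frames, channels); run both channels in parallel.
    results = [None, None]
    threads = [threading.Thread(target=run_vad,
                                args=(sessions[ch], indata[:, ch], results, ch))
               for ch in range(2)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    print("per-channel VAD outputs:", results)

with sd.InputStream(channels=2, samplerate=SAMPLE_RATE,
                    blocksize=FRAME, callback=callback):
    sd.sleep(5000)  # capture for five seconds
```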

Thank you in advance for any help.

Best,

Larry

Hi,

I just realized that MarbleNet can be modified so that it can be used for own-voice VAD. Maybe a Gaussian mixture model (GMM) can be added later on for speaker recognition. Is that correct?

With this approach, only one channel of audio input is needed. I only need to figure out how to modify MarbleNet to complete the speaker recognition part; a rough sketch of the GMM idea is below. Has anyone done a similar project before?
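To make the GMM idea concrete, here is a minimal sketch of own-voice verification on MFCC features. The file names (own_voice.wav, others.wav), the component count, and the decision threshold are all assumptions for illustration.

```python
# Sketch: GMM-based own-voice verification on MFCC features,
# as a possible second stage after MarbleNet flags speech.
import librosa
from sklearn.mixture import GaussianMixture

def mfcc_features(path, sr=16000, n_mfcc=13):
    audio, _ = librosa.load(path, sr=sr)
    # Transpose to (frames, n_mfcc) for scikit-learn.
    return librosa.feature.mfcc(y=audio, sr=sr, n_mfcc=n_mfcc).T

# Enrollment: one GMM for my own voice, one for background speakers
# (a simple stand-in for a universal background model).
own_gmm = GaussianMixture(n_components=16, covariance_type="diag")
own_gmm.fit(mfcc_features("own_voice.wav"))
bg_gmm = GaussianMixture(n_components=16, covariance_type="diag")
bg_gmm.fit(mfcc_features("others.wav"))

def is_own_voice(path, threshold=0.0):
    feats = mfcc_features(path)
    # Average log-likelihood ratio: own-voice model vs. background model.
    llr = own_gmm.score(feats) - bg_gmm.score(feats)
    return llr > threshold, llr
```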

Thank you in advance for any help.

Best,

Larry

Hi,

Please check whether the repo below can meet your requirement:

Thanks.

Hi,

Thank you for your kind reply. I am new to this area, so my questions may be naive.

  1. Can I first simulate the performance of the above inference library offline (on a PC)? I hope to modify MarbleNet later on in my project.
  2. For the hardware connection: the Jetson Nano has 4 USB ports. Is a USB microphone the only thing I need to purchase to get audio input? If I need two channels of mic input, can I just plug two USB mics directly into the USB ports? If so, does that mean the Jetson Nano can support up to four audio inputs using USB mics? (A small device-listing sketch follows this list.)
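To check what the board actually sees, I was thinking of something like this (assuming the sounddevice package; each USB mic should appear as its own capture device):

```python
# List available capture devices (each USB mic shows up separately).
import sounddevice as sd

for idx, dev in enumerate(sd.query_devices()):
    if dev["max_input_channels"] > 0:
        print(idx, dev["name"], "inputs:", dev["max_input_channels"])

# To record from a specific mic, open one stream per device, e.g.:
# sd.InputStream(device=idx, channels=1, samplerate=16000)
```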

Thanks in advance.

Larry

Sorry, I have another question apart from the two mentioned above. Do you know the estimated on-device performance of the above inference library? Can the inference time be kept within 10 ms? I am wondering whether the Jetson Nano is fast enough to run these models. Thanks.

Hi,

Sorry for the late update.

1. Since TensorRT engines are hardware-dependent, you will need to test them directly on the device. (A simple timing sketch is below.)
2. Please check the topic below for the info:

3. Sorry, we don't have a perf number for Nano.
But since the Nano is quite limited in resources, you will need a lightweight model to reach the 10 ms target.
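As a rough starting point, something like the below can time a model with ONNX Runtime on the device. The model file and input shape here are placeholders; for TensorRT engines specifically, the trtexec tool that ships with TensorRT also reports latency directly.

```python
# Rough latency check for an ONNX model with ONNX Runtime.
import time
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("model.onnx")  # placeholder model file
inp = sess.get_inputs()[0]
dummy = np.random.rand(1, 64, 100).astype(np.float32)  # placeholder shape

# Warm up first so lazy initialization does not skew the numbers.
for _ in range(10):
    sess.run(None, {inp.name: dummy})

n = 100
start = time.perf_counter()
for _ in range(n):
    sess.run(None, {inp.name: dummy})
print(f"mean latency: {(time.perf_counter() - start) / n * 1000:.2f} ms")
```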

Thanks.
