I would like to do some custom audio classification using Python.
Is there a tutorial for this on the NVIDIA Jetson Nano?
If training on the device is not possible, I can produce an ONNX file elsewhere. Is there a tutorial on how to import the ONNX model and run inference? And how do I get the audio from an IP cam?
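For the ONNX part of the question, a minimal inference pass with `onnxruntime` (which NVIDIA ships wheels of for Jetson) could look like the sketch below. The model file name, input shape, and label list are placeholders, not anything from this thread:

```python
# Sketch: classify one audio clip with an ONNX model via onnxruntime.
# Assumptions (placeholders, not from the thread): the model path, the
# feature shape expected by the model, and the label list.
import numpy as np

def softmax(logits):
    """Convert raw model outputs (logits) to probabilities."""
    e = np.exp(logits - np.max(logits))
    return e / e.sum()

def classify(model_path, features, labels):
    """Run one inference pass; `features` is a float32 array shaped
    the way the exported model expects, e.g. (1, num_features)."""
    import onnxruntime as ort  # pip install onnxruntime (or the Jetson wheel)
    sess = ort.InferenceSession(model_path)
    input_name = sess.get_inputs()[0].name
    logits = sess.run(None, {input_name: features})[0][0]
    probs = softmax(logits)
    return labels[int(np.argmax(probs))], float(np.max(probs))
```

Usage would be along the lines of `classify("sound_model.onnx", feats, ["dog", "siren", "glass"])`, where `feats` comes from whatever feature extraction the model was trained with.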
Do you want to find a speech recognition sample for Jetson?
If yes, it’s recommended to check the sample below:
It is a pre-trained model, I assume. And I cannot re-train it the way I can with object detection, right?
My application is simpler: I just need to distinguish several different sounds, no need to convert speech to text.
I'm still looking for a way to get the audio from an IP cam through the RTSP protocol…
If there were sample Python code that gets the RTSP audio and runs inference on it, that would be great.
(just like dusty-nv's object detection samples)
And how do I get the RTSP audio?
I found this in the forum
How to install PyAudio? (L4T 32.2.3) - Jetson & Embedded Systems / Jetson Nano - NVIDIA Developer Forums
But it gets the audio from a WAV file, not from an RTSP audio stream…
Is there any way on Jetson to convert RTSP audio into a virtual microphone?
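One common way to get RTSP audio into Python without a virtual microphone is to let `ffmpeg` decode the stream and pipe raw PCM into the process. This is a sketch, assuming `ffmpeg` is installed on the Jetson; the RTSP URL is a placeholder:

```python
# Sketch: decode the audio track of an RTSP stream into a numpy array
# by piping raw PCM out of ffmpeg. Assumptions: ffmpeg is installed;
# the RTSP URL passed in is a placeholder for the real camera URL.
import subprocess
import numpy as np

def pcm16_to_float(raw):
    """Convert little-endian 16-bit PCM bytes to floats in [-1, 1]."""
    return np.frombuffer(raw, dtype="<i2").astype(np.float32) / 32768.0

def read_rtsp_audio(url, seconds=5, rate=16000):
    """Grab `seconds` of mono audio from an RTSP stream as float samples."""
    cmd = [
        "ffmpeg", "-loglevel", "quiet",
        "-rtsp_transport", "tcp",      # TCP is usually more reliable than UDP
        "-i", url,
        "-vn",                         # drop the video track
        "-ac", "1", "-ar", str(rate),  # mono, fixed sample rate
        "-f", "s16le", "-t", str(seconds),
        "pipe:1",
    ]
    raw = subprocess.run(cmd, capture_output=True, check=True).stdout
    return pcm16_to_float(raw)
```

The returned array can then be fed into the same feature extraction used for the USB-microphone classifier.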
To summarize what I have done:
I can use my USB microphone to do sound classification (using TensorFlow).
How do I convert the IP camera audio (RTSP) to a virtual microphone?
If it works, I will be happy.
Unfortunately, we don’t have a sample that deals with the RTSP audio input.
But there is a sample in DeepStream that uses an audio file as input.
Since DeepStream is a GStreamer-based component, it may be easier to just replace the source with an RTSP audio input.
Mmm… I've had bad experiences with DeepStream before…
Plan B: is there any way to convert the RTSP audio to a virtual microphone? Thanks.
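For what it's worth, the "virtual microphone" plan can be approximated on a desktop-style Linux setup with PulseAudio: create a null sink, play the decoded RTSP audio into it, and record from the sink's `.monitor` source as if it were a mic. The sketch below only builds and launches the commands; it assumes PulseAudio and ffmpeg are installed, and the sink name and URL are placeholders:

```python
# Sketch of "plan B": expose RTSP audio as a virtual microphone via a
# PulseAudio null sink. Assumptions: PulseAudio and ffmpeg are both
# installed and running; sink name and RTSP URL are placeholders.
import os
import subprocess

def null_sink_cmd(sink="rtsp_mic"):
    """pactl command that creates a null sink; recording apps can then
    open the source named '<sink>.monitor' like a microphone."""
    return ["pactl", "load-module", "module-null-sink",
            f"sink_name={sink}",
            f"sink_properties=device.description={sink}"]

def feed_cmd(url):
    """ffmpeg command that decodes the RTSP audio track to PulseAudio."""
    return ["ffmpeg", "-loglevel", "quiet", "-rtsp_transport", "tcp",
            "-i", url, "-vn", "-f", "pulse", "rtsp-audio-feed"]

def start_virtual_mic(url, sink="rtsp_mic"):
    """Create the sink, then keep ffmpeg feeding it in the background."""
    subprocess.run(null_sink_cmd(sink), check=True)
    env = dict(os.environ, PULSE_SINK=sink)  # route ffmpeg's output to our sink
    return subprocess.Popen(feed_cmd(url), env=env)
```

After `start_virtual_mic("rtsp://<camera>/stream")`, a recording tool (e.g. PyAudio) should see a capture device named `rtsp_mic.monitor`. Note this depends on PulseAudio being the active sound server on the Jetson image in use.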
There is a sample that uses RTSP MP4 streaming and extracts the soundtrack.
Could you check if the following sample works for you?
$ cd /opt/nvidia/deepstream/deepstream-5.0/sources/apps/sample_apps/deepstream-audio/configs
# set the RTSP stream in ds_audio_sonyc_rtsp_test_config.txt
$ deepstream-audio -c ds_audio_sonyc_rtsp_test_config.txt
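The thread does not show the contents of ds_audio_sonyc_rtsp_test_config.txt. As an illustration only, deepstream-app-style configs select an RTSP source with a source group roughly like the following; the exact key names in the audio sample's config are an assumption here, so check the shipped file before editing:

```
[source0]
enable=1
# type=4 selects an RTSP (URI) source in deepstream-app style configs
type=4
uri=rtsp://<camera-ip>:554/<stream-path>
num-sources=1
```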