I have run the NeMo ASR tutorial on a local Ubuntu PC with a GPU: the model trains correctly on the provided data. What I need to understand now is how to adapt that notebook to the VCTK dataset I have. It consists of wav files and text files with matching base names; each text file contains the sentence spoken in the corresponding wav file.
joepareti54@MSI /cygdrive/f/x/finance-2020/AI/Listen_attend_spell/VCTK-Corpus
$ ls -l txt/p225 | head -5
total 231
-rw-r--r--+ 1 joepareti54 None 20 Aug 22 2012 p225_001.txt
-rw-r--r--+ 1 joepareti54 None 55 Aug 22 2012 p225_002.txt
-rw-r--r--+ 1 joepareti54 None 103 Aug 22 2012 p225_003.txt
-rw-r--r--+ 1 joepareti54 None 68 Aug 22 2012 p225_004.txt
joepareti54@MSI /cygdrive/f/x/finance-2020/AI/Listen_attend_spell/VCTK-Corpus
$ ls -l wav48/p225 | head -5
total 93464
-rw-r--r--+ 1 joepareti54 None 196990 Aug 23 2012 p225_001.wav
-rw-r--r--+ 1 joepareti54 None 389676 Aug 23 2012 p225_002.wav
-rw-r--r--+ 1 joepareti54 None 749754 Aug 23 2012 p225_003.wav
-rw-r--r--+ 1 joepareti54 None 423528 Aug 23 2012 p225_004.wav
Whether NeMo is feasible for my project seems to come down mainly to preparing the manifest file and the YAML config, and perhaps other things I am not aware of. Any guidance is appreciated.
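For what it's worth, here is a minimal sketch of how such a manifest could be generated from the directory layout shown above. It assumes the standard NeMo manifest format (one JSON object per line with `audio_filepath`, `duration`, and `text` fields) and pairs each transcript with its wav file by base name; the function and output names are my own, and the duration is read from the wav header with the stdlib `wave` module.

```python
import json
import wave
from pathlib import Path

def build_manifest(txt_dir, wav_dir, manifest_path):
    """Pair VCTK .txt transcripts with their .wav files and write a
    NeMo-style JSON-lines manifest (one entry per utterance)."""
    txt_dir, wav_dir = Path(txt_dir), Path(wav_dir)
    with open(manifest_path, "w", encoding="utf-8") as out:
        for txt_file in sorted(txt_dir.glob("*.txt")):
            wav_file = wav_dir / (txt_file.stem + ".wav")
            if not wav_file.exists():
                continue  # skip utterances with a missing recording
            # duration in seconds, from the wav header
            with wave.open(str(wav_file), "rb") as w:
                duration = w.getnframes() / w.getframerate()
            text = txt_file.read_text(encoding="utf-8").strip().lower()
            entry = {
                "audio_filepath": str(wav_file),
                "duration": round(duration, 3),
                "text": text,
            }
            out.write(json.dumps(entry) + "\n")

if __name__ == "__main__":
    # hypothetical paths, following the speaker layout in the listing above
    build_manifest("VCTK-Corpus/txt/p225",
                   "VCTK-Corpus/wav48/p225",
                   "p225_manifest.json")
```

One caveat I'd expect: VCTK audio is 48 kHz, while the tutorial models are typically configured for 16 kHz, so the wavs may need resampling (or the sample rate in the YAML config adjusted) before training.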