Riva ASR acoustic model finetune

yoav.ellinson · September 13, 2022, 7:01am

Hi,
I am building an ASR application with Riva ASR in French.
Due to noisy environment (acoustic environment), the stock french Conformer model (RIVA Conformer ASR French | NVIDIA NGC ) is not sufficient enough so I would like to finetune it with augmented data from the same acoustic environment.
While following the finetune jupyter notebook (tutorials/asr-python-advanced-finetune-am-citrinet-tao-finetuning.ipynb at stable · nvidia-riva/tutorials · GitHub ), it is required to create a tokenizer for training,
What are the recommended configurations for the tokenizer (bpe/spe/wpe)? and what is the correct vocab size?
While talking to the technical team at NVIDIA they suggested to finetune with the vocab file that is available on ngc (Riva ASR French LM | NVIDIA NGC dict_vocab_2.1.txt) . How can that be done? what tokenizer should be used while doing this?

Thanks,
Yoav

Hardware - GPU A6000 x2
Hardware - CPU Intel Xeon Silver 4216 2.1GHz x2
Operating System Ubuntu 20.04
Riva Version 2.40
How to reproduce the issue ? (This is for errors. Please share the command and the detailed log here)

rvinobha · September 13, 2022, 7:39am

Hi @yoav.ellinson

Thanks for your interest in Riva,

I will check regarding your questions on finetuning with the team and get back soon

Thanks

Topic		Replies	Views
Riva model for FR-EN transcription Riva audio	2	778	January 2, 2022
Riva Citrinet Language Model Riva	4	982	November 22, 2021
Riva model deployment issue Riva inception	8	1560	April 4, 2024
VAD / Endpointing configurations Riva	2	711	December 10, 2024
Application will be correcting they pronunciations and they speech will be clear Riva	2	440	January 27, 2023
Issue Deploying Fine-Tuned Arabic Conformer Model in NVIDIA Riva: No Transcriptions Returned Riva	0	59	December 1, 2024
Error finetuning with new catalog RIVA Citrinet ASR English model - "Archive doesn't have the required runtime, format, version or object class type" Riva	1	695	April 22, 2022
Unable to get interim transcripts for RIVA Unified Conformer ASR Japanese model Riva riva	1	29	April 15, 2025
Finetune AM from pretained riva/tlt model Riva riva	9	744	August 23, 2022
Help Needed: Riva ASR Model Not Detecting Audio Riva riva	1	76	April 22, 2025

Riva ASR acoustic model finetune

Related topics