How to Train FastPitch with custom labels?

azhar.dhiaulhaq · March 8, 2022, 8:05am

Please provide the following information when requesting support.

• Hardware : NVIDIA TESLA V100
• Network Type : FastPitch
• Training spec file : FastPitch train specs (yaml)
• How to reproduce the issue ? : change notations in train.yaml into chars

Hello. I want to train Text to Speech FastPitch model using custom labels. In Training spec file, it used phonemes as default value for notations. I tried to change the value into chars, but it produce error as below :

TypeError: Error instantiating ‘nemo.collections.asr.data.audio_to_text.AudioToCharWithPriorAndPitchDataset’ : init() missing 1 required positional argument: ‘labels’

How to add those argument ? I already tried to add labels argument in train.yaml file and add +labels argument in running command, neither of them work. Still give me the same error.

Thank you.

Morganh · March 8, 2022, 2:03pm

May I know if you meet the same issue with official released jupyter notebook?
TAO Toolkit Quick Start Guide — TAO Toolkit 3.22.05 documentation →
Text to Speech Notebook | NVIDIA NGC → https://catalog.ngc.nvidia.com/orgs/nvidia/teams/tao/resources/texttospeech_notebook/version/v1.0/files/text-to-speech-training.ipynb

azhar.dhiaulhaq · March 8, 2022, 3:47pm

Hello. Thank you for the answer.

The tutorial in official released jupyter notebook is working just fine. But I need to use chars type in the vocab to exactly match my sentences in the metadata file. is that possible to do that?

Thank you

Morganh · March 8, 2022, 5:06pm

Please double check the dataset.
More info can be found in NeMo/audio_to_text.py at main · NVIDIA/NeMo · GitHub and NeMo/vocabs.py at 213d6685d8adfb943ba763d1c7e1e4eb9c68fb62 · NVIDIA/NeMo · GitHub

You can debug below files directly inside tao docker.
/opt/conda/lib/python3.8/site-packages/nemo/collections/asr/data/audio_to_text.py
/opt/conda/lib/python3.8/site-packages/nemo/collections/asr/data/vocabs.py

system · March 22, 2022, 5:06pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Tao Finetuning TAO Toolkit	24	1173	December 26, 2022
TAO Toolkit Text-to-speech Chinese support TAO Toolkit	3	505	February 1, 2022
Error training from scratch with character 'O' in LPRNet TAO Toolkit	14	1096	June 25, 2021
Tao speech_to_text evaluate+infer show very weak results TAO Toolkit	26	2231	March 8, 2022
How to use MFA toolkit for Fastpitch Riva riva	1	477	June 17, 2021
Errors encountered when using TAO to train LPRnet TAO Toolkit	19	812	November 17, 2021
[TTS] Riva support Fastpitch + GST (global style token) model? Riva	2	615	February 3, 2023
LPRNet - Poor Accuracy when training from scratch TAO Toolkit	9	1016	October 12, 2021
[TLT3.0][RIVA][Jasper] KeyError manifest.yaml not found TAO Toolkit riva	26	1229	September 7, 2021
Faster-RCNN for Character recognition TAO Toolkit	8	891	October 12, 2021

How to Train FastPitch with custom labels?

Related topics