Training Your Own Voice Font Using Flowtron

Originally published at: https://developer.nvidia.com/blog/training-your-own-voice-font-using-flowtron/

Recent conversational AI research has demonstrated automatically generating high quality, human-like audio from text. For example, you can use Tacotron 2 and WaveGlow to convert text into high quality, natural-sounding speech in real time. You can also use FastPitch to generate mel spectrograms in parallel, achieving good speedup compared to Tacotron 2. However, current text-to-speech…