Enhancing Naturalness in Text-to-Speech with NVIDIA Riva

ro.goab · January 2, 2024, 12:14am

Can NVIDIA Riva’s text-to-speech feature be enhanced to include speech disfluencies like 'hmm’s and 'uh’s, to make the generated speech sound more natural, particularly while the system processes calculations?

rvinobha · January 8, 2024, 7:32am

Hi @ro.goab

Thanks for your interest in Riva

Apologies this is not supported currently
For Speech Recognition we have work boosting capability
https://docs.nvidia.com/deeplearning/riva/user-guide/docs/tutorials/asr-improve-recognition-for-specific-words.html

Thanks

Topic		Replies	Views
Create Speech AI Applications in Multiple Languages and Customize Text-to-Speech with Riva Technical Blog	6	1347	January 9, 2025
Nvidia RIVA - How to do voice conversion? Riva	3	708	November 11, 2022
RIVA TTS server doesn't enunciate some words Riva	1	118	November 28, 2024
Just Released: New Updates to NVIDIA Riva Technical Blog	0	359	October 6, 2022
Just Released: New Updates to NVIDIA Riva Technical Blog	0	296	September 26, 2022
Clarification about the NVIDIA services for conversational Intelligence Riva inception	2	750	December 19, 2022
Problems running TTS Es Multispeaker FastPitch HiFiGAN in RIVA Riva	6	1266	January 30, 2023
Build Speech AI in Multiple Languages and Train Large Language Models with the Latest from Riva and NeMo Megatron Technical Blog	0	397	March 28, 2022
Deploying NVIDIA Riva Multilingual ASR with Whisper and Canary Architectures While Selectively Deactivating NMT Technical Blog	0	142	February 20, 2025
RIVA Not able to execute Speech recognition with Virtual Assist Riva	1	524	October 28, 2023

Enhancing Naturalness in Text-to-Speech with NVIDIA Riva

Related topics