Model optimizations to consider for Citrinet (and speech recognition in general)

pineapple9011 · November 5, 2021, 3:47pm

Hardware - GPU T4
Operating System: Docker (Riva Server Image)
Riva Version: 1.6.0

Other than the current default optimizations performed for speech recognition models (dynamic batching and sequencing), are there any suggestions regarding instance group counts and increasing the max batch size?

The Riva docs highlight the potential performance of each model (here) but don’t go into detail about which flags to pass to riva-build for each model.

Some insight would be appreciated!

SunilJB · November 9, 2021, 6:15am

Hi @pineapple9011
I am not sure I understood your query correctly.
Could you please check the link below in case it helps?
https://docs.nvidia.com/deeplearning/riva/user-guide/docs/service-asr.html#citrinet-acoustic-model

Thanks
