I’m trying to deploy a custom Arabic ASR model (converted from NeMo .nemo to .riva) with Silero VAD v5 integration using riva-build speech_recognition in Riva 2.17.0, but I’m getting a “Model class silerovad not recognized” error.
What I’m trying to achieve:
Deploy my custom ASR model with neural VAD-based endpointing instead of decoder-based (greedy_ctc) endpointing.
What I’ve tried:
Questions:
Does riva-build speech_recognition support integrating an external VAD model file directly, or must VAD be deployed separately using Riva Quick Start’s asr_accessory_models approach?
If riva-build supports it, what’s the correct syntax to specify the VAD model file?
If not, can I deploy my custom .rmir alongside Quick Start’s Silero VAD accessory model?
@mehadi.hasan
I’ve tried Riva version 2.14.0 and 2.17.0 but I get the same issue in both. And I ran out of disk space, so I didn’t try version 2.19.0.
Let me know if there’s any way to handle this within this version, otherwise I can give 2.19.0 version a try.
@Vaneeza_Ahmad note that support for SileroVAD was added with Riva 2.18.0, see Riva Quickstart (Riva SDK) documentation (see Release Notes — NVIDIA Riva for details).
Note also, that the Riva Quickstart (Riva SDK) is already at version 2.24.0, and in this form supported only on NVIDIA Jetson Thor (Support Matrix — NVIDIA Riva).
In parallel, a lot of work has been done with Riva ASR NIM, which is the current way to go for datacenter deployments. There’s an awesome pack of documentation available for it, including instructions on deploying custom models as NIM ( Deploying Custom Models as NIM — NVIDIA NIM Riva ASR ). Release Notes — NVIDIA NIM Riva ASR and Support Matrix — NVIDIA NIM Riva ASR are also available. FYI. internally NIM is still based on Riva, hence the build/deploy steps are similar or identical to Riva SDK, but Riva versions may differ among NIM models, with most recent once using Riva at version 2.24.0.