What is the difference between Riva ASR w/wo NIM?

naturalx · October 15, 2024, 11:27am

I can find two documents for Riva

without NIM : Riva — NVIDIA Riva
with NIM : Riva ASR NIM Overview - NVIDIA Docs

Seems like both use docker and triton server. When using Riva, what is the benefit of using NIM?

schilton · October 18, 2024, 10:14pm

The Riva NIMs are not yet at feature parity with the classic Riva container. With the former, you can edit the config.sh script provided in the Riva Skills Quick Start resource folder to activate any combination of ASR, NMT, and TTS services; choose from among over a dozen ASR languages and 5 TTS languages; and specify your desired model architectures for any of the above services. Once you’ve edited config.sh , run riva_init.sh to download the models and deploy the pipeline, then run riva_start.sh to launch the Riva server.

Conversely, each Riva NIM gives you only one model (and, in the case of ASR and TTS, only one language) at a time. Running multiple Riva services as NIMs simultaneously is tricky, but doable. The simplest method I know is to download the constituent models of each NIM to the same directory and then launch one NIM. I demonstrate how to do so in this dev blog and this video. I imagine one could set up a docker-compose file or a Helm chart to accomplish the same task, but in the former case, you’d almost certainly have to change the port that each NIM except one listens on: by default, each Riva NIM listens on port 50051, just like the classic container.

Here’s an advantage to using Riva NIMs rather than the classic Riva container: If you don’t want to run the Riva server yourself, you can query the hosted endpoint (and associated function ID for each NIM) described in the Try API tab on each Riva NIM’s model page for inference. Moreover, since our entire software development paradigm is shifting toward NIMs, once the Riva NIMs are at feature parity with the classic Riva container, we’ll probably start developing NIMs exclusively. It can’t hurt to start familiarizing oneself with Riva NIMs now, if they support the desired model architectures and languages.

naturalx · October 19, 2024, 11:40am

Thanks for your clarification. It helped me alot. Thanks!

system · November 2, 2024, 11:40am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can we use Riva as a standalone package? Riva	9	797	November 28, 2022
Quickly Voice Your Apps with NVIDIA NIM Microservices for Speech and Translation Technical Blog nim	1	14	September 18, 2024
Running Riva and TAO using SSH Riva	6	583	September 22, 2022
Problems when running ./riva_init.sh with custom Quartznet Model Riva	1	750	September 7, 2021
Is there an example of a node.js engine for ASR/TTS? Riva	2	767	February 27, 2023
Riva_server fails if not all Triton models are loaded Riva	6	1001	January 27, 2023
Unable to download RIVA models during riva_init.sh Riva	2	913	October 21, 2022
Riva 2.16 quick start error - riva_init.sh - invalid API key Riva ubuntu , nim	5	105	August 7, 2024
Riva_start.sh will not start the server Riva riva	4	1050	August 31, 2023
Getting Error on command bash riva_init.sh Riva	10	1035	March 28, 2023

What is the difference between Riva ASR w/wo NIM?

Related topics