Originally published at: Integrating NVIDIA Triton Inference Server with Kaldi ASR | NVIDIA Technical Blog
Speech processing is compute-intensive and requires a powerful and flexible platform to power modern conversational AI applications. It seemed natural to combine the de facto standard platform for automatic speech recognition (ASR), the Kaldi Speech Recognition Toolkit, with the power and flexibility of NVIDIA GPUs. Kaldi adopted GPU acceleration for training workloads early on. NVIDIA…
The Jupyter Notebook referred to in this post is not accessible. Would it be possible to include it in the GitHub repo rather than on your internal GitLab?
Following the instructions in the quick start guide, we were able to launch the Triton server successfully, but when running scripts/docker/launch_client.sh the client hung after outputting “Opening GRPC contextes…” (without the “done”) and before outputting “Streaming utterances…”
A quick glance at the client code suggests it is likely hanging on line 273 of kaldi-asr-client/kaldi_asr_parallel_client.cc, in the TritonASRClient constructor call.
Has anyone seen this issue before?
This problem turned out to be on our end, the result of a k8s networking issue in our on-prem k8s cluster. I was eventually able to run the Triton Kaldi ASR client against the server successfully.
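Since the root cause here was network reachability rather than the client code, a quick TCP connectivity check against the server's gRPC endpoint can save time before digging into the client. The sketch below is a generic stdlib-only check, not part of the Kaldi/Triton codebase; the host and the port (8001, Triton's default gRPC port) are assumptions you should adjust for your deployment.

```python
import socket

def port_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and attempts a full TCP handshake
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and DNS resolution failures
        return False

if __name__ == "__main__":
    # 8001 is Triton's default gRPC port; replace "localhost" with your
    # server's address (e.g. the k8s service name) as appropriate.
    if port_open("localhost", 8001):
        print("gRPC port reachable")
    else:
        print("gRPC port unreachable -- check networking before the client")
```

If this check fails from inside the client container but succeeds from the host, that points at the container or cluster networking layer, as it did in our case.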